TRANSLATOR’S PREFACE

Students of what is usually termed “General Relativity” will need no reminder
of the importance of Academician V. Fock’s work on their subject. Up to now
very little of this work has been available in English and it is a privilege to help 
to make it accessible to the English-speaking world. I am myself no specialist 
in this field and when asked to undertake the translation I approached the 
project with quite a measure of doubt. 

However, even the first cursory reading of Professor Fock’s book convinced 
me that the task would be not too difficult and both rewarding and pleasurable. 
So it proved to be. 

Professor Fock’s clear and precise style of exposition made my task an easy 
one. I found that I could translate almost literally, in striking contrast to some 
of my experience of Russian scientific writing. The only sections of the
translation, and there were not many, in which the Russian original is not followed
almost line by line, are those in which Professor Fock himself provided an 
amended text, which, incidentally, he did in English so near perfect as to require 
practically no correction. 

There are a number of reasons why my task was rewarding. The earlier parts 
of the book give an exposition of so-called “Special” Relativity, the merits of
which I am more competent to judge than those of the later chapters. I found
that I was transmitting a presentation of the greatest possible clarity and 
completeness with many an original turn to it, even in a field so often presented 
before. In subsequent chapters I naturally found more that was new and by the 
time I got to Professor Fock’s principal personal contributions to Relativity I 
found myself being guided through a most stimulating new field of thought with 
even the most lengthy and detailed calculation presented in a manner that 
never obscured the main argument. Where Professor Fock’s views are in
disagreement with the opinions of others he states his case well, and his opponents
have a formidable case to answer.

The pleasure of my task lay, of course, partly in its scientific interest to which 
I have just referred, but, in addition, the work has brought me into a most happy 
contact with a great figure in theoretical physics and it gives me much
satisfaction to testify to the ease of collaboration with Professor Fock and to the
establishment of what I feel to be a strong personal bond. May it be a token of 
the greater bond of friendship between the scientists and between the peoples 
of our two countries. 

N. Kemmer

September 1958 
TRANSLATOR’S PREFACE TO SECOND EDITION

It has been a further pleasure to help in producing this second English edition 
of Professor Fock’s book. I acknowledge with gratitude the substantial help 
given to me by Professor Fock himself in providing a complete record of all 
changes made in the original Russian text. But for this aid the work of
translation would have been a much more lengthy one.

N. Kemmer

September 1962 



PREFACE 

The aim of this book is threefold. Firstly, we intended to give a text-book on 
Relativity Theory and on Einstein’s Theory of Gravitation. Secondly, we wanted 
to give an exposition of our own researches on these subjects. Thirdly, our aim 
was to develop a new, non-local, point of view on the theory and to correct a 
widespread misinterpretation of the Einsteinian Gravitation Theory as some 
kind of general relativity. 

The Russian edition of this book* was published in 1955. In the present 
English edition only very few changes are made: some mathematical proofs are
altered (Sections 8 and 31) and some reasonings are made more precise (Sections 
61 and 93). 

It is hoped that the publication of the present edition, by making the book 
accessible to English-speaking readers, will contribute to a wider discussion of 
the author’s ideas on Gravitation Theory. 

In conclusion the author wishes to express his thanks to Professor N. Kemmer 
for his excellent translation. 

August 1958 V. Fock 

Physical Institute of the 
Leningrad University, 

U.S.S.R. 


PREFACE TO SECOND EDITION 

It is a pleasure to me that my book has received sufficient attention to make it 
necessary to publish a second edition soon after the first. This edition follows 
closely the second Russian edition published in 1961. 

The second edition differs from the first by some additions and reformulations. 
The question of the uniqueness of the mass tensor is treated in more detail 
(Section 31*) and is illustrated by two examples (Appendices B and C). The 
notion of conformal space is introduced and used as a basis for the treatment 
of Einsteinian statics (Sections 56 and 57). Greatest care has been applied to 
the formulation of the basic ideas of the theory and to the elucidation of those 
points on which the author’s views differ from the traditional (Einsteinian) ones. 
Thus, in order to discuss the general aspects of the relativity principle Section 
49* has been added. 

The author’s views on the theory are explicitly formulated in different parts 
of the book and are implicit in the reasoning throughout the whole text. Their 
general trend is to lay stress on the Absolute rather than on the Relative. 
The basic ideas of Einstein’s Theory of Gravitation are considered to be: (a) the 
introduction of a space-time manifold with an indefinite metric, (b) the
hypothesis that the space-time metric is not rigid but can be influenced by physical
processes and (c) the idea of the unity of metric and gravitation. On the other 

* V. A. Fock, Teoriya prostranstva, vremeni i tyagoteniya [The Theory of Space, Time and
Gravitation], State Publishing House of Technical and Theoretical Literature, Moscow, 1955.
hand, the principles of relativity and of equivalence are of limited application
and, notwithstanding their heuristic value, they are not unrestrictedly part of 
Einstein’s Theory of Gravitation as expressed by the gravitational equations. 

The author expresses once more his thanks to Professor Kemmer for his 
excellent translation and considers his efforts as a token of friendship. 

Leningrad, June 1962 V. A. Fock



INTRODUCTION†

From the geometrical point of view the theory of space and time naturally 
divides into the theory of uniform, Galilean, space and the theory of
non-uniform, Riemannian and Einsteinian, space.‡

Galilean space is of maximal uniformity. This means that in it: 

(a) All points in space and instants in time are equivalent. 

(b) All directions are equivalent, and 

(c) All inertial systems, moving uniformly and in a straight line relative to 
one another, are equivalent (Galilean principle of relativity). 

The uniformity of space and time manifests itself in the existence of a group 
of transformations which leave invariant the four-dimensional distance or 
interval between two points. The expression for this interval plays an important 
part in the theory of space and time because its form is directly related to the 
form taken by the basic laws of physics, viz. the law of motion of a free
mass-point and the law of propagation in free space of the front of a light wave.
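
The invariance of the interval can be checked numerically. The following is a minimal sketch, not part of the original text. It assumes the sign convention s^2 = c^2 dt^2 - dx^2 - dy^2 - dz^2 and restricts itself to uniform motion along the x axis; the function names are hypothetical and chosen only for illustration.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def interval_squared(dt, dx, dy, dz, c=C):
    """Squared interval between two events separated by (dt, dx, dy, dz).

    Assumed sign convention: s^2 = (c dt)^2 - dx^2 - dy^2 - dz^2.
    """
    return (c * dt) ** 2 - dx ** 2 - dy ** 2 - dz ** 2


def boost_x(dt, dx, dy, dz, v, c=C):
    """Separation of the same two events in a frame moving with velocity v along x."""
    gamma = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    dt_new = gamma * (dt - v * dx / c ** 2)
    dx_new = gamma * (dx - v * dt)
    return dt_new, dx_new, dy, dz


# Illustrative separation: one microsecond in time, some tens of metres in space.
event = (1e-6, 100.0, 50.0, -20.0)
s2_old = interval_squared(*event)
s2_new = interval_squared(*boost_x(*event, v=0.6 * C))
print(math.isclose(s2_old, s2_new, rel_tol=1e-9))
```

For any relative velocity smaller than c the two values should agree up to rounding error, although the separate time and space differences change.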

The indications (a), (b) and (c) of the uniformity of Galilean space are related 
to the following transformations: 

(a) To the equivalence of all points and instants corresponds the
transformation of displacing the origins of the spatial coordinates and of time; the
transformation involves four parameters, namely, the three space 
coordinates and the time coordinate of the origin. 

(b) To the equivalence of all directions corresponds the transformation of 
rotating the spatial coordinate axes; this involves three parameters, the 
three angles of rotation. 

(c) To the equivalence of inertial frames corresponds a change from one 
frame of reference to another moving uniformly in a straight line with 
respect to the first; this transformation involves three parameters, the 
three components of relative velocity. 

The most general transformation, combining (a), (b) and (c), involves
4 + 3 + 3 = 10 parameters. This is the Lorentz transformation.

It is well known that in a space of n dimensions the group of transformations 
which leave invariant the expression for the squared distance between infinitely 
near points, can contain at most $\tfrac{1}{2}n(n + 1)$ parameters. If there is a group
involving all $\tfrac{1}{2}n(n + 1)$ parameters then the space is of maximal uniformity;
it may be a space of constant curvature, or, if the curvature vanishes, a 
Euclidean or pseudo-Euclidean space. 

† In this introduction we frequently use terms and concepts which will be defined precisely
only in the subsequent text. This is somewhat inconsistent but, unfortunately,
unavoidable, for it is our aim to give in this introduction an outline of our attitude to the theory
discussed in this book. Since, however, we hope that the intending reader has some prior 
knowledge of the subject we feel no great inconvenience will be caused by this manner of 
presentation. If a reader does not fully understand what is said, he will be well advised to 
return to the introduction after reading the book. 

‡ We shall often use the word “space” as short for “space and time”.


In the case of space-time the number of dimensions is four and therefore the 
greatest possible number of parameters is ten. This is also the number of 
parameters in the Lorentz transformation, so that Galilean space, to which the 
transformation relates, is indeed of maximal uniformity. 
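
The counting argument above can be written out in a few lines. This is only an illustrative sketch added to the text; the function name and the dictionary labels are hypothetical, while the numbers are those given in items (a), (b) and (c) and in the bound n(n + 1)/2.

```python
def max_isometry_parameters(n: int) -> int:
    """Greatest possible number, n(n + 1)/2, of parameters in a group of
    transformations leaving the squared distance invariant in n dimensions."""
    if n < 1:
        raise ValueError("the number of dimensions must be positive")
    return n * (n + 1) // 2


# Parameters of the Lorentz transformation as enumerated in (a), (b) and (c).
LORENTZ_PARAMETERS = {
    "displacement of the origin of space and time": 4,
    "rotation of the spatial axes": 3,
    "relative velocity of inertial frames": 3,
}

total = sum(LORENTZ_PARAMETERS.values())
print(total, max_isometry_parameters(4))
```

Both numbers are ten, which is the sense in which Galilean space is of maximal uniformity. For a two-dimensional surface the same bound gives three.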

It is customary to call the theory based on the Lorentz transformations the 
Special Theory of Relativity. More precisely, the subject of that theory is the 
formulation of physical laws in accordance with the properties of Galilean 
space. The creator of the theory of relativity was Albert Einstein (1879-1955). 
Poincaré and Lorentz should be considered his predecessors. In this book
Chapters I-IV are devoted to the theory of Galilean space. 

Universal gravitation does not fit into the framework of uniform Galilean 
space. The deepest reason for this fact was given by Einstein. It is that not 
only the inertial mass, but also the gravitational mass of a body depends on 
its energy. 

It proved possible to base a theory of universal gravitation on the idea of 
abandoning the uniformity of space as a whole† and attributing to space only a
certain kind of uniformity in the infinitesimal. Mathematically, this meant 
abandoning Euclidean, or rather pseudo-Euclidean, geometry in favour of the 
geometry of Riemann. The modern theory of gravitation was also created by
Einstein. 

According to gravitational theory there still exists in the infinitely small a 
uniformity like the one expressed by Lorentz transformations. That this is so 
is connected with the fact that in the vicinity of a given point of space the
gravitational field can be imitated by a field of acceleration (the so-called Principle
of Equivalence). The physical basis of this is the law already known to Galileo 
that in the absence of a resistive medium all bodies fall equally fast, or, more 
accurately, with equal acceleration. The law of Galileo can be stated in
generalized form as the law of the equality of inertial and gravitational mass. It
should be stressed that this fundamental law is of a general character whereas 
the principle of equivalence is strictly local. The law of the equality of inertial 
and gravitational mass applies to the extent to which it is possible to define 
inertial and gravitational properties of mass as attributes of a particular body, 
independently of the positions and motions of other bodies. (Such
independence exists in Newton’s theory of gravitation, but does not necessarily hold
in generalizations of that theory.)

The law of equality of inertial and gravitational mass is the simple physical 
basis on which a theory of gravitation can be constructed; of course, the very 
process of constructing a theory involves introducing new generalizations. As a 
result of this the equality law may acquire the status of an approximate law, 
but this does not detract from its fundamental character. As a basis for
constructing a more general theory, the law of equality of inertial and gravitational
mass has the following advantages over the principle of equivalence: in the
first place it is not confined to local considerations, in the second it avoids
dealing with frames of reference in accelerated motion, the precise definition
of which is not easy to make.

† We use terms such as “space as a whole”, “conditions at infinity”, etc., not literally,
but in the mathematical sense accepted in field theory. By “space as a whole” we mean a
region so large that at its boundaries the field due to the systems we are considering is
negligibly small. “Conditions at infinity” refer to the boundaries of this region. The actual
dimensions of the region can differ according to the nature of the particular problem: for
an atom or a molecule a distance of the order of a micron can be considered infinitely great,
for the solar system, a light year, for the galactic system, a thousand million light years. We
never use “space as a whole” to denote the Universe. A consideration of the Universe seems
to us impossible for gnosiological reasons.

In order to construct a theory of gravitation or to apply it to physical 
problems it is, however, insufficient to study space and time only locally, i.e. 
in infinitely small regions of space and periods of time. One way or another 
one must also characterize the properties of space as a whole. If one does not 
do this, it is quite impossible to state any problem uniquely. This is particularly 
clear in view of the fact that the equations of the gravitational, or any other 
field, are partial differential equations, the solutions of which are unique only 
when initial, boundary or other equivalent conditions are given. The field 
equations and the boundary conditions are inextricably connected and the
latter can in no way be considered less important than the former. But, in 
problems relating to the whole of space, the boundary conditions refer to 
distant regions and their formulation requires knowledge of the properties of 
space as a whole. 

One should note that Einstein did not fully appreciate the inadequacy of a 
local description and the importance of boundary conditions. This is why it is 
necessary to change substantially Einstein’s statement of the basic problems 
of gravitational theory; this has been done in the author’s research and in
this book.

The simplest and at the same time the most important case is when one can 
assume space to be uniform at infinity in the sense of the Lorentz
transformation. In this case the non-uniformities arising from material bodies will have
local character. Masses and their gravitational fields will be, so to speak, 
embedded in unlimited Galilean space. This case is particularly important 
because the existence of integrals of motion is connected with the uniformity 
of space at infinity. Only if space at infinity allows the full Lorentz
transformation with ten parameters do there exist the ten integrals of motion,
including the energy integral. 

Chapters V, VI and VII of this book are almost solely devoted to the case 
of a space uniform at infinity. 

The assumption is also possible that space-time as a whole is not completely, 
but only partially, uniform; viz., that as before an arbitrary shift of the origin 
of space coordinates and an arbitrary rotation of axes are permissible, giving 
six parameters, but that four parameters of the Lorentz transformation, i.e. the 
three components of relative velocity and the origin of time reckoning, are 
fixed in terms of the first six. Such a space-time was first considered by 
Friedmann and because the spatial part of it obeys Lobachevsky’s geometry, 
it may be called the space of Friedmann-Lobachevsky. In contrast to Galilean 
space, this space allows a well-defined gravitational field to exist when the mean 
density of ponderable matter is different from zero. For this reason, one can 
surmise that when in cosmology one considers enormous regions, thousands of 
millions of light years in extension and approximately uniformly filled with 
galaxies, the space of Friedmann-Lobachevsky will be a better approximation 
to reality than Galilean space. The theory of local non-uniformities in
Friedmann-Lobachevsky space has hitherto not been developed, and we devote
only Sections 94-95 of this book to this space. Other assumptions concerning 
the properties of space as a whole are also possible, but we shall not discuss 
them. 

For the question of whether preferred systems of coordinates exist, the 
properties of space as a whole are also of primary importance. 

In Galilean space the familiar cartesian coordinates, together with time, are
preferred. This set of variables is called the set of Galilean coordinates. The 
privileged position of these coordinates arises from the fact that in terms of 
them the Lorentz transformations, which express the uniformity of space, 
prove to be linear. 

In the case of a space uniform only at infinity it also proves possible to 
introduce a preferred system of coordinates, which is well defined up to a 
Lorentz transformation: they are the harmonic coordinates. This fact, first 
proved in the author’s work, has great importance in matters of principle. 
Only with its support can one show in particular that Einstein’s gravitational
theory retains for the heliocentric Copernican system a preferred position
compared to the geocentric Ptolemaic one. A more detailed exposition of this
is given in Sections 92-93. All concrete problems in gravitational theory 
discussed in this book are solved in harmonic coordinates. This ensures that 
the solutions are unique. 

It is probable that in Friedmann-Lobachevsky space there also exist some 
preferred systems of coordinates. However, this question has not been
investigated; even the theory of local non-uniformity in this space has not as yet
been formulated. 

On the question of the existence of privileged systems of coordinates Einstein, 
the founder of gravitational theory, maintained a point of view opposite to 
ours, denying their existence in all cases. This is connected with his
aforementioned preference for the local method of discussing properties of space
(which method is the basis of Riemannian geometry), and his underestimation 
of the importance of considering space as a whole. Undoubtedly the
philosophical attitude of Einstein, influenced throughout his life by the ideas of
Mach, played its part in this. 

An important place in the theory of space and time is taken up by the question 
of different coordinate systems and the change in the appearance of equations 
on going from one such system to another. 

Great importance is usually attached to the covariance of equations. By 
this is meant the following: We consider a transformation of the coordinates 
accompanied by a transformation of dependent variables, or functions, according 
to some specified rule, for instance, the tensor rule. We examine the appearance 
of the equations satisfied by the original and the transformed functions. If, as 
a result of the transformation, the new functions of the new variables satisfy 
equations of the same appearance as the old functions of the old variables, the 
equations are called covariant. Covariance permits one to write down equations 
without presupposing any particular coordinate system. In addition, the 
requirement of covariance of equations has great heuristic value because it 
limits the variety of possible forms of equations and thereby makes it easier
to choose correct ones. However, one should stress that the equations can be so
be limited only under the necessary condition that the number of functions 
introduced is also limited; if one permits the introduction of an arbitrary 
number of new auxiliary functions, practically any equation can be given 
covariant form. 

Thus, covariance of equations in itself is in no way the expression of any 
kind of physical law. For instance, in the mechanics of systems of mass-points 
Lagrange’s equations of the second kind are covariant with respect to arbitrary 
transformations of the coordinates, although they do not express any new 
physical law compared to, for instance, Lagrange’s equations of the first kind, 
which are stated in cartesian coordinates and are not covariant. 

In the case of Lagrange’s equations covariance is achieved by introducing as 
new auxiliary functions the coefficients of the Lagrangian considered as an 
expression quadratic, but not necessarily homogeneous, in the velocities.

In Riemannian geometry the new functions at our disposal are the coefficients 
$g_{\mu\nu}$ of the quadratic form for the squared infinitesimal distance. The law
according to which these functions transform in passing to a new coordinate 
system follows from their definition as coefficients of a quadratic form, together 
with the condition that this form is an invariant; in the following we shall 
always assume that a transformation of the coordinates is accompanied by a 
transformation of the $g_{\mu\nu}$ according to this law. The set of quantities $g_{\mu\nu}$ is
called the metric tensor.
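
The transformation law that follows from the invariance of the quadratic form can be illustrated numerically. The sketch below is an addition to the text and rests on stated assumptions: it uses the Euclidean plane rather than space-time, takes the passage from cartesian to polar coordinates as the example, and uses hypothetical function names. Requiring that $g_{\mu\nu}\,dx^{\mu}dx^{\nu}$ be invariant gives the new coefficients as sums of products of the old coefficients with the partial derivatives of the old coordinates with respect to the new ones.

```python
import math


def transform_metric(g, jacobian):
    """Coefficients of the quadratic form in the new coordinates.

    jacobian[a][m] is the partial derivative of old coordinate a with
    respect to new coordinate m; the result is
    g_new[m][v] = sum over a, b of jacobian[a][m] * jacobian[b][v] * g[a][b].
    """
    n = len(g)
    return [[sum(jacobian[a][m] * jacobian[b][v] * g[a][b]
                 for a in range(n) for b in range(n))
             for v in range(n)]
            for m in range(n)]


def polar_jacobian(r, phi):
    """Derivatives of (x, y) = (r cos phi, r sin phi) with respect to (r, phi)."""
    return [[math.cos(phi), -r * math.sin(phi)],
            [math.sin(phi), r * math.cos(phi)]]


euclidean = [[1.0, 0.0], [0.0, 1.0]]  # ds^2 = dx^2 + dy^2
g_polar = transform_metric(euclidean, polar_jacobian(r=2.0, phi=0.3))
print(g_polar)
```

Up to rounding, the result is the familiar form ds^2 = dr^2 + r^2 dphi^2 evaluated at r = 2. The coefficients look different, but they are obtained from the Euclidean ones by a coordinate transformation and describe the same plane.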

Having introduced the metric tensor, one can form expressions that are 
covariant with respect to any coordinate transformation. If we consider only 
those metric tensors which are obtainable from a particular one (e.g. from the 
Galilean tensor) by coordinate transformation, this will give us nothing new 
except the fact that our equations are covariant. But if we extend the
discussion to metric tensors of a more general form, tensors which cannot be
transformed into one another by coordinate transformations, the generalization 
so obtained is an essential one. In this case the metric tensor will express not 
only properties of the coordinate system but also properties of space, and these 
latter can be related to the phenomenon of gravitation. It is Einstein’s idea 
that the description of the gravitational field demands the introduction of no 
functions other than the metric tensor itself. This idea of the unity of metric 
and gravitation is the very basis of Einstein’s theory of gravitation. It leads 
almost uniquely to the equations of the gravitational field found by Einstein. 

Having clarified the concept of covariance as applied to Riemannian
geometry, let us now consider it together with the previously discussed concept of
the uniformity of space. As was shown above, the property of uniformity in 
Galilean space manifests itself in the existence of transformations that leave 
unchanged the expression for the four-dimensional distance between two points. 
More precisely, these transformations leave unchanged the coefficients of this 
expression, i.e. the quantities $g_{\mu\nu}$. If the $g_{\mu\nu}$ are functions of the coordinates we
mean by this that the mathematical form of these functions is unchanged: the
dependence of the new $g_{\mu\nu}$ on the new coordinates has the same mathematical
form as that of the old $g_{\mu\nu}$ on the old coordinates. In the general case of
Riemannian geometry there are no transformations that leave the $g_{\mu\nu}$ unchanged
because Riemannian space is not uniform. In Riemannian geometry one deals
with transformations of coordinates accompanied by transformations of the $g_{\mu\nu}$
and neither such a combined transformation nor covariance with respect to it
has any relation to the uniformity or non-uniformity of space.†

These statements can be illustrated by a simple example.

Let us consider the surface of a sphere. This surface can be transformed into 
itself by means of rotations by any angle around any axis passing through the
centre. Such a transformation involves three parameters, and since the surface
of the sphere is two-dimensional, so that three is the greatest possible number, it
follows from the remarks at the beginning of this introduction that the surface is
of maximal uniformity. On the surface of a sphere
there are no preferred points or preferred directions. 

Let us now consider a more general non-spherical surface of revolution. It also 
can be transformed into itself by rotation about an axis, but now the direction 
of the axis is fixed and only the angle of rotation remains arbitrary, so that the
allowed transformations involve only a single parameter. Certain points on the 
surface will be singled out (the poles through which the axis passes) and so will 
certain directions (meridians and latitude circles). Finally, if we consider a surface 
of general form, there will be no transformations taking it into itself. Thus the 
sphere, a surface of a very particular form, possesses the greatest degree of 
uniformity (3 parameters), a surface of revolution has partial uniformity (1
parameter), while a surface of general form possesses no uniformity whatsoever.
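
The contrast between the sphere and a general surface of revolution can be tried out numerically. The following sketch is an addition to the text and makes several assumptions: the surface of revolution is taken to be a spheroid with the z axis as its axis of revolution, only rotations about the z and x axes are tested rather than all possible axes, and every function name is hypothetical.

```python
import math


def spheroid_point(a, b, theta, phi):
    """Point on the surface (x^2 + y^2)/a^2 + z^2/b^2 = 1 (a sphere if a == b)."""
    return (a * math.sin(theta) * math.cos(phi),
            a * math.sin(theta) * math.sin(phi),
            b * math.cos(theta))


def on_spheroid(p, a, b, tol=1e-12):
    """True if the point p satisfies the equation of the surface within tol."""
    x, y, z = p
    return abs((x * x + y * y) / (a * a) + z * z / (b * b) - 1.0) < tol


def rotate_about_z(p, angle):
    """Rotation about the z axis, the axis of revolution."""
    x, y, z = p
    c, s = math.cos(angle), math.sin(angle)
    return (c * x - s * y, s * x + c * y, z)


def rotate_about_x(p, angle):
    """Rotation about the x axis, which is not an axis of symmetry when a != b."""
    x, y, z = p
    c, s = math.cos(angle), math.sin(angle)
    return (x, c * y - s * z, s * y + c * z)


p_sphere = spheroid_point(1.0, 1.0, 0.7, 0.3)    # a = b: sphere
p_spheroid = spheroid_point(1.0, 2.0, 0.7, 0.3)  # a != b: surface of revolution

sphere_ok = (on_spheroid(rotate_about_z(p_sphere, 0.5), 1.0, 1.0)
             and on_spheroid(rotate_about_x(p_sphere, 0.5), 1.0, 1.0))
axis_ok = on_spheroid(rotate_about_z(p_spheroid, 0.5), 1.0, 2.0)
tilted_ok = on_spheroid(rotate_about_x(p_spheroid, 0.5), 1.0, 2.0)
print(sphere_ok, axis_ok, tilted_ok)
```

The rotated point stays on the sphere for both axes, while on the spheroid it stays on the surface only for the rotation about the axis of revolution.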

We see that the generality of the form of the surface is a concept antagonistic 
to the concept of uniformity. This conclusion remains valid for the geometry of 
the 4-dimensional space-time manifold: the generality of its geometry is
antagonistic to its uniformity.

We shall now clarify the meaning given to the concept of relativity in the 
theory of space and time, and will show that it is extremely closely related to
the concept of uniformity. 

When speaking of the relativity of a frame of reference or simply of relativity, 
one usually means that there exist identical physical processes in different 
frames of reference (for more detail see Sections 6 and 49*). According to the 
generalized Galilean principle of relativity identical processes are possible in all 
inertial frames of reference related by Lorentz transformations. On the other 
hand, Lorentz transformations characterize the uniformity of Galilean
space-time. Thus the principle of relativity is directly related to uniformity. This
also shows that the nomenclature introduced in Einstein’s first papers, by
which the theory of uniform Galilean space is named “Theory of Relativity”,‡
can to some extent be justified.

In the following we shall see that relativity in the sense defined above is 
related to uniformity in all those cases in which the space-time metric can be 
considered fixed. This can be done not only in the theory of Galilean space, 
where the metric is given once and for all, but also in the theory of Riemann 
and Einstein space, provided only the processes under discussion have no
practical influence on the metric. In these cases relativity is uniquely related to
uniformity: in uniform Galilean space it exists, in non-uniform Riemannian space
it is absent. If, however, the discussion is to include processes which themselves
essentially influence the metric (displacement of heavy masses) it will be shown
at the end of the book (Section 93) that relativity can to a certain extent be
retained, even in inhomogeneous space, albeit only in those cases in which the
non-uniformity produced by heavy masses may be treated as a local perturbation
in infinite Galilean space.

† These ideas were expressed by E. Cartan [1].

‡ The name Chronogeometry suggested by A. D. Fokker [1a] would be more appropriate.

The preservation of a certain degree of relativity, even in non-uniform space, 
is due to the fact that in going over to a new frame of reference the previous 
gravitational field is replaced by a new one, which has the same form in the new 
frame as the old in the old frame. Thus, if a laboratory on the Earth is turned 
upside down, the processes in it will be disturbed (there is no relativity) but if 
the reversal is accompanied by a parallel transport of the laboratory to the
antipodes, the course of all processes in it will be the same as in the beginning
(relativity is maintained).

Thus, even in non-uniform space the still existing relativity is indirectly 
connected with uniformity, namely with the uniformity at infinity if one
discusses relativity “in the large”, and with local uniformity in a geodesic
coordinate system if one is discussing relativity “in the infinitely small”. We
have seen, however, that uniformity stands in an antagonistic relation to the 
degree of generality of the geometry; this is why relativity either does not exist 
at all in a non-uniform space with Riemannian geometry, or else it does exist, 
but does not go beyond the relativity of Galilean space. But there can be no 
question of a generalization of the concept of relativity in going over to
non-uniform space.

However, when Einstein created his theory of gravitation, he put forward 
the term “general relativity” which confused everything. This term was
adopted in the sense of “general covariance”, i.e. in the sense of the covariance
of equations with respect to arbitrary transformations of coordinates
accompanied by transformations of the $g_{\mu\nu}$. But we have seen that this kind of
covariance has nothing to do with the uniformity of space, while in one way or
another relativity is connected with uniformity. This means that “general
relativity” has nothing to do with “relativity as such”. At the same time the
latter received the name “special” relativity, which purports to indicate that
it is a special case of “general” relativity.

To show what misunderstandings resulted we shall consider some examples. 
As will be shown in Chapter IV, the theory of uniform Galilean space can be
formulated not only in a way covariant with respect to Lorentz transformations,
but also in a generally covariant manner. In the language of “general” and
“special” relativity it would be extremely difficult to express this simple idea,
and we shall not attempt to do so, for we would have to say that “special”
relativity includes “general” relativity or something of that kind.

Remembering that even in Newtonian mechanics one deals with the generally 
covariant Lagrange equations of the second kind, one would also have to say 
that Newtonian mechanics contains in itself “general” relativity.

It has become a convention, since Einstein, to use the term “general”
relativity or “the general principle of relativity” in the sense of “theory of
gravitation”. Einstein’s fundamental paper on the theory of gravitation (1916)
is already entitled “Foundations of the General Theory of Relativity”. This
confuses the issue still further because the words “general” and “relativity”
are not used with their proper meaning. In the theory of gravitation, space is
assumed non-uniform whereas relativity relates to uniformity so that it appears
that in the general theory of relativity there is no relativity.

Enough has been said to make clear that the use of the terms “general
relativity”, “general theory of relativity” or “general principle of relativity”
should not be admitted. This usage is inconvenient because it not only leads
to misunderstanding, but also reflects an incorrect understanding of the theory
itself. However paradoxical this may seem, Einstein, himself the author of
the theory, showed such a lack of understanding when he named his theory and
when in his discussions he stressed the term “general relativity”, not seeing
that in the new theory he had created, the notion of relativity was not among 
the concepts subjected to generalization. 

In the present book we do not use the term “ general relativity ”. Following 
established practice, we call the theory of Galilean space the Theory of Relativity, 
but without the adjective “ special ”. We call the theory of Einstein space the 
Theory of Gravitation, not the “ general theory of relativity ”, because the 
latter name is nonsensical. 

* * * * 

The philosophical side of our views on the theory of space, time and gravitation 
was formed under the influence of the philosophy of dialectical 
materialism, in particular under the influence of Lenin’s “ Materialism and 
Empiriocriticism ”. The teachings of dialectical materialism helped us to 
approach critically Einstein’s point of view concerning the theory created by 
him and to think it out anew. It helped us also to understand correctly, and to 
interpret, the new results we ourselves obtained. We wish to state this here, 
although this book does not explicitly touch philosophical questions. 

CHAPTER 1 


THE THEORY OF RELATIVITY 

1. Coordinates of Space and Time 

Space and time are primary concepts. In the philosophical sense they are 
the forms of existence of matter. This can be understood as meaning that the 
concepts of space and time are obtained by appropriate abstractions from the 
concept of spatio-temporal relations between material processes. The simplest 
notion relating to space and time is a point in space considered at a particular 
instant in time. To “ mark ” a point in space one must place there a material 
body of sufficiently small dimensions. The position of this body can only be 
fixed relative to other material bodies since there does not exist any cartographic 
grid inherent in space and independent of material bodies. When a frame of 
reference, or “ base ”, is chosen, i.e. a system of material bodies relative to 
which the position of the given point body is measured, that position can be 
fixed in terms of three coordinates relating to a particular instant in time, 
reckoned according to a clock at the base. Generally speaking coordinates are 
auxiliary quantities characterizing the disposition of bodies relative to the frame. 
They allow one to calculate according to the laws of Euclidean geometry the 
relative positions of the bodies, in particular their distances apart and the angles 
between the directions of lines joining them. Usually one takes the coordinates 
to be rectilinear and orthogonal because such coordinates are simplest to relate 
to lengths and distances, but any other (curvilinear) coordinates are admissible, 
for instance, two angles defining the direction towards a point-body and the 
distance to it. 

One should stress that both the coordinates themselves and the distances, 
angles, etc., calculated from them in describing the relative positions of bodies 
have a definite meaning only when a particular base is postulated. In the same 
way instants in time to which the coordinates and the distances refer, and 
likewise time differences, become well-defined only when a definite spatial base 
and a definite time-measure in that base are assumed, i.e. a definite frame of 
reference. 

In this way the variables x, y, z and t, the space and time coordinates with 
which we shall deal in the following, are related to a base. 

2. The Position of a Body in Space at a given Instant, in a Fixed 
Reference Frame 

Let us first disregard time and consider the usual methods for determining 
the situation of objects in space. The logical foundation of these methods is, 
in principle, the hypothesis that Euclidean geometry is applicable to real 
physical space together with two further assumptions, viz. that rigid bodies 
exist and that light travels in straight lines. In fact, to find the position of a 
distant object one has to use a rigid graduated rule to map out a definite base 
in the sense of ordinary triangulation, and a light ray to take bearings on the 
object at different points of the base. Assuming that the light rays are straight 
lines the laws of Euclidean geometry then allow one to calculate the distance 
to the object and all other data characterizing its position. The rectilinear 
nature of light rays in vacuo is a basic postulate. In the atmosphere light rays 
are only approximately straight and corrections for refraction, etc., must be 
made. The validity of the laws of Euclidean geometry in the real physical 
world should be taken as an experimental fact, not as an a priori assumption. 
Indeed, although these laws can be verified with an enormous degree of accuracy, 
Einstein’s theory of Universal Gravitation is based on the consideration of 
small departures from them. 

Thus the properties of light and the properties of rigid bodies play a basic 
part in establishing the geometry of real physical space. 

However, one must note that in this connection the concept of a rigid body is, 
to some extent, auxiliary. Absolutely rigid bodies do not exist; one can take 
actual physical bodies to be rigid and to possess invariable geometric dimensions 
only as an approximation and only in certain conditions, namely at constant 
temperature, in the absence of elastic vibrations, etc. The invariability of a 
standard of length can be checked with greatest accuracy by comparing it with 
the wave length of some definite spectral line. Thus the notion of length ultimately 
depends on the properties of atoms or molecules radiating the spectral 
line and on the properties of light. 

Another method for determining the disposition of objects in space, which 
is in principle different from triangulation, is radiogeodesy or radar. Basically 
this method consists of emitting radio signals from a certain point and reflecting 
them from the object to be observed so that they return to the point of origin. 
The time elapsing during the passage of the signal there and back is noted 
and also, of course, the direction. Knowing the speed of propagation of a radio 
signal, which is in fact equal to the speed of light, one obtains the distance to the 
object as the product of the speed and half the time taken by the signal. 

As a matter of principle this method is important because it reduces the 
measurement of length to a measurement of time intervals and does not make 
use of the properties of absolutely rigid bodies. The essential assumption is 
that the speed of light is constant. Here this speed plays the part of a conversion 
factor from time to length. Its numerical value must be determined by 
other experiments and these do have to use a standard of length. 

In principle, a time measurement can be performed with the aid of any 
periodic process. At present the clocks of greatest precision rely on the proper 
vibrations of a quartz crystal or of an ammonia molecule. In practice astronomical 
time measurements are used which rely on applying Newton’s laws of 
motion to the rotation of the earth, with due observance of all deviations from 
uniform rotation that the theory predicts, such as nutation, etc. 

Such methods of time measurement make it possible to regulate a clock at 
the base. 

When measuring the position of moving bodies from a fixed base the question 
arises: to what instant, by the clock at base, does the measured value of distance 
or any other spatial coordinate refer? We accept the following definition: If 
the light signal (or the signal from the radar station) is sent off at a time $t_1$ 
and returns at $t_2$, then the value of the coordinates x, y, z of the object and also 
the distance $r = c(t_2 - t_1)/2$ refer to the instant $t = (t_1 + t_2)/2$. This definition 
is in accord with the natural assumption that the speeds of light to and fro 
are equal. 
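
The definition just given can be sketched numerically. The function name `radar_fix` is hypothetical and is introduced only for this illustration; the value of c is the modern defined value in metres per second, and the equality of the speeds to and fro is assumed, as in the text.

```python
C = 299_792_458.0  # speed of light in vacuo, in metres per second

def radar_fix(t1, t2, c=C):
    """Return (r, t) for a signal sent at t1 and received back at t2.

    Both times are read on the clock at the base. r is the distance to the
    object and t is the base-clock instant to which r refers.
    """
    if t2 < t1:
        raise ValueError("the signal cannot return before it is sent")
    r = c * (t2 - t1) / 2.0  # half the round-trip time, converted to length
    t = (t1 + t2) / 2.0      # midpoint: equal speeds to and fro are assumed
    return r, t

# A round trip lasting two milliseconds on the base clock
r, t = radar_fix(0.0, 2.0e-3)
print(r, t)
```

Note that only time readings at the base enter the calculation; no rigid measuring rod is used, which is the point of principle made above.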

We have shown that, starting from a definite base and equipped with graduated 
rules, clocks and other necessary apparatus, we can measure out the 
position of bodies in space with reference to the base and refer the resulting 
values of coordinates, such as the cartesian ones x, y and z, to a definite instant t, 
which also refers to a clock at base. In the following we shall call this kind of 
base a reference frame. We shall still use the word “base” in those cases when it 
is desirable to stress the fact already pointed out at the beginning of Section 1, 
namely, that a reference frame is not some kind of coordinate grid in space 
together with some “ universal time ”, but is connected with rules and clocks 
located at a definite place and moving in a definite way. 

3. The Law of Propagation of an Electromagnetic Wave Front 

The laws of propagation of light in empty space are thoroughly understood. 
They find their expression in the well-known equations of Maxwell 

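$$
\begin{aligned}
\operatorname{curl}\mathbf{H} - \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} &= 0, & \operatorname{div}\mathbf{E} &= 0 \\
\operatorname{curl}\mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{H}}{\partial t} &= 0, & \operatorname{div}\mathbf{H} &= 0
\end{aligned}
\tag{3.01}
$$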

where E and H are the vectors of electric and magnetic field intensity. However, 
we are not interested in the general case of light propagation, but only in the 
propagation of a signal advancing with maximum speed, i.e. the propagation 
of a wave front. Ahead of the front of the wave all components of the field 
vanish. Behind it some of them are different from zero. Therefore, some of 
the field components must be discontinuous at the front. 

On the other hand, given the field on some surface moving in space, the 
derivatives of the field on the surface are, in general, determined by Maxwell’s 
equations. Hence the value of the field at an infinitely near surface is also 
determined and discontinuities of the field are impossible. The only case when 
this is not so is when the form and motion of the given surface satisfies certain 
special conditions subject to which the value of the derivatives is not determined 
by the values of the field components themselves. Such a surface is called a 
characteristic surface or, briefly, a characteristic. Thus, discontinuities of the 
field can occur only on a characteristic, but since there must certainly be discontinuities 
at a wave front, such a front is clearly a characteristic. 

Let us determine the equation of a characteristic for the system of Maxwell’s 
equations. 

Let the value of the field be given for those points and instants whose 
coordinates are related by the equation 

$$
t = \frac{1}{c} f(x, y, z) \tag{3.02}
$$

In particular, if $f \equiv 0$ this amounts to stating initial conditions. Equation (3.02) 
may be looked upon as the equation of a certain hypersurface in the four-dimensional 
space-time manifold. When $(\operatorname{grad} f)^2 > 1$ the same equation can be 
considered as the equation of an ordinary surface moving through space (see 
also Section 35). Assume that on the hypersurface (3.02) the values of a certain 
function u are given 

$$
u\left(x, y, z, \tfrac{1}{c} f(x, y, z)\right) = u_0(x, y, z) \tag{3.03}
$$


The following combinations of derivatives with respect to space and time 
coordinates will thereby also be given 


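$$
\frac{\partial u}{\partial x_i} + \frac{1}{c}\frac{\partial u}{\partial t}\frac{\partial f}{\partial x_i} = \frac{\partial u_0}{\partial x_i} \qquad (i = 1, 2, 3) \tag{3.04}
$$

Here we have used $x_1, x_2, x_3$ for brevity instead of x, y, z. If, in addition, the 
derivative $\partial u/\partial t$ were given, the values of all the derivatives of u at the surface 
would be known. 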
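
Relation (3.04) is simply the chain rule applied to u evaluated on the hypersurface t = f/c. A minimal numerical sketch follows; the particular functions `f` and `u`, the units with c = 1 and the helper `ddx` are all assumptions chosen only for illustration and do not come from the text.

```python
import math

C = 1.0  # units in which c = 1 (an assumption made for this illustration)

def f(x, y, z):
    # hypothetical hypersurface, written as t = f(x, y, z) / c
    return 0.3 * x + 0.1 * math.sin(y) + 0.2 * z * z

def u(x, y, z, t):
    # hypothetical smooth function of the space and time coordinates
    return math.sin(x + 2.0 * y - z) * math.cos(3.0 * t) + x * t

def u0(x, y, z):
    # values of u given on the hypersurface
    return u(x, y, z, f(x, y, z) / C)

def ddx(g, args, i, h=1e-5):
    """Central-difference derivative of g with respect to its i-th argument."""
    a, b = list(args), list(args)
    a[i] += h
    b[i] -= h
    return (g(*a) - g(*b)) / (2.0 * h)

def residual_304(x, y, z, i):
    """Left-hand side minus right-hand side of (3.04) for coordinate index i."""
    t = f(x, y, z) / C
    lhs = ddx(u, (x, y, z, t), i) + ddx(u, (x, y, z, t), 3) * ddx(f, (x, y, z), i) / C
    rhs = ddx(u0, (x, y, z), i)
    return lhs - rhs

for i in range(3):
    print(i, residual_304(0.4, -0.7, 1.1, i))
```

The residuals vanish up to the error of the finite differences, whatever the choice of the smooth functions f and u.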

Let us take as the function u one of the components of electromagnetic field, 
for instance $E_x$, and denote by $E_x^0 = E_x^0(x, y, z)$ the value of this component 
on the hypersurface. We shall use the same notation (superfix 0) for the other 
components. If the field is given on the hypersurface, the whole set of quantities 
$\mathbf{E}^0$ and $\mathbf{H}^0$ can be considered as known functions of x, y and z. As in (3.04) we 
shall have for instance 

$$
\frac{\partial E_x}{\partial z} + \frac{1}{c}\frac{\partial E_x}{\partial t}\frac{\partial f}{\partial z} = \frac{\partial E_x^0}{\partial z} \tag{3.05}
$$

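whence 

$$
\operatorname{div}\mathbf{E} + \frac{1}{c}\left(\operatorname{grad} f \cdot \frac{\partial\mathbf{E}}{\partial t}\right) = \operatorname{Div}\mathbf{E}^0 \tag{3.06}
$$

$$
\operatorname{curl}\mathbf{E} + \frac{1}{c}\left[\operatorname{grad} f \times \frac{\partial\mathbf{E}}{\partial t}\right] = \operatorname{Curl}\mathbf{E}^0 \tag{3.07}
$$

and also 

$$
\operatorname{div}\mathbf{H} + \frac{1}{c}\left(\operatorname{grad} f \cdot \frac{\partial\mathbf{H}}{\partial t}\right) = \operatorname{Div}\mathbf{H}^0 \tag{3.08}
$$

$$
\operatorname{curl}\mathbf{H} + \frac{1}{c}\left[\operatorname{grad} f \times \frac{\partial\mathbf{H}}{\partial t}\right] = \operatorname{Curl}\mathbf{H}^0 \tag{3.09}
$$

In the last equations we have denoted the operators div and curl when applied 
to $\mathbf{E}^0$ and $\mathbf{H}^0$ by Div and Curl. 

The values of the six functions $\mathbf{E}^0$ and $\mathbf{H}^0$ are not independent: they must 
satisfy two relations which we shall now give. 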

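Taking the scalar product of equations (3.07) and (3.09) with $\operatorname{grad} f$ and 
using Maxwell equations (3.01) we get 

$$
(\operatorname{grad} f \cdot \operatorname{Curl}\mathbf{E}^0) = -\frac{1}{c}\left(\operatorname{grad} f \cdot \frac{\partial\mathbf{H}}{\partial t}\right) \tag{3.10}
$$

$$
(\operatorname{grad} f \cdot \operatorname{Curl}\mathbf{H}^0) = \frac{1}{c}\left(\operatorname{grad} f \cdot \frac{\partial\mathbf{E}}{\partial t}\right) \tag{3.11}
$$

Now the first terms on the left-hand sides of both equations (3.08) and (3.06) 
vanish by virtue of Maxwell’s equations and the second terms are the same as 
the right-hand sides of (3.10) and (3.11). We have, therefore, 

$$
\operatorname{Div}\mathbf{H}^0 + (\operatorname{grad} f \cdot \operatorname{Curl}\mathbf{E}^0) = 0 \tag{3.12}
$$

$$
\operatorname{Div}\mathbf{E}^0 - (\operatorname{grad} f \cdot \operatorname{Curl}\mathbf{H}^0) = 0 \tag{3.13}
$$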


These are the conditions which the given functions $\mathbf{E}^0$ and $\mathbf{H}^0$ must satisfy. 
In the case that f = const., we have to choose initial values in the ordinary way 
and then the conditions amount to the obvious requirement that div E and 
div H must both vanish at the initial instant. 

Using Maxwell’s equations we can write (3.07) and (3.09) in the form 


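$$
\frac{\partial\mathbf{E}}{\partial t} + \left[\operatorname{grad} f \times \frac{\partial\mathbf{H}}{\partial t}\right] = c\operatorname{Curl}\mathbf{H}^0 \tag{3.14}
$$

$$
-\frac{\partial\mathbf{H}}{\partial t} + \left[\operatorname{grad} f \times \frac{\partial\mathbf{E}}{\partial t}\right] = c\operatorname{Curl}\mathbf{E}^0 \tag{3.15}
$$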


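Multiplying these equations vectorially by $\operatorname{grad} f$ and using the relations just 
deduced, we transform them into: 

$$
\{1 - (\operatorname{grad} f)^2\}\,\frac{\partial\mathbf{E}}{\partial t} = \mathfrak{E}^0 - (\operatorname{grad} f \cdot \mathfrak{E}^0)\operatorname{grad} f - [\operatorname{grad} f \times \mathfrak{H}^0] \tag{3.16}
$$

$$
\{1 - (\operatorname{grad} f)^2\}\,\frac{\partial\mathbf{H}}{\partial t} = \mathfrak{H}^0 - (\operatorname{grad} f \cdot \mathfrak{H}^0)\operatorname{grad} f + [\operatorname{grad} f \times \mathfrak{E}^0] \tag{3.17}
$$

Here we have written for brevity 

$$
\mathfrak{E}^0 = c\operatorname{Curl}\mathbf{H}^0 ; \qquad \mathfrak{H}^0 = -c\operatorname{Curl}\mathbf{E}^0 \tag{3.18}
$$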
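
The elimination leading to (3.16) can be written out as follows. The shorthand $\mathbf{g} = \operatorname{grad} f$ is not in the text and is introduced here only to shorten the formulae; the Fraktur letters are the abbreviations (3.18). Equation (3.17) follows in the same way with the roles of the two equations interchanged.

```latex
% Shorthand (an assumption of this sketch): \mathbf{g} = \operatorname{grad} f
\begin{align*}
% (3.14) and (3.15), the latter multiplied by -1, in the notation (3.18)
&\frac{\partial\mathbf{E}}{\partial t}
  + \Big[\mathbf{g}\times\frac{\partial\mathbf{H}}{\partial t}\Big] = \mathfrak{E}^0 ,
\qquad
 \frac{\partial\mathbf{H}}{\partial t}
  - \Big[\mathbf{g}\times\frac{\partial\mathbf{E}}{\partial t}\Big] = \mathfrak{H}^0 \\
% vector product of the second equation with g, using
% [a x [b x c]] = b (a.c) - c (a.b)
&\Big[\mathbf{g}\times\frac{\partial\mathbf{H}}{\partial t}\Big]
  = [\mathbf{g}\times\mathfrak{H}^0]
  + \mathbf{g}\Big(\mathbf{g}\cdot\frac{\partial\mathbf{E}}{\partial t}\Big)
  - \mathbf{g}^2\,\frac{\partial\mathbf{E}}{\partial t} \\
% relation (3.11) multiplied by c
&\Big(\mathbf{g}\cdot\frac{\partial\mathbf{E}}{\partial t}\Big)
  = c\,(\mathbf{g}\cdot\operatorname{Curl}\mathbf{H}^0)
  = (\mathbf{g}\cdot\mathfrak{E}^0) \\
% substitution into the first equation gives (3.16)
&(1-\mathbf{g}^2)\,\frac{\partial\mathbf{E}}{\partial t}
  = \mathfrak{E}^0 - (\mathbf{g}\cdot\mathfrak{E}^0)\,\mathbf{g}
  - [\mathbf{g}\times\mathfrak{H}^0]
\end{align*}
```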


The functions on the right-hand side of (3.16) and (3.17) are all known. 
These equations can be solved for the time derivatives if their coefficient, i.e. 
the quantity $1 - (\operatorname{grad} f)^2$, is not zero. In that case equations analogous to 
(3.05) lead to finite values of all the other first derivatives of the field, so that 
the field itself will be continuous on the surface (3.02). If there is to be a 
discontinuity there, the coefficient of the time derivatives must thus necessarily 
be zero, giving the condition 

$$
(\operatorname{grad} f)^2 = 1 \tag{3.19}
$$

A surface satisfying this equation will be a characteristic. If we write the 
equation of a surface without solving for time, but instead in the form 

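$$
\omega(x, y, z, t) = 0 \tag{3.20}
$$

the equation (3.19) for the characteristic takes on the form 

$$
\frac{1}{c^2}\left(\frac{\partial\omega}{\partial t}\right)^2 - (\operatorname{grad}\omega)^2 = 0 \tag{3.21}
$$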

Therefore the propagation of an electromagnetic wave front satisfies this 
equation (3.21). 
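
The passage from (3.19) to (3.21) is an implicit differentiation, which may be sketched as follows. It is assumed here that $\partial\omega/\partial t \neq 0$ on the surface, so that (3.20) can be solved for t in the form (3.02).

```latex
\begin{align*}
% (3.20) holds identically when t = f(x, y, z)/c is inserted
&\omega\big(x, y, z, \tfrac{1}{c} f(x, y, z)\big) = 0 \\
% differentiate with respect to the space coordinates
&\operatorname{grad}\omega + \frac{1}{c}\,\frac{\partial\omega}{\partial t}\,\operatorname{grad} f = 0
\quad\Longrightarrow\quad
\operatorname{grad} f = -\,c\,\frac{\operatorname{grad}\omega}{\partial\omega/\partial t} \\
% insert into (3.19) and divide by c^2
&(\operatorname{grad} f)^2 = 1
\quad\Longrightarrow\quad
\frac{1}{c^2}\Big(\frac{\partial\omega}{\partial t}\Big)^2 - (\operatorname{grad}\omega)^2 = 0
\end{align*}
```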

Particular functions satisfying the equation (3.19) or (3.21) are 

$$
t = \frac{1}{c}(\alpha x + \beta y + \gamma z) \qquad (\alpha^2 + \beta^2 + \gamma^2 = 1) \tag{3.22}
$$

$$
t = t_0 + \frac{1}{c}\sqrt{(x - x_0)^2 + (y - y_0)^2 + (z - z_0)^2} \tag{3.23}
$$

The former gives a plane wave front, the latter a spherical one. 
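
That the plane and the spherical fronts satisfy the condition (grad f)^2 = 1 can be checked numerically. The sketch below uses central differences; the direction cosines, the centre of the sphere, the value of c t_0 and the function names are all chosen arbitrarily for the illustration.

```python
import math

def grad_squared(f, p, h=1e-6):
    """Return (grad f)^2 at the point p = (x, y, z) by central differences."""
    total = 0.0
    for i in range(3):
        a, b = list(p), list(p)
        a[i] += h
        b[i] -= h
        total += ((f(*a) - f(*b)) / (2.0 * h)) ** 2
    return total

# Plane front: f = alpha x + beta y + gamma z with alpha^2 + beta^2 + gamma^2 = 1
ALPHA, BETA, GAMMA = 2.0 / 7.0, 3.0 / 7.0, 6.0 / 7.0

def f_plane(x, y, z):
    return ALPHA * x + BETA * y + GAMMA * z

# Spherical front: f = c t0 + distance from the centre (x0, y0, z0)
X0, Y0, Z0 = 1.0, -2.0, 0.5
C_T0 = 3.0  # the constant c * t0, in the same units as the coordinates

def f_sphere(x, y, z):
    return C_T0 + math.sqrt((x - X0) ** 2 + (y - Y0) ** 2 + (z - Z0) ** 2)

# For contrast, a surface that is not a characteristic: (grad f)^2 = 4
def f_slow(x, y, z):
    return 2.0 * x

point = (0.3, 1.2, -0.8)
for name, f in (("plane", f_plane), ("sphere", f_sphere), ("slow", f_slow)):
    print(name, grad_squared(f, point))
```

The first two values equal unity to within the accuracy of the differences; the third does not, so a field given on that surface cannot be discontinuous there.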

4. Equations for Rays 

The equation describing the propagation of a wave front can be written in 
the form 

$$
\frac{\partial\omega}{\partial t} + c\sqrt{\omega_x^2 + \omega_y^2 + \omega_z^2} = 0 \tag{4.01}
$$

(for definiteness we have chosen the plus sign before the square root). The 
form of this equation is the same as that of the Hamilton-Jacobi equation in 
mechanics with $\omega$ playing the part of the action function S and the derivatives 
$\omega_x$, $\omega_y$ and $\omega_z$ the part of the “ momenta ” $p_x$, $p_y$ and $p_z$. Corresponding to 
the Hamiltonian we have here the expression 

$$
H = c\sqrt{\omega_x^2 + \omega_y^2 + \omega_z^2} \tag{4.02}
$$

To the trajectories of mechanics there correspond light rays. The equations 
for them are analogous to Hamilton’s equations. They can be written as 


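$$
\frac{dx}{dt} = \frac{\partial H}{\partial\omega_x} = \frac{c\,\omega_x}{\sqrt{\omega_x^2 + \omega_y^2 + \omega_z^2}}, \text{ etc.} \tag{4.03}
$$

$$
\frac{d\omega_x}{dt} = -\frac{\partial H}{\partial x} = 0, \text{ etc.} \tag{4.04}
$$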


Equation (4.04) shows that the quantities $\omega_x$, $\omega_y$ and $\omega_z$ are constant along a 
ray, though they can, of course, vary from one ray to another. Therefore the 
rays will be straight: 

$$
x - x_0 = \frac{c\,\omega_x}{\sqrt{\omega_x^2 + \omega_y^2 + \omega_z^2}}\,(t - t_0), \text{ etc.} \tag{4.05}
$$


If the sign of $\omega$, and hence of $\omega_x$, $\omega_y$ and $\omega_z$, is changed, the direction of the 
ray is reversed ; the sign must be chosen according to the given sense of direction 
of the ray. 

Any wave surface can be considered as formed of points moving along the 
rays with the speed of light according to (4.05). 

We thus have the possibility of constructing a wave surface at time t when 
its form at time $t_0$ is known. Let the equation of the wave surface at time $t_0$ 
have the form 

$$
\omega^0(x_0, y_0, z_0) = 0 \tag{4.06}
$$


where $x_0$, $y_0$ and $z_0$ are coordinates varying over this surface. Knowing the 
equation of the surface we can calculate the quantities 

$$
\alpha(x_0, y_0, z_0) = \frac{\omega_x^0}{\sqrt{(\omega_x^0)^2 + (\omega_y^0)^2 + (\omega_z^0)^2}}, \text{ etc.} \tag{4.07}
$$


Here the sign of the right-hand sides is determined by the given direction of 
wave propagation. The equation of the ray passing through the point $(x_0, y_0, z_0)$ 
of the initial wave surface is 

$$
\begin{aligned}
x - x_0 &= c\alpha(t - t_0) \\
y - y_0 &= c\beta(t - t_0) \\
z - z_0 &= c\gamma(t - t_0)
\end{aligned}
\qquad (\alpha^2 + \beta^2 + \gamma^2 = 1) \tag{4.08}
$$

The quantities x, y and z give the positions of the point to which the point 
$(x_0, y_0, z_0)$ moves at time t. Allowing $x_0$, $y_0$ and $z_0$ to take on all values which 
satisfy $\omega^0 = 0$, we obtain from (4.08) all points which at time t lie on the 
wave surface. 
If we solve (4.08) for $x_0$, $y_0$ and $z_0$ and insert the functions 

$$
x_0 = x_0(x, y, z, t - t_0), \text{ etc.} \tag{4.09}
$$

into the wave surface equation $\omega^0 = 0$, we get the relation 

$$
\omega(x, y, z, t - t_0) = 0 \tag{4.10}
$$

which is the explicit form of the equation of the wave surface at time t. At 
$t = t_0$, obviously, $x_0 = x$, $y_0 = y$, $z_0 = z$ and equation (4.10) reduces to (4.06), 
which is the equation of the initially given wave surface. 
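
The construction of a later wave surface from an initial one by means of the ray equations can be sketched numerically. In the example below the initial surface is taken, purely for illustration, to be a sphere of radius `R0` about the origin with the wave travelling outwards; the function names, the sampling of the surface and the units with c = 1 are assumptions of the sketch.

```python
import math

def propagate(points, grad_omega0, c, dt):
    """Move every point of the initial wave surface along its ray for a time dt."""
    moved = []
    for (x0, y0, z0) in points:
        gx, gy, gz = grad_omega0(x0, y0, z0)
        norm = math.sqrt(gx * gx + gy * gy + gz * gz)
        a, b, g = gx / norm, gy / norm, gz / norm  # direction cosines of the ray
        moved.append((x0 + c * a * dt, y0 + c * b * dt, z0 + c * g * dt))
    return moved

R0 = 2.0  # radius of the initial spherical surface (illustrative value)

def grad_sphere(x, y, z):
    # gradient of x^2 + y^2 + z^2 - R0^2; it points outwards, which fixes
    # the sense of propagation as outward
    return (2.0 * x, 2.0 * y, 2.0 * z)

def sphere_points(radius, n=12):
    """Sample points on a sphere about the origin, poles excluded."""
    pts = []
    for i in range(1, n):
        theta = math.pi * i / n
        for j in range(n):
            phi = 2.0 * math.pi * j / n
            pts.append((radius * math.sin(theta) * math.cos(phi),
                        radius * math.sin(theta) * math.sin(phi),
                        radius * math.cos(theta)))
    return pts

def length(v):
    return math.sqrt(sum(vi * vi for vi in v))

initial = sphere_points(R0)
final = propagate(initial, grad_sphere, c=1.0, dt=3.0)
print(min(length(p) for p in final), max(length(p) for p in final))
```

All displaced points lie at the same distance R0 + c(t - t_0) from the origin, so the later wave surface is again a sphere, as one expects for an outgoing spherical front.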

From the ray equation (4.05) there follows the relation 

$$
c^2(t - t_0)^2 - [(x - x_0)^2 + (y - y_0)^2 + (z - z_0)^2] = 0 \tag{4.11}
$$

which connects the coordinates of the initial and final points on each ray. It 
is the equation of a sphere centred at the point $x_0$, $y_0$, $z_0$ and of a radius 
$R = c(t - t_0)$ that increases linearly with time. Just as equation (3.21), from 
which we started, this equation expresses the fact that the velocity of light 
propagation is constant. 

For points infinitesimally separated relation (4.11) takes on the form 

$$
c^2\,dt^2 - (dx^2 + dy^2 + dz^2) = 0 \tag{4.12}
$$

In this form the equation follows directly from Hamilton’s equation (4.03). 
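
The step from Hamilton’s equation (4.03) to (4.12) may be written out explicitly; nothing is used beyond (4.03) and its two analogues for y and z.

```latex
\begin{align*}
% square the three equations (4.03) and add them
\Big(\frac{dx}{dt}\Big)^2 + \Big(\frac{dy}{dt}\Big)^2 + \Big(\frac{dz}{dt}\Big)^2
  &= c^2\,\frac{\omega_x^2 + \omega_y^2 + \omega_z^2}{\omega_x^2 + \omega_y^2 + \omega_z^2}
   = c^2 \\
% multiply by dt^2 and rearrange
c^2\,dt^2 - (dx^2 + dy^2 + dz^2) &= 0
\end{align*}
```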

A more general treatment of the problem of integrating the wave front equation 
is given in Appendix F. 

5. Inertial Frames of Reference 

In Section 2 we explained what is meant by a frame of reference and showed 
that it is essentially a certain base equipped with graduated rules and clocks ; 
it can be roughly represented as a kind of radar station. The important thing 
is that the base consists of certain material bodies that have some definite 
position in space and some definite motion. 

To describe any phenomenon, in particular propagation of light or motion of 
material bodies, requires a definite frame of reference or base. For example, 
the motion of the planets is normally described in the heliocentric system. 
The origin of this system, i.e. the position of the imaginary base, is the mass-centre 
of the solar system and the three axes are chosen to point to three fixed 
stars, by which choice the orientation of the base is fixed. 

In general, the mathematical form of the laws of nature will be different in 
different frames. Thus, for instance, the motion of bodies relative to the Earth 
may be described either in a frame with axes pointing to three fixed stars or in 
one rigidly fixed to the Earth and in the latter case one has to introduce Coriolis 
forces into the equations of motion. 

There exist frames of reference in which the equations of motion have a 
particularly simple form ; in a certain sense these are the most “ natural ” 
frames of reference. They are the inertial frames in which the motion of a body 
is uniform and rectilinear, provided no forces act on it. (Here the question 
arises how to tell that no forces are acting on the given body; we shall take 
it that this is the case if all bodies capable of exerting forces are sufficiently 
distant.) The heliocentric frame is an inertial one to a very good approximation. 
In an inertial frame we shall almost always use cartesian coordinates, because 
in terms of them it is particularly simple to formulate both the laws of Euclidean 
geometry and the laws of mechanics, in particular, the aforementioned law of 
the rectilinear uniform motion of a body not subject to forces. Of course, the 
very concept of an inertial frame permits one to go over to other, especially 
curvilinear, coordinates and if the equations of the transformation do not 
involve time explicitly one can take it that both the initial and the transformed 
coordinates refer to the same inertial frame. 

In pre-relativistic physics the notion of an inertial system was related only 
to the laws of mechanics. Newton’s first law of motion is, in fact, nothing but a 
definition of an inertial frame. 

However, we have seen that the laws of light propagation play a fundamental 
part in the definition of the basic concepts relating to space and time. Therefore 
it proves more correct to relate the notion of an inertial frame not only to the 
laws of mechanics but also to those of light propagation. 

The usual form of Maxwell’s equations refers to some inertial frame. It is 
obvious and has always been assumed, even before Relativity, that at least one 
reference frame exists that is inertial with respect to mechanics and in which 
at the same time Maxwell’s equations are true. The law of propagation of an 
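electromagnetic wave front in the form 

$$
\frac{1}{c^2}\left(\frac{\partial\omega}{\partial t}\right)^2 - (\operatorname{grad}\omega)^2 = 0 \tag{5.01}
$$

also refers to this inertial frame. A frame for which (5.01) is valid may be 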
called inertial in the electromagnetic sense. A frame that is inertial both in 
the mechanical and in the electromagnetic senses will be simply called inertial. 

Thus, by the definition we have adopted, an inertial frame is characterized 
by the following two properties: 

1. In an inertial frame a body moves uniformly and in a straight line, provided 
no forces act on it. (The inertial property in the usual mechanical sense.) 

2. In an inertial frame the equation of propagation of an electromagnetic 
wave front has the form (5.01). (The inertial property for the field.) 

We have spoken only of the propagation of an electromagnetic wave, thereby 
apparently giving this field preference over other fields. However, the preference 
is only apparent. The maximum speed of propagation of all fields must actually 
be the same, so that equation (5.01) is of universal validity. This question will 
be discussed in detail in the following section. 

6. The Basic Postulates of the Theory of Relativity 

The fundamental postulate of the theory of relativity, also called the Principle 
of Relativity, asserts that phenomena occurring in a closed system are independent 
of any non-accelerated motion of the system as a whole. 

We shall attempt to make the content of this postulate more precise.† 

Assume a closed system of material bodies and a variety of mechanical, 
electromagnetic or any other processes taking place within the system. We shall 
describe the state of the system, including the electromagnetic and other fields 
involved, with reference to some definite frame, or base in the sense of Section 2. 
Consider two frames. Let the first be inertial and let the second move uniformly 
in a straight line with respect to the first. We define the initial state of the 
system in relation to the first frame, using the word “ initial ” in the sense of 
relating to the initial instant according to a clock of the frame. We then 
imagine a copy of our system which has the same initial state with respect to 
the second frame as the original system had with respect to the first, with 
“ initial ” now, of course, defined with respect to the other frame. We shall 
compare the subsequent course of events in the original system, described in the 
first frame, with the course of events in the copy, as described in the second frame. 

† See L. I. Mandelstam [6]. 

The Principle of Relativity asserts that the two sequences of events will be 
exactly the same (at least insofar as they are determined at all). If a process 
in the original system can be described in terms of certain functions of the space 
and time coordinates of the first frame, the same functions of the space and time 
coordinates of the second frame will describe a process occurring in the copy. 

One can say more descriptively, if less precisely, that the uniform rectilinear 
motion of a material system as a whole has no influence on the course of any 
processes occurring within it. 

Many natural laws governing the course of various physical processes can 
be stated in terms of differential equations, the form of which does not depend 
on the initial state of the system concerned. The initial conditions enter as 
supplementary equations. This is so, for example, for the equations of motion 
in mechanics, and also for the equations of the electromagnetic field and the 
associated equation for the propagation of an electromagnetic wave front. It 
follows then from the Principle of Relativity that the mathematical form of 
such laws must be the same in the initially chosen frame and in any other 
which moves uniformly and in a straight line with respect to it. Remembering 
our definition of an inertial frame (in the mechanical sense and for fields) we 
are led to the following important conclusion : If an inertial frame is given, any 
other frame that moves uniformly and rectilinearly with respect to it is itself inertial. 
This postulate will serve us as the basis from which to derive the formulae 
linking the space and time coordinates in two frames moving without acceleration 
with respect to one another. 

The Principle of Relativity is confirmed by the whole of our experience of 
the natural world. In the domain of mechanics the principle has long been 
known; it is the Galilean relativity principle. Einstein’s achievement was to 
extend it to all phenomena (though in the first place to electromagnetism) and 
to derive from it certain consequences regarding the interrelation of space 
and time. 

The Theory of Relativity can be built up on two postulates, namely, the 
Principle of Relativity and another principle that states that the velocity of 
light is independent of the velocity of its source. We have accepted the latter 
principle from the first, for we have based our development on the law of 
propagation of an electromagnetic wave front and the second principle is an 
immediate consequence of this law. 

It is appropriate to give here a generalized interpretation of the law of wave 
front propagation and to formulate the following general postulate : 

There exists a maximum speed for the propagation of any kind of action. This 
is numerically equal to the speed of light in free space. 

This principle is very significant because the transmission of signals with 
greatest possible speed plays a fundamental part in the definition of concepts 
concerning space and time. The very notion of a definite frame of reference 
for describing events in space and time depends on the existence of such signals. 
In Section 2 we have considered methods of determining the positions of bodies 
in space that were based on the use of light, or other electromagnetic, signals. 
In principle, however, signals could be transmitted not only by electromagnetic 
waves but by waves of some other nature. One could, for instance, imagine 
signalling with particles having the limiting speed and with the matter-waves 
that correspond to them in quantum mechanics. Also conceivable, though 
unattainable in practice, would be to use gravitational waves, the existence of 
which follows from Einstein’s theory of Gravitation (see Section 90). It is not 
excluded that other fields capable of transmitting signals might be discovered. 
Therefore the question arises : is the notion of a reference frame sufficiently 
general, if its definition is based on the use of electromagnetic signals only ? 
For if more rapid means of signalling existed, or even instantaneous transmission, 
the concept of a reference frame defined in terms of light propagation could not 
aptly reflect the properties of space and time but would, at best, give one 
possible form of description. 

The principle formulated above, by asserting the existence of a general upper 
limit for all kinds of action and signal, endows the speed of light with a universal 
significance, independent of the particular properties of the agency of transmission 
and reflecting a certain objective property of space-time. 

This principle has a logical connection with the Principle of Relativity. 
For if there were no single limiting velocity but instead different agents, 
e.g. light and gravitation, propagated in vacuo with different speeds, then the 
Principle of Relativity would necessarily be violated as regards at least one of 
the agents. 

The principle of the universal limiting velocity can be made mathematically 
precise as follows : 

For any kind of wave advancing with limiting velocity and capable of transmitting 
signals the equation of front propagation is the same as the equation for 
the front of a light wave. 

Thus the equation 

$$
\frac{1}{c^2}\left(\frac{\partial\omega}{\partial t}\right)^2 - (\operatorname{grad}\omega)^2 = 0 \tag{6.01}
$$

acquires a general character ; it is more general than Maxwell’s equations from 
which we derived it. As a consequence of the principle of the existence of a 
universal limiting velocity one can assert the following: the differential 
equations describing any field that is capable of transmitting signals must be 
of such a kind that the equation of their characteristics is the same as the 
equation for the characteristics of light waves.† 

We shall see in Chapters V-VII that the presence of a gravitational field 
somewhat alters the appearance of the equation of the characteristics from the 
form (6.01), but in this case one and the same equation still governs the propagation 
of all kinds of wave fronts travelling with limiting velocity, including 
electromagnetic and gravitational ones. 

† The work of A. D. Aleksandrov [2, 3] contains a formulation of the basic postulates of 
the theory of relativity as a theory of the “ absolute ” properties of space and time, which is 
close to ours. The ideas of A. D. Fokker [1a] and the name Chronogeometry proposed by 
him for this theory also underline the “ absolute ” side of spatio-temporal relations. 

In the preceding section we have taken equation (6.01) together with the fact 
of the uniform rectilinear motion of a body not subject to forces, as the basis 
for defining inertial reference frames. 

The principle just formulated asserting the universality of the equation shows 
that such a definition of inertial systems is appropriate. 


7. The Galileo Transformations and the Need to Generalize Them 

Let one and the same phenomenon be described in two inertial frames of 
reference. The question arises of passing from the first frame to the second in 
the description of some phenomenon. For a rough illustration we can imagine 
two radar stations, one on the Earth, the other on an aeroplane or on a Sputnik; 
the problem is to convert from the data recorded at the first station to those 
found at the second. 

For such a conversion one must know first of all the relation between the 
space and time coordinates x, y, z and t in the first frame and the corresponding 
x', y', z' and t' in the second. In pre-relativity physics one accepted as self-evident 
the existence of a universal time t, the same for all frames; on that 
view one had to put $t' = t$ or $t' = t - t_0$, if a change of time-origin was admitted. 

Considering two events occurring at times $t$ and $\tau$, the old point of view 
required the time elapsed between them to be the same in all reference frames 
so that 

$$
t - \tau = t' - \tau' \tag{7.01}
$$

Further, it was considered to be evident that the length of a rigid rod, measured 
in the two frames, would have the same value. (Instead of the length of a rigid 
rod one could equally well consider the distance between the “ simultaneous ” 
positions of two points which need not necessarily be rigidly connected.) 
Denoting the spatial coordinates of the two ends of the rod (or the two points)
by $(x, y, z)$ and $(\xi, \eta, \zeta)$ in the one frame and by $(x', y', z')$ and
$(\xi', \eta', \zeta')$ in the other the old theory required

$$ (x - \xi)^2 + (y - \eta)^2 + (z - \zeta)^2 = (x' - \xi')^2 + (y' - \eta')^2 + (z' - \zeta')^2 \tag{7.02} $$


Equations (7.01) and (7.02) determine uniquely the general form of the transformation
connecting x, y, z and t with x', y', z' and t'. It consists of a change
in origin of spatial coordinates and of time, of a rotation of the spatial axes and
of a transformation such as

$$
\begin{aligned}
x' &= x - V_x t \\
y' &= y - V_y t \\
z' &= z - V_z t \\
t' &= t
\end{aligned}
\tag{7.03}
$$


where $V_x$, $V_y$ and $V_z$ are constants whose physical meaning is easy to see: they
give the velocity with which the primed frame moves relatively to the unprimed
one; more exactly they are the components of this velocity in the unprimed frame.
The transformation (7.03) is called a Galileo transformation.

Thus, pre-relativistic physics asserted that, given an inertial frame (x, y, z),
space and time coordinates in any other frame moving uniformly and rectilinearly
relative to the former are connected with x, y, z and t by a Galileo
transformation, apart from a rotation and a displacement of origin.
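
The two requirements (7.01) and (7.02) can be checked numerically for the transformation (7.03). The following is a minimal sketch; the function names `galileo` and `distance` and the sample events are illustrative assumptions, not part of the text.

```python
import math

def galileo(event, V):
    """Apply the Galileo transformation (7.03) to an event (x, y, z, t)."""
    x, y, z, t = event
    Vx, Vy, Vz = V
    return (x - Vx * t, y - Vy * t, z - Vz * t, t)

def distance(p, q):
    """Euclidean distance between the spatial parts of two events."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p[:3], q[:3])))

V = (3.0, -1.0, 2.0)           # velocity of the primed frame (arbitrary units)
e1 = (1.0, 2.0, 3.0, 5.0)      # two simultaneous events: the ends of a rod
e2 = (4.0, 6.0, 3.0, 5.0)
e3 = (0.0, 0.0, 0.0, 7.5)      # a later event

e1p, e2p, e3p = (galileo(e, V) for e in (e1, e2, e3))

# (7.01): the time elapsed between two events is the same in both frames
time_interval_kept = math.isclose(e3[3] - e1[3], e3p[3] - e1p[3])
# (7.02): the distance between simultaneous positions is the same in both frames
distance_kept = math.isclose(distance(e1, e2), distance(e1p, e2p))
```

Both flags come out true for any choice of the constants, which is all that the old theory demanded of the transformation.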

Galileo transformations satisfy the Principle of Relativity as far as the laws of
(Newtonian) mechanics are concerned, but not in relation to the propagation of
light. Indeed the wave front equation changes its appearance when subjected to
a Galileo transformation. If Galileo transformations were valid (and the Principle
of Relativity in its generalized form therefore not) there would exist
only one inertial system in the sense of our definition and the changed form of
the wave front equation in any other frame would allow one to detect even
uniform rectilinear motion relative to the single inertial system, the “immobile
ether”, and to determine the velocity of this motion. The fact that numerous
accurate experiments devised to discover such motion relative to the “ether”
all had negative results leaves no room whatever to doubt that the form of
the law of wave front propagation is the same in all non-accelerated frames.
Therefore the Principle of Relativity is certainly also applicable to electromagnetic
phenomena.

It follows also that the Galileo transformation is in general wrong and should 
be replaced by another. 
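
The statement that the wave front equation changes its appearance under a Galileo transformation can be illustrated numerically. The sketch below assumes the wave front equation in the form given as (8.03), restricted to one spatial dimension, and units in which c = 1; the helper `wave_front_residual` and the sample front ω = t − x/c are illustrative choices.

```python
def wave_front_residual(omega, x, t, c, h=1e-6):
    """(1/c^2)(d omega/dt)^2 - (d omega/dx)^2, by central differences."""
    d_t = (omega(x, t + h) - omega(x, t - h)) / (2 * h)
    d_x = (omega(x + h, t) - omega(x - h, t)) / (2 * h)
    return (d_t / c) ** 2 - d_x ** 2

c = 1.0    # units with c = 1 (assumption)
V = 0.3    # speed of the primed frame

def omega(x, t):
    """A plane front moving in the +x direction with speed c."""
    return t - x / c

def omega_primed(xp, tp):
    """The same front expressed in Galilean coordinates x = x' + V t', t = t'."""
    return omega(xp + V * tp, tp)

res_unprimed = wave_front_residual(omega, 2.0, 1.0, c)
res_primed = wave_front_residual(omega_primed, 2.0, 1.0, c)
```

Here `res_unprimed` vanishes to rounding error, while `res_primed` is close to ((1 − V/c)² − 1)/c², which is not zero: written in the Galilean primed variables, the same front no longer satisfies an equation of the same form.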

The general postulates formulated in the preceding sections provide all that
is required to derive a transformation of space and time coordinates to supersede
the Galileo transformation.

The problem can be stated as follows. Let a reference frame be given which 
is inertial by our definition, i.e. in both the mechanical and the electromagnetic 
senses (see Section 5). We denote the space and time coordinates in it by x, y, 
z and t. Let another frame (x', y', z', t') be given which is also inertial. The
connection between (x', y', z', t') and (x, y, z, t) is to be found.

The problem of finding a transformation between two inertial frames is 
purely mathematical; it can be solved without any further physical assumptions, 
other than the definition of an inertial frame given in Section 5. 

We shall obtain the solution in two stages: first we shall prove that the 
required transformation is linear and then we shall determine its coefficients. 

From the linearity of the transformation it follows that the motion of the 
new inertial frame relative to the old is uniform and rectilinear. This conclusion 
is essentially merely another formulation of the principle of relativity, according 
to which any frame of reference moving uniformly in a straight line with respect 
to some given inertial frame is itself inertial. Thus the principle of relativity 
is already contained in the premisses of the theorem to be proven. 

8. Proof of the Linearity of the Transformation Linking Two Inertial 
Frames 

It follows from the first condition characterizing inertial frames (the
mechanical condition), that the property of a motion to be uniform and rectilinear
must be preserved in going from one inertial frame to another. This
means that the equations

$$ x = x_0 + v_x (t - t_0); \quad y = y_0 + v_y (t - t_0); \quad z = z_0 + v_z (t - t_0) \tag{8.01} $$

must lead to transformed equations

$$ x' = x'_0 + v'_x (t' - t'_0); \quad y' = y'_0 + v'_y (t' - t'_0); \quad z' = z'_0 + v'_z (t' - t'_0) \tag{8.02} $$

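The second condition for inertial frames demands that from the equation

$$ \frac{1}{c^2}\left(\frac{\partial\omega}{\partial t}\right)^2 - \left[\left(\frac{\partial\omega}{\partial x}\right)^2 + \left(\frac{\partial\omega}{\partial y}\right)^2 + \left(\frac{\partial\omega}{\partial z}\right)^2\right] = 0 \tag{8.03} $$

in the unprimed frame there should follow, in the primed frame:

$$ \frac{1}{c^2}\left(\frac{\partial\omega}{\partial t'}\right)^2 - \left[\left(\frac{\partial\omega}{\partial x'}\right)^2 + \left(\frac{\partial\omega}{\partial y'}\right)^2 + \left(\frac{\partial\omega}{\partial z'}\right)^2\right] = 0 \tag{8.04} $$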


Now, we can write this second condition in a form analogous to the first;
consequences of (8.03) are the ray equations (4.08) which express the fact that
light propagates in straight lines and with constant speed. This must be true
in both the unprimed and the primed frames. Therefore, the point of intersection
of a light ray with the wave surface must satisfy equations

$$ x = x_0 + v_x (t - t_0); \quad y = y_0 + v_y (t - t_0); \quad z = z_0 + v_z (t - t_0) $$

where

$$ v_x^2 + v_y^2 + v_z^2 = c^2 \tag{8.05} $$

in the unprimed frame and likewise

$$ x' = x'_0 + v'_x (t' - t'_0); \quad y' = y'_0 + v'_y (t' - t'_0); \quad z' = z'_0 + v'_z (t' - t'_0) $$

where

$$ v_x'^2 + v_y'^2 + v_z'^2 = c^2 \tag{8.06} $$

in the primed one. Thus the second condition amounts to an additional requirement
that, if in the equations for uniform rectilinear motion $v^2 = c^2$, then
also $v'^2 = c^2$.

Our task amounts to finding the most general functions

$$
\begin{aligned}
x' &= f_1(x, y, z, t) \\
y' &= f_2(x, y, z, t) \\
z' &= f_3(x, y, z, t) \\
t' &= f_0(x, y, z, t)
\end{aligned}
\tag{8.07}
$$

such that the primed equations given above follow from the analogous
unprimed ones.†

Before tackling the problem we shall pass to a new, more symmetrical 
notation. We put 

$$ x = x_1, \quad y = x_2, \quad z = x_3 \tag{8.08} $$

and introduce instead of time a quantity proportional to it

$$ x_0 = ct \tag{8.09} $$

which has the physical meaning of the distance that light would travel in the
given time. (Note that in the new notation $x_0$ no longer denotes the initial
value of a space coordinate.) We write the required transformation as

$$ x'_i = f_i(x_0, x_1, x_2, x_3) \qquad (i = 0, 1, 2, 3) \tag{8.10} $$

† This problem has also been studied by Umov [4] and by Weyl [7].

We shall denote the initial values of $x_0$, $x_1$, $x_2$ and $x_3$ by $\xi_0$, $\xi_1$, $\xi_2$ and $\xi_3$ with
a corresponding notation in the primed system. We introduce an arbitrary
positive constant $\gamma_0$ and a parameter $s$ given by $s = (c/\gamma_0)(t - t_0)$ and put

$$ \gamma_i = \gamma_0 \frac{v_i}{c} \qquad (i = 1, 2, 3) \tag{8.11} $$

Then the equations of uniform straight line motion read as

$$ x_i = \xi_i + \gamma_i s \qquad (i = 0, 1, 2, 3) \tag{8.12} $$

and our first condition requires that from (8.12) there should follow

$$ x'_i = \xi'_i + \gamma'_i s' \qquad (i = 0, 1, 2, 3) \tag{8.13} $$

where the $\xi'_i$ and the $\gamma'_i$ are new constants and $s'$ a new parameter which can
be expressed in terms of the corresponding unprimed quantities as soon as
the form of the functions $f_i$ is known.

The second condition amounts to the requirement that from

$$ \gamma_0^2 - (\gamma_1^2 + \gamma_2^2 + \gamma_3^2) = 0 \tag{8.14} $$

there should follow

$$ \gamma_0'^2 - (\gamma_1'^2 + \gamma_2'^2 + \gamma_3'^2) = 0 \tag{8.15} $$

The transformation equations (8.10) must obviously be soluble for the old
variables $x_0$, $x_1$, $x_2$ and $x_3$, so that the Jacobian of the transformation must
not vanish

$$ \operatorname{Det} \frac{\partial f_i}{\partial x_k} \neq 0 \tag{8.16} $$

We now proceed to derive the equations for the functions $f_i$.

Considering the $x_i$ as linear functions of the parameter $s$ given by (8.12)
we have

$$ \frac{df_i}{ds} = \sum_{k=0}^{3} \gamma_k \frac{\partial f_i}{\partial x_k} \tag{8.17} $$

The derivative

$$ \frac{dx'_i}{dx'_0} = \frac{\gamma'_i}{\gamma'_0} \tag{8.18} $$

must be a constant by our conditions, so we have

$$ \frac{\sum_{k=0}^{3} \gamma_k\,(\partial f_i/\partial x_k)}{\sum_{k=0}^{3} \gamma_k\,(\partial f_0/\partial x_k)} = \frac{\gamma'_i}{\gamma'_0} = \text{const} \tag{8.19} $$

We equate to zero the logarithmic derivative of this expression with respect
to $s$ and obtain

$$ \frac{\sum_{k,l=0}^{3} \gamma_k \gamma_l\,(\partial^2 f_i/\partial x_k \partial x_l)}{\sum_{k=0}^{3} \gamma_k\,(\partial f_i/\partial x_k)} = \frac{\sum_{k,l=0}^{3} \gamma_k \gamma_l\,(\partial^2 f_0/\partial x_k \partial x_l)}{\sum_{k=0}^{3} \gamma_k\,(\partial f_0/\partial x_k)} \tag{8.20} $$

In this equation the arguments on which the partial derivatives of $f_0$ and $f_i$
depend are the quantities (8.12) which in turn depend on the $\gamma_i$. However,
since the $\xi_i$ in (8.12) are arbitrary we can consider the $\gamma_i$ and the $x_i$ to be
independent variables. (We can reach the same conclusion by putting $s = 0$ in
(8.20) and then writing $x_i$ instead of $\xi_i$.) Therefore the equations (8.20) must be
satisfied identically in the $\gamma_i$ and the $x_i$.

As functions of the $\gamma_i$ the expressions (8.20) are rational fractions, but it is
impossible for all four denominators, i.e. the four expressions (8.17) for i = 0, 1,
2, 3, to become zero simultaneously for any values of the $\gamma_i$, because the determinant
(8.16) is always different from zero. Therefore each of the fractions always
remains finite, even if its denominator becomes zero. This can only be so if the
denominator is a factor of the numerator and, therefore, the expressions (8.20)
are not, in fact, fractions but rational integral functions of the $\gamma_i$ which we can
write in the form

$$ 2 \sum_{l=0}^{3} \psi_l \gamma_l \tag{8.21} $$

Thus we have

$$ \sum_{k,l=0}^{3} \gamma_k \gamma_l \frac{\partial^2 f_i}{\partial x_k \partial x_l} = 2 \sum_{k=0}^{3} \gamma_k \frac{\partial f_i}{\partial x_k} \sum_{l=0}^{3} \psi_l \gamma_l \tag{8.22} $$

As this is an identity in the $\gamma_i$, we have

$$ \frac{\partial^2 f_i}{\partial x_k \partial x_l} = \psi_k \frac{\partial f_i}{\partial x_l} + \psi_l \frac{\partial f_i}{\partial x_k} \tag{8.23} $$

The conclusion is the following: in order that one uniform straight line motion
should transform into another the transformation functions $f_i$ must necessarily
satisfy the system of partial differential equations (8.23) where the $\psi_k$ are
certain functions of $x_0$, $x_1$, $x_2$ and $x_3$.

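We now find the condition that must be satisfied by the functions $f_i$ in order
that (8.15) should follow from (8.14). Owing to (8.19), equation (8.15) is
equivalent to

$$ \left(\sum_{k=0}^{3} \gamma_k \frac{\partial f_0}{\partial x_k}\right)^2 - \sum_{i=1}^{3} \left(\sum_{k=0}^{3} \gamma_k \frac{\partial f_i}{\partial x_k}\right)^2 = 0 \tag{8.24} $$

This must be a consequence of

$$ \gamma_0^2 - (\gamma_1^2 + \gamma_2^2 + \gamma_3^2) = 0 \tag{8.14} $$

As the left-hand sides of both (8.24) and (8.14) are quadratic functions of the
$\gamma_i$ they must be proportional to each other. To state the consequences of this
conveniently we introduce the four quantities

$$ e_0 = 1, \quad e_1 = e_2 = e_3 = -1 \tag{8.25} $$

and write (8.24) and (8.14) in the form

$$ \sum_{i,k,l=0}^{3} e_i \gamma_k \gamma_l \frac{\partial f_i}{\partial x_k} \frac{\partial f_i}{\partial x_l} = 0 \tag{8.26} $$

$$ \sum_{k=0}^{3} e_k \gamma_k^2 = 0 \tag{8.27} $$

Taking the left-hand sides of (8.26) and (8.27) to be proportional we get

$$ \sum_{i=0}^{3} e_i \frac{\partial f_i}{\partial x_k} \frac{\partial f_i}{\partial x_l} = \lambda e_k \delta_{kl} \tag{8.28} $$

where $\lambda$ is some function of $x_0$, $x_1$, $x_2$ and $x_3$ and the symbol $\delta_{kl}$ has its familiar
meaning

$$ \delta_{kl} = 0 \ \text{for } k \neq l; \qquad \delta_{kk} = 1 \tag{8.29} $$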


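We differentiate (8.28) with respect to $x_m$, put

$$ \frac{\partial \lambda}{\partial x_m} = 2\lambda\varphi_m \tag{8.30} $$

and have then

$$ \sum_{i=0}^{3} e_i \left( \frac{\partial^2 f_i}{\partial x_k \partial x_m} \frac{\partial f_i}{\partial x_l} + \frac{\partial f_i}{\partial x_k} \frac{\partial^2 f_i}{\partial x_l \partial x_m} \right) = 2\lambda\varphi_m e_k \delta_{kl} \tag{8.31} $$

Inserting herein the expressions (8.23) for the second derivatives and using
(8.28), we obtain

$$ e_m \psi_k \delta_{lm} + e_k \psi_l \delta_{km} + 2 e_k \psi_m \delta_{kl} = 2 e_k \varphi_m \delta_{kl} \tag{8.32} $$

This relation must hold for all values of k, l, m. Putting $k = l$, $m \neq l$, this
gives

$$ \psi_m = \varphi_m \tag{8.33} $$

Then, putting $k = m$, $l \neq m$, we get $\psi_l = 0$, therefore

$$ \psi_m = \varphi_m = 0 \tag{8.34} $$

The quantity $\lambda$ in (8.28) and (8.30) must then be a constant and the equations
(8.23) for the $f_i$ become

$$ \frac{\partial^2 f_i}{\partial x_k \partial x_l} = 0 \tag{8.35} $$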

The final result is that the combination of the two conditions characterizing 
inertial frames requires the transformation formulae between the coordinates 
in the two frames to be linear. 

The question as to how much follows as a consequence of each of the two 
conditions (i.e. of (8.23) and (8.28)), separately, is considered in Appendix A. 


9. Determination of the Coefficients of the Linear Transformations 
and of a Scale Factor 

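Equations (8.28) for the functions $f_i$ involve a constant multiplier $\lambda$. In order
to bring out explicitly its influence on the transformation we shall write the
linear functions $f_i$ in the form

$$ x'_i = f_i = \sqrt{\lambda} \left( a_i + \sum_{k=0}^{3} e_k a_{ik} x_k \right) \tag{9.01} $$

Equations (8.28) will be satisfied identically in $\lambda$ if the coefficients $a_{ik}$ satisfy
the relation

$$ \sum_{i=0}^{3} e_i a_{ik} a_{il} = e_k \delta_{kl} \tag{9.02} $$

from which it also follows that

$$ \sum_{i=0}^{3} e_i a_{ki} a_{li} = e_k \delta_{kl} \tag{9.03} $$

The equations (9.01) can be solved for the $x_i$, leading to

$$ x_i = \sum_{k=0}^{3} e_k a_{ki} \left( \frac{x'_k}{\sqrt{\lambda}} - a_k \right) \tag{9.04} $$

We also have

$$ dx_0'^2 - (dx_1'^2 + dx_2'^2 + dx_3'^2) = \lambda \left[ dx_0^2 - (dx_1^2 + dx_2^2 + dx_3^2) \right] \tag{9.05} $$

and, for an arbitrary function $\omega$:

$$ \left(\frac{\partial\omega}{\partial x'_0}\right)^2 - \left[ \left(\frac{\partial\omega}{\partial x'_1}\right)^2 + \left(\frac{\partial\omega}{\partial x'_2}\right)^2 + \left(\frac{\partial\omega}{\partial x'_3}\right)^2 \right] = \frac{1}{\lambda} \left\{ \left(\frac{\partial\omega}{\partial x_0}\right)^2 - \left[ \left(\frac{\partial\omega}{\partial x_1}\right)^2 + \left(\frac{\partial\omega}{\partial x_2}\right)^2 + \left(\frac{\partial\omega}{\partial x_3}\right)^2 \right] \right\} \tag{9.06} $$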


The factor $\lambda$, or rather $\sqrt{\lambda}$, evidently characterizes the ratio of the scales of
measurement in the primed and unprimed frames; in either frame the scale
is a common one for space and time coordinates so that a change in it has no
effect on the values of either angles in space or of velocities. We shall show that
this factor should be equated to unity.

Consider a point at rest in the primed frame. In the unprimed frame the
velocity of this point is then the velocity of the primed frame itself. We denote
the components of this velocity (in the unprimed system) by $V_1$, $V_2$ and $V_3$,
so that

$$ V_i = \frac{dx_i}{dt} = c \frac{dx_i}{dx_0} \quad \text{for } dx'_1 = dx'_2 = dx'_3 = 0 \tag{9.07} $$

But, from (9.04) we have

$$ dx_i = \frac{1}{\sqrt{\lambda}}\, a_{0i}\, dx'_0; \qquad dx_0 = \frac{1}{\sqrt{\lambda}}\, a_{00}\, dx'_0 \tag{9.08} $$

and, therefore,

$$ V_i = c\, \frac{a_{0i}}{a_{00}} \qquad (i = 1, 2, 3) \tag{9.09} $$


It follows in the first place that the transformation (9.01) which gives the relation
between the four coordinates in the two inertial frames, corresponds to
passing from a given frame to another moving uniformly and in a straight line
relative to the first, as it should be.

Further, it follows from (9.09) that the relative velocity of the motion is not
at all connected with the scale factor $\lambda$ and, therefore, that this factor cannot
depend on the relative velocity.† Then one has only to assume that for
vanishing relative velocity, length and time measurements in both systems are
performed in the same way, in terms of the same units, and one finds $\lambda = 1$.
Subject to such very natural requirements $\lambda$ will be equal to 1 for any inertial
frame, whatever its velocity.

† It is usually said, following Einstein, that the scale factor can “evidently” depend on
nothing but the relative velocity, and it is subsequently proved that, in fact, it does not have
any such dependence but is equal to 1.

Putting $\lambda = 1$ reduces our formulae (9.05) and (9.06) to the simpler form

$$ dx_0'^2 - (dx_1'^2 + dx_2'^2 + dx_3'^2) = dx_0^2 - (dx_1^2 + dx_2^2 + dx_3^2) \tag{9.10} $$

and

$$ \left(\frac{\partial\omega}{\partial x'_0}\right)^2 - \left[ \left(\frac{\partial\omega}{\partial x'_1}\right)^2 + \left(\frac{\partial\omega}{\partial x'_2}\right)^2 + \left(\frac{\partial\omega}{\partial x'_3}\right)^2 \right] = \left(\frac{\partial\omega}{\partial x_0}\right)^2 - \left[ \left(\frac{\partial\omega}{\partial x_1}\right)^2 + \left(\frac{\partial\omega}{\partial x_2}\right)^2 + \left(\frac{\partial\omega}{\partial x_3}\right)^2 \right] \tag{9.11} $$

while the transformation equations from either frame to the other become:

$$ x'_i = a_i + \sum_{k=0}^{3} e_k a_{ik} x_k \tag{9.12} $$

and

$$ x_i = \sum_{k=0}^{3} e_k a_{ki} (x'_k - a_k) \tag{9.13} $$

These relations carry the name of Lorentz transformations. They constitute 
the formal basis of the whole theory of relativity. 
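
The invariance expressed by (9.10) can be checked numerically for a particular set of coefficients. The sketch below assumes the simplest case, relative motion along the first axis with parallel axes and units in which c = 1; the helper names and the sample events are illustrative, and the sign conventions follow (9.12) together with the quantities $e_k$ of (8.25).

```python
import math

E = (1.0, -1.0, -1.0, -1.0)   # the quantities e_0, e_1, e_2, e_3 of (8.25)

def boost_coefficients(beta):
    """Coefficients a_ik for relative velocity V = beta*c along x_1 (assumed case)."""
    g = 1.0 / math.sqrt(1.0 - beta * beta)
    a = [[0.0] * 4 for _ in range(4)]
    a[0][0] = g
    a[0][1] = g * beta
    a[1][0] = -g * beta
    a[1][1] = -g
    a[2][2] = -1.0
    a[3][3] = -1.0
    return a

def transform(a, offset, x):
    """x'_i = a_i + sum_k e_k a_ik x_k, equation (9.12)."""
    return [offset[i] + sum(E[k] * a[i][k] * x[k] for k in range(4))
            for i in range(4)]

def interval(x, y):
    """Quadratic form of (9.10) for the coordinate differences of two events."""
    d = [p - q for p, q in zip(x, y)]
    return d[0] ** 2 - (d[1] ** 2 + d[2] ** 2 + d[3] ** 2)

a = boost_coefficients(0.6)
offset = [1.0, -2.0, 0.5, 3.0]          # the constants a_i (arbitrary)
p = [2.0, 1.0, -1.0, 0.5]               # events as (x_0, x_1, x_2, x_3)
q = [0.5, 0.0, 2.0, 1.5]

s2 = interval(p, q)
s2_primed = interval(transform(a, offset, p), transform(a, offset, q))
```

The two values agree to rounding error whether the interval is positive, negative or zero, which is the sense in which the two sides of (9.10) are identically equal rather than merely vanishing together.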

One should note that the starting point of our discussion was the requirement
that in every inertial frame the equation describing the propagation of an
electromagnetic wave front should have the form (5.01). Thence it followed
that the vanishing of the right-hand side of (9.11) led to the vanishing of the
left-hand side. But by imposing an additional condition to the effect that a
motion uniform and rectilinear in one frame should remain so in the other and
that scales of measurement should not alter we found more: the left- and
right-hand sides of (9.11) not only vanish together, they are identically equal.


10. Lorentz Transformations 

A Lorentz transformation is a set of equations for transforming the space 
and time coordinates in one inertial frame into those of another that moves 
uniformly and in a straight line relative to the first. The transformation can 
be characterized by the fact that the quantity 

$$ ds^2 = dx_0^2 - (dx_1^2 + dx_2^2 + dx_3^2) \tag{10.01} $$

or

$$ ds^2 = c^2 dt^2 - (dx^2 + dy^2 + dz^2) \tag{10.02} $$

remains invariant in the strict sense (not only the numerical value, but also
the mathematical form of the expression remains unchanged). The most general
Lorentz transformation is of the form

$$ x'_i = a_i + \sum_{k=0}^{3} e_k a_{ik} x_k \tag{10.03} $$

where the coefficients $a_{ik}$ satisfy the relations

$$ \sum_{i=0}^{3} e_i a_{ik} a_{il} = e_k \delta_{kl} \tag{10.04} $$

and

$$ \sum_{i=0}^{3} e_i a_{ki} a_{li} = e_k \delta_{kl} \tag{10.05} $$

Let us investigate the physical meaning of the constants entering these transformation
formulae.

First of all, the constant terms $a_i$ evidently represent a change of origin for
space and time coordinates. If we take it that at $t = 0$ the spatial origins of
both frames coincide and $t' = 0$, we have

$$ a_i = 0 \qquad (i = 0, 1, 2, 3) \tag{10.06} $$

In the following we shall make this assumption and shall write Lorentz transformations
in the form

$$ x'_i = \sum_{k=0}^{3} e_k a_{ik} x_k \tag{10.07} $$

We have already discussed the physical significance of the coefficient ratios

$$ \frac{a_{0i}}{a_{00}} = \frac{V_i}{c} \tag{10.08} $$

They give the relative velocity between the two systems, divided by the speed
of light. More precisely, the $V_i$ are the components in the unprimed frame of the
vector giving the velocity of the primed frame relative to the unprimed one.
Since the transformation inverse to (10.07) has the form

$$ x_i = \sum_{k=0}^{3} e_k a_{ki} x'_k \tag{10.09} $$

the quantities $V'_i$ given by

$$ \frac{a_{i0}}{a_{00}} = \frac{V'_i}{c} \tag{10.10} $$

are the components in the primed frame of the velocity vector of the unprimed
frame relative to the primed one.

If in (10.04) and (10.05) we put $k = l = 0$, we obtain

$$ a_{00}^2 - (a_{10}^2 + a_{20}^2 + a_{30}^2) = 1 \tag{10.11} $$

and

$$ a_{00}^2 - (a_{01}^2 + a_{02}^2 + a_{03}^2) = 1 \tag{10.12} $$

Whence, using (10.08) and (10.10), it follows that

$$ V_1^2 + V_2^2 + V_3^2 = V_1'^2 + V_2'^2 + V_3'^2 \tag{10.13} $$

and

$$ V^2 = V'^2 \tag{10.14} $$

Here $V^2$ is understood to be the square of the magnitude of the velocity.
Therefore the relative speed measured in either frame is the same, a result that
is not obvious although it corresponds entirely with intuitive ideas.
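
The coefficient $a_{00}$ can be obtained from (10.08) and (10.12) by a short computation. Written out as a worked step, using only relations already stated (the choice of the positive root is justified by the direction of time being preserved):

```latex
\begin{align*}
a_{00}^2 - \left(a_{01}^2 + a_{02}^2 + a_{03}^2\right) &= 1
  && \text{by (10.12)} \\
a_{00}^2 \left(1 - \frac{V_1^2 + V_2^2 + V_3^2}{c^2}\right) &= 1
  && \text{since } a_{0i} = a_{00}\, V_i / c \text{ by (10.08)} \\
a_{00} &= \frac{1}{\sqrt{1 - V^2/c^2}}
  && \text{taking the positive root}
\end{align*}
```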

It follows from (10.08) and (10.12) that

$$ a_{00} = \frac{1}{\sqrt{1 - V^2/c^2}} \tag{10.15} $$

The square root must be taken positive, for

$$ a_{00} = \frac{\partial t'}{\partial t} > 0 \tag{10.16} $$

A negative value of $a_{00}$ would imply a change in the direction of time.
From (10.08) and (10.15) we get

$$ a_{0i} = \frac{V_i}{\sqrt{c^2 - V^2}} \qquad (i = 1, 2, 3) \tag{10.17} $$

and, similarly, from (10.10) and (10.15)

$$ a_{i0} = \frac{V'_i}{\sqrt{c^2 - V^2}} \qquad (i = 1, 2, 3) \tag{10.18} $$

Taking one suffix to be zero in the orthogonality relations (10.04) and (10.05)
and using (10.17) and (10.18) one obtains

$$ a_{00} V'_i = a_{i1} V_1 + a_{i2} V_2 + a_{i3} V_3 \qquad (i = 1, 2, 3) \tag{10.19} $$

and

$$ a_{00} V_i = a_{1i} V'_1 + a_{2i} V'_2 + a_{3i} V'_3 \qquad (i = 1, 2, 3) \tag{10.20} $$

If both suffixes are different from zero the orthogonality relations give

$$ a_{1i} a_{1k} + a_{2i} a_{2k} + a_{3i} a_{3k} = \delta_{ik} + \frac{V_i V_k}{c^2 - V^2} \tag{10.21} $$

For i, k = 1, 2, 3 we shall now introduce new quantities $\alpha_{ik}$ instead of the
$a_{ik}$ by

$$ \alpha_{ik} = -a_{ik} + \frac{a_{i0}\, a_{0k}}{a_{00} + 1} \tag{10.22} $$

or

$$ \alpha_{ik} = -a_{ik} + (a_{00} - 1) \frac{V'_i V_k}{V^2} \tag{10.23} $$

It is easy to verify that as a result of (10.20) and (10.21) we have

$$ \alpha_{1i} \alpha_{1k} + \alpha_{2i} \alpha_{2k} + \alpha_{3i} \alpha_{3k} = \delta_{ik} \qquad (i, k = 1, 2, 3) \tag{10.24} $$

Thus the $\alpha_{ik}$ are the coefficients of a three-dimensional orthogonal transformation
and can be interpreted as direction cosines between the two sets of
spatial axes with the first suffix referring to the new axes, the second to the old.
Using (10.19) and (10.20) equations (10.22) give

$$ \alpha_{i1} V_1 + \alpha_{i2} V_2 + \alpha_{i3} V_3 = -V'_i \tag{10.25} $$

and

$$ \alpha_{1i} V'_1 + \alpha_{2i} V'_2 + \alpha_{3i} V'_3 = -V_i \tag{10.26} $$

These relations can be interpreted as saying that the vector V' (the velocity 
of the unprimed frame relative to the primed one) and the vector V (the velocity 
of the primed frame relative to the unprimed) are equal in magnitude and 
opposite in direction. 

The equations obtained allow us to express all the coefficients of a Lorentz
transformation in terms of the direction cosines $\alpha_{ik}$ (i, k = 1, 2, 3) and the
three components $V_1$, $V_2$ and $V_3$ of the relative velocity. As the nine $\alpha_{ik}$ are
subject to the six relations (10.24) and hence are expressible in terms of three
independent quantities, the Lorentz transformation involves six parameters
in all, not counting the constant displacement terms $a_i$ that were set equal
to zero.

We have already obtained the expressions (10.15) and (10.17) for the coefficients
$a_{00}$ and $a_{0i}$. For the $a_{i0}$ we can take the expressions (10.18) in which
the $V'_i$ must be understood as expressed in the form (10.25), viz.:

$$ a_{i0} = -\frac{1}{\sqrt{c^2 - V^2}} \sum_{k=1}^{3} \alpha_{ik} V_k \qquad (i = 1, 2, 3) \tag{10.27} $$

For the $a_{ik}$, finally, we can put

$$ a_{ik} = -\alpha_{ik} + (a_{00} - 1) \frac{V'_i V_k}{V^2} \tag{10.28} $$

or, more explicitly:

$$ a_{ik} = -\alpha_{ik} - \left( \frac{1}{\sqrt{1 - V^2/c^2}} - 1 \right) \frac{V_k}{V^2} \sum_{j=1}^{3} \alpha_{ij} V_j \tag{10.29} $$
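
As a check on the parametrization, the coefficients can be assembled from a set of direction cosines and a relative velocity and then tested against the orthogonality relations (10.04) and (10.05). This is a sketch under stated assumptions: units with c = 1, a nonzero velocity, and a rotation about the third axis as the sample orthogonal matrix; the function names are illustrative.

```python
import math

E = (1.0, -1.0, -1.0, -1.0)   # e_0, e_1, e_2, e_3 of (8.25)

def lorentz_coefficients(alpha, V, c=1.0):
    """Coefficients a_ik from direction cosines alpha (3x3) and velocity V,
    following (10.15), (10.17), (10.27) and (10.29). Assumes 0 < |V| < c."""
    V2 = sum(v * v for v in V)
    a00 = 1.0 / math.sqrt(1.0 - V2 / c**2)
    root = math.sqrt(c**2 - V2)
    # alpha_V[i] = sum_k alpha_ik V_k, equal to -V'_i by (10.25)
    alpha_V = [sum(alpha[i][k] * V[k] for k in range(3)) for i in range(3)]
    a = [[0.0] * 4 for _ in range(4)]
    a[0][0] = a00                                        # (10.15)
    for i in range(3):
        a[0][i + 1] = V[i] / root                        # (10.17)
        a[i + 1][0] = -alpha_V[i] / root                 # (10.27)
        for k in range(3):
            a[i + 1][k + 1] = (-alpha[i][k]
                               - (a00 - 1.0) * V[k] * alpha_V[i] / V2)  # (10.29)
    return a

def orthogonality_defect(a):
    """Largest violation of the relations (10.04) and (10.05)."""
    worst = 0.0
    for k in range(4):
        for l in range(4):
            target = E[k] if k == l else 0.0
            s04 = sum(E[i] * a[i][k] * a[i][l] for i in range(4))
            s05 = sum(E[i] * a[k][i] * a[l][i] for i in range(4))
            worst = max(worst, abs(s04 - target), abs(s05 - target))
    return worst

theta = 0.4                                  # sample rotation angle (assumption)
alpha = [[math.cos(theta), math.sin(theta), 0.0],
         [-math.sin(theta), math.cos(theta), 0.0],
         [0.0, 0.0, 1.0]]
V = [0.3, -0.2, 0.4]

a = lorentz_coefficients(alpha, V)
defect = orthogonality_defect(a)
```

The defect is zero up to rounding error for any orthogonal `alpha` and any velocity below c, in line with the count of six free parameters.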

Let us insert the values of transformation coefficients just found into (10.07),
writing $ct$ for $x_0$ to get back to more descriptive notation. We then get

$$ t' = \frac{1}{\sqrt{1 - V^2/c^2}} \left( t - \frac{1}{c^2} (V_1 x_1 + V_2 x_2 + V_3 x_3) \right) \tag{10.30} $$

The expressions for the spatial coordinates are most conveniently written as

$$ x'_i = \sum_{m=1}^{3} \alpha_{im} x^*_m \tag{10.31} $$

where

$$ x^*_m = x_m - V_m t + \left( \frac{1}{\sqrt{1 - V^2/c^2}} - 1 \right) \frac{V_m}{V^2} \sum_{k=1}^{3} V_k (x_k - V_k t) \tag{10.32} $$

We see that the Lorentz transformation can be performed in two stages. The
first consists in going from variables $(x_1, x_2, x_3, t)$ to new variables $(x^*_1, x^*_2, x^*_3, t^*)$
where $x^*_1$, $x^*_2$ and $x^*_3$ are given by (10.32) and $t^*$ is equal to $t'$ and given by

$$ t^* = \frac{1}{\sqrt{1 - V^2/c^2}} \left( t - \frac{1}{c^2} (V_1 x_1 + V_2 x_2 + V_3 x_3) \right) \tag{10.33} $$

The second stage consists in going from $(x^*_1, x^*_2, x^*_3, t^*)$ to $(x'_1, x'_2, x'_3, t')$ where
by (10.30) and (10.31)

$$ x'_i = \alpha_{i1} x^*_1 + \alpha_{i2} x^*_2 + \alpha_{i3} x^*_3; \qquad t' = t^* \tag{10.34} $$

The second stage is evidently simply a rotation of the spatial reference frame,
while the first is a transition to a frame moving with the velocity $(V_1, V_2, V_3)$
without any rotation of spatial axes.

Of course we could also have performed the rotation of axes first and then 
gone over to the moving frame. 

The reverse transformation can also be performed in two stages, firstly going
from $(x'_1, x'_2, x'_3, t')$ to $(x^*_1, x^*_2, x^*_3, t^*)$ by the equations

$$ x^*_i = \alpha_{1i} x'_1 + \alpha_{2i} x'_2 + \alpha_{3i} x'_3; \qquad t^* = t' \tag{10.35} $$

which represent simply a rotation of axes, and secondly passing from
$(x^*_1, x^*_2, x^*_3, t^*)$ to $(x_1, x_2, x_3, t)$ by means of

$$ x_m = x^*_m + V_m t^* + \left( \frac{1}{\sqrt{1 - V^2/c^2}} - 1 \right) \frac{V_m}{V^2} \sum_{k=1}^{3} V_k (x^*_k + V_k t^*) \tag{10.36} $$

and

$$ t = \frac{1}{\sqrt{1 - V^2/c^2}} \left( t^* + \frac{1}{c^2} (V_1 x^*_1 + V_2 x^*_2 + V_3 x^*_3) \right) \tag{10.37} $$

which are the inverse formulae to (10.32), (10.33). The transformation from
$(x^*_1, x^*_2, x^*_3, t^*)$ to $(x_1, x_2, x_3, t)$ and its inverse go over into each other if the sign
of the velocity V is reversed.
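
The first stage of the transformation and its inverse can be tried out numerically. The sketch below implements (10.32) and (10.33) and obtains the inverse (10.36), (10.37) by reversing the sign of the velocity, as the text states; units with c = 1 by default, the function names and the sample values are assumptions made for illustration.

```python
import math

def to_moving_frame(x, t, V, c=1.0):
    """(x_1, x_2, x_3, t) -> (x*_1, x*_2, x*_3, t*) by (10.32) and (10.33).

    Assumes 0 < |V| < c.
    """
    V2 = sum(v * v for v in V)
    g = 1.0 / math.sqrt(1.0 - V2 / c**2)
    Vx = sum(v * xi for v, xi in zip(V, x))      # V_1 x_1 + V_2 x_2 + V_3 x_3
    proj = Vx - V2 * t                           # sum_k V_k (x_k - V_k t)
    x_star = [xi - v * t + (g - 1.0) * v * proj / V2 for xi, v in zip(x, V)]
    t_star = g * (t - Vx / c**2)
    return x_star, t_star

def from_moving_frame(x_star, t_star, V, c=1.0):
    """Inverse formulae (10.36), (10.37): the same map with V reversed."""
    return to_moving_frame(x_star, t_star, [-v for v in V], c)

def interval(x, t, c=1.0):
    """c^2 t^2 - (x_1^2 + x_2^2 + x_3^2) for an event measured from the origin."""
    return (c * t) ** 2 - sum(xi * xi for xi in x)

V = [0.3, -0.2, 0.4]
x, t = [1.0, 2.0, -0.5], 0.7

x_star, t_star = to_moving_frame(x, t, V)
x_back, t_back = from_moving_frame(x_star, t_star, V)
```

Applying the two maps in succession returns the original event, and the quantity `interval` has the same value before and after the first stage.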

One should note that the Jacobian of these transformations is unity. As
regards the substitution (10.34) and its inverse (10.35), the Jacobian will be
+1 if they describe a rotation of axes in the proper sense, i.e. without an
accompanying passage from a right-handed frame to a left-handed one, or
vice versa. Therefore, if the direction of time is preserved and also the
right-handed (or left-handed) character of the spatial frame, the Jacobian of the
Lorentz transformation will be equal to 1. Such Lorentz transformations are
customarily called proper. In the following we shall only use proper Lorentz
transformations.

A rotation of axes is not properly a transition to a new inertial frame, so it 
is of no interest to us. The characteristic properties of a Lorentz transformation 
are contained in equations (10.32) and (10.33) and their inverses (10.36) and 
(10.37). These equations simplify if the directions of the axes are so chosen 
that one of them, e.g. the first, is in the direction of relative velocity. Putting 

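$$ V_1 = V; \qquad V_2 = V_3 = 0 \tag{10.38} $$

we get from (10.32) and (10.33)

$$ x^*_1 = \frac{x_1 - Vt}{\sqrt{1 - V^2/c^2}}; \qquad x^*_2 = x_2; \qquad x^*_3 = x_3 \tag{10.39} $$

$$ t^* = \frac{t - V x_1/c^2}{\sqrt{1 - V^2/c^2}} \tag{10.40} $$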
The inverse equations are obtained by replacing $V$ by $-V$. For greater
descriptiveness we shall write x, y, z instead of $x_1$, $x_2$, $x_3$. Replacing the asterisk
by a prime, we then get

$$ x' = \frac{x - Vt}{\sqrt{1 - V^2/c^2}}; \qquad y' = y; \qquad z' = z \tag{10.41} $$

and

$$ t' = \frac{t - Vx/c^2}{\sqrt{1 - V^2/c^2}} \tag{10.42} $$

and, expressing x, t in terms of x', t':

$$ x = \frac{x' + Vt'}{\sqrt{1 - V^2/c^2}}; \qquad y = y'; \qquad z = z' \tag{10.43} $$

and

$$ t = \frac{t' + Vx'/c^2}{\sqrt{1 - V^2/c^2}} \tag{10.44} $$
The transformation given by these formulae is of a special kind, but it contains 
all the characteristic features by which the new theory differs from the old. 
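
The special transformation (10.41), (10.42) and its inverse (10.43), (10.44) are easily tried out numerically. A minimal sketch, assuming units with c = 1 and illustrative function names:

```python
import math

def lorentz_x(x, t, V, c=1.0):
    """Special Lorentz transformation (10.41), (10.42) along the x axis."""
    root = math.sqrt(1.0 - V * V / (c * c))
    return (x - V * t) / root, (t - V * x / (c * c)) / root

def lorentz_x_inverse(xp, tp, V, c=1.0):
    """Inverse formulae (10.43), (10.44): the same map with V replaced by -V."""
    return lorentz_x(xp, tp, -V, c)

V = 0.6
x, t = 3.0, 5.0                        # an event on the world line x = V t
xp, tp = lorentz_x(x, t, V)            # it is at rest at x' = 0 in the primed frame
x_back, t_back = lorentz_x_inverse(xp, tp, V)

# A light signal x = c t is again a light signal x' = c t' in the primed frame.
light_xp, light_tp = lorentz_x(2.0, 2.0, V)
```

For the light signal the ratio x'/t' is again c, whereas the Galileo transformation (7.03) applied to the same event would give a speed c − V.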

It is convenient to write the formulae for Lorentz transformations in vectorial 
form. Using familiar three-dimensional vector notation we get 


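$$ t' = \frac{1}{\sqrt{1 - V^2/c^2}} \left( t - \frac{1}{c^2} (\mathbf{V} \cdot \mathbf{r}) \right) \tag{10.45} $$

$$ \mathbf{r}' = \mathbf{r} - \mathbf{V} t + \left( \frac{1}{\sqrt{1 - V^2/c^2}} - 1 \right) \frac{\mathbf{V}}{V^2} \left( (\mathbf{V} \cdot \mathbf{r}) - V^2 t \right) \tag{10.46} $$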


11. Determination of Distances and Synchronization of Clocks within 
One Inertial Reference Frame 

Before going on to discuss consequences of the Lorentz transformation, let 
us return to the questions already raised in Sections 1 and 2 concerning the 
measurement of distances and time differences in a single reference frame. 
We shall dwell a little more on the concept of simultaneity at different points in
space and on the problem of synchronizing clocks at a distance from each other.

There are different methods of measuring distances between bodies at rest 
in an inertial frame : one can directly apply a graduated rule or “ measuring 
rod ”, one can use triangulation, or else one can use radar. The first method 
relies only on the properties of rigid bodies, the second depends also on the 
rectilinear propagation of light, in the third the knowledge of the speed of light 
plays an essential part. It goes without saying that all methods assume the 
validity of Euclidean geometry ; if it were not valid then, for instance, different 
triangulation procedures could give mutually contradictory results. We have 
stressed in Section 2 that the validity of the laws of Euclidean geometry should 
be taken as an experimental fact and not as an a priori assumption. 

All three methods of measurement are based on the properties of solid bodies 
and of light. Since light is the simpler phenomenon one should consider it as 
the primary entity and should, for instance, check the constancy of a standard 
of length by optical means—by finding the number of wavelengths that fit 
on to the standard. 

To measure the speed of light one must know how to measure distances and 
time intervals by methods independent of the knowledge of that speed. If, 
however, one assumes that the speed of light is the same in some direction and 
its reverse, it is enough to be able to measure time at one point. Any periodic 
process, in principle, gives one this ability, for instance, the vibration of an 
ammonia molecule or of a quartz crystal. We shall call a device that measures 
time a clock or chronometer, regardless whether it is in fact a mechanical 
contrivance, such as a common clock, or whether it works on some other principle. 

The measurement of the speed of light basically amounts to the determination
of the interval $\tau$ during which light passes to and fro along a path of length $r$,
previously measured by triangulation or with a measuring rod; the speed of
light will be $c = 2r/\tau$. Thus the problem amounts essentially to finding a
conversion factor from a distance expressed in units of length and measured by 
triangulation or by direct application of a measuring rod, to a distance expressed 
in units of time and measured by radar methods. This conversion factor can 
be determined once and for all. 

We now pass to the question of comparing the readings of clocks at rest in
the same inertial frame, but situated at a distance from each other. We assume
that the clocks are of identical construction and run at the same rate, so that
the question is simply that of synchronizing them. Let “clock A” be at the
point A and “clock B” be at the point B, a given distance $r$ from A. A light
signal is emitted from A at time $t_1$; it is reflected by a mirror placed at B and
returns to A at time $t_2$. What time should be indicated by the properly synchronized
clock at B at the moment the signal reaches that point? We shall
assume that clock B is synchronous with clock A if at the moment the signal
arrives it shows the time $t' = (t_1 + t_2)/2$. This is Einstein’s definition of
synchronization for two spatially separated clocks. It is a very natural definition
and is in accord with the basic postulate of the Theory of Relativity
that the speed of light is constant. For, if the signal leaves A at time $t_1$ it must
reach B at the time

$$ t = t_1 + \frac{r}{c} $$


32 


The Theory of Space Time and Gravitation 


having traversed the distance $r$ with speed $c$. On the other hand, as it returns
to A at time $t_2$, having previously traversed the distance $r$ from B to A, it
must have been reflected from B at the earlier time,

$$ t = t_2 - \frac{r}{c} $$

It follows from these two equations that we must attribute to the time of
reflection of the signal at B as measured by clock A the value $t = (t_1 + t_2)/2$,
and if clock B is to be synchronous with clock A it should show the time
$t' = t$ at the instant of reflection.

From the fact that the speed of light is the same going to and fro it follows
that the above defined property of being synchronous is a reciprocal one of the
two clocks: if clock B is synchronous with clock A, then clock A is also
synchronous with clock B. Further, since Euclidean geometry is valid and the
speed of light is the same for any direction of travel (both these facts are implied
in the equation for the propagation of the front of a light wave), it follows that,
if clock A is synchronous with several clocks B, C, D, ..., all these clocks are
mutually synchronous.
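
Einstein's rule lends itself to a very short numerical illustration. In the sketch below the distance and the clock readings are hypothetical values, and `clock_correction` is an illustrative helper for the amount by which a reading of clock B would have to be adjusted; only the rule t' = (t_1 + t_2)/2 is taken from the text.

```python
import math

C = 299_792_458.0   # speed of light in m/s

def reflection_time(t1, t2):
    """Time to be shown by clock B at the instant of reflection: (t_1 + t_2)/2."""
    return 0.5 * (t1 + t2)

def clock_correction(reading_b, t1, t2):
    """Amount to add to clock B so that it becomes synchronous with clock A."""
    return reflection_time(t1, t2) - reading_b

r = 1.5e9                      # hypothetical distance from A to B, in metres
t1 = 100.0                     # emission time by clock A, in seconds
t2 = t1 + 2.0 * r / C          # return time by clock A

t_reflect = reflection_time(t1, t2)
correction = clock_correction(t_reflect + 0.25, t1, t2)   # B runs 0.25 s ahead
```

The value `t_reflect` coincides with both t_1 + r/c and t_2 − r/c, which is the content of the two equations above.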

This last fact makes it possible to envisage as a model of an inertial reference
frame a rigid scaffolding having at each of its junctions a clock, with all clocks
synchronous. Using such a model the motion of any body relative to the inertial
frame can be described as follows: let the body move in turn past the clocks
that have spatial coordinates $(x_1, y_1, z_1)$, $(x_2, y_2, z_2)$, etc., with the clock at
$(x_i, y_i, z_i)$ showing time $t_i$ at the instant of the body’s passage. Then the coordinates
x(t), y(t) and z(t) of the body will be such functions of time as to have the values
$(x_i, y_i, z_i)$ at the times $t_i$.

For all its unwieldiness such a model can be of use and is often discussed in
treatments of the Theory of Relativity. However, we prefer the model of a
radar station because it allows a continuous determination both of the position
of a moving body and of the corresponding time, by the station clock, a determination
based on a single point, the position of the station. Because the speed
of light is independent of the velocity of the source both models give the same
values of space and time coordinates; the difference between them is merely
that the radar model does not need any advance determination of distance, or
synchronization of clocks, but instead, so to speak, performs these functions as
the motion proceeds.

The radar model is the more flexible one and retains its intuitive value even 
in cases when the rigid scaffolding is quite inappropriate, for instance, when 
discussing astronomical distances. In addition, the radar model is one that 
permits of generalization. 

The above exposition of Einstein’s method of synchronization by light signals, 
with due account of retardation, appears so natural that one might think at 
first sight that it contains nothing characteristic of the Theory of Relativity. 
However, this is not so. The method indicated contains a definition of simultaneity
at different points in space from the standpoint of a given inertial
system. This definition rests on the laws of the Theory of Relativity and is 
not arbitrary, but, on the contrary, is the only rational one from the point of 
view of this theory.

In pre-relativistic physics it was assumed as self-evident that a single universal
time exists and therefore it was tacitly supposed that there is no need to define 
simultaneity at different points in space. Consequently it was assumed that 
any method of synchronization, for instance by transport of clocks instead of 
by light signals, would give the same result. In fact this is not so. 

We shall show later that the Theory of Relativity predicts that if clock A
is synchronized with clock B by light signals and a chronometer C is checked
at A with the clock there and then transported to point B, its reading when
at B will not agree with that of clock B, even assuming ideal working of the
chronometer. The reading of C on arrival will depend on the speed of its transport and
will coincide with that of clock B only in the limit in which this speed is infinitely small.
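
The size of this disagreement can be illustrated with a short sketch. It assumes, anticipating the result of Section 14, that a chronometer carried with constant speed v over a distance L advances by (L/v)·sqrt(1 - v²/c²) while the clocks of the frame advance by L/v; the function name `transport_lag` and the sample numbers are hypothetical.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def transport_lag(distance, speed, c=C):
    """Amount by which the transported chronometer lags clock B on arrival."""
    beta2 = (speed / c) ** 2
    root = math.sqrt(1.0 - beta2)
    # 1 - root is written as beta2 / (1 + root) to avoid cancellation at low speed
    return (distance / speed) * beta2 / (1.0 + root)


# The lag shrinks as the transport is made slower
for v in (0.5 * C, 3.0e4, 30.0):
    print(v, transport_lag(3.0e8, v))
```

For small speeds the lag behaves like Lv/(2c²), so it tends to zero with v even though the duration of the journey grows without limit.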

We shall return to this question in the next section. Here we wished to stress 
that even such a simple physical concept as simultaneity in a single inertial 
frame requires precise definition, and that all methods employed to measure 
the corresponding physical quantity must be in accord with this definition. 

12. Time Sequence of Events in Different Reference Frames 

The Lorentz transformations contain the formal basis of the views on space 
and time that follow from the Theory of Relativity. 

Using these transformations we now consider the question of the time 
sequence of events in different reference frames. By “ event ” we mean an 
instantaneous occurrence that can be characterized by a point in space and a 
corresponding moment of time. 

To have a concrete picture, we shall imagine that the “events” consist of
the instantaneous flashing of light signals. Let the first flash happen at time
$t_1$ at a point with coordinates $x_1$, $y_1$ and $z_1$ and the second at time $t_2$ at the
point $(x_2, y_2, z_2)$. Introducing ordinary three-dimensional vector notation we
can characterize the place and time of the first event by the symbol $(\mathbf{r}_1, t_1)$
and those of the second by $(\mathbf{r}_2, t_2)$.

We first ask: which of the two flashes occurs first? The answer will be
unambiguous if the light from one of the flashes has time to arrive at the place
of the other before the latter occurs. If

$$t_2 - t_1 > \frac{1}{c}\,|\mathbf{r}_2 - \mathbf{r}_1| \tag{12.01}$$

the first flash is incontestably earlier than the second, and of course if

$$t_2 - t_1 < -\frac{1}{c}\,|\mathbf{r}_2 - \mathbf{r}_1| \tag{12.02}$$

the flash $(\mathbf{r}_1, t_1)$ certainly takes place later than $(\mathbf{r}_2, t_2)$.

We shall talk of pairs of events for which either (12.01) or (12.02) is true as
being in time sequence. In the case (12.01) we shall say that the second happening
is absolutely later than the first and that, if (12.02) holds, it is absolutely
earlier. In both cases one has

$$c^2 (t_2 - t_1)^2 - (\mathbf{r}_2 - \mathbf{r}_1)^2 > 0 \tag{12.03}$$

The real positive quantity

$$T = \frac{1}{c}\sqrt{c^2 (t_2 - t_1)^2 - (\mathbf{r}_2 - \mathbf{r}_1)^2} \tag{12.04}$$

is called the time-like interval between the two events. The interval between
two events in time sequence is independent of the reference frame because the 
expression under the root in (12.04) is invariant under Lorentz transformations. 

By the condition $\partial t'/\partial t > 0$, which expresses the conservation of the direction
of time (see (10.16)), the relations (12.01) and (12.02) are also independent of 
the frame. This must be so because the question whether a signal has or has 
not arrived at the position of the other event is a physical one and cannot 
depend on any reference frame. Thus the notions “ absolutely earlier ” or 
“ absolutely later ”, applied to events in time sequence, are invariant. 

Let us assume now that the light from one flash does not reach the place of 
the other flash before the latter occurs. 

Then inequalities opposite to (12.01) and (12.02) will hold:

$$-\frac{1}{c}\,|\mathbf{r}_2 - \mathbf{r}_1| < t_2 - t_1 < \frac{1}{c}\,|\mathbf{r}_2 - \mathbf{r}_1| \tag{12.05}$$

Pairs of events for which the inequality (12.05) is true will be called quasi-simultaneous.
This name is justified by the fact that in this case the notions
“earlier” and “later” become relative ones: one may find $t_2 - t_1 > 0$ in
one reference frame and $t_2 - t_1 < 0$ in another. The question as to which
flash happened first has now no unique answer.

Quasi-simultaneous events can be characterized by the invariant inequality

$$c^2 (t_2 - t_1)^2 - (\mathbf{r}_2 - \mathbf{r}_1)^2 < 0 \tag{12.06}$$

which follows from (12.05). The two relations, (12.05) and (12.06), are equivalent
and, therefore, (12.05) is also invariant. We shall call the real positive
quantity

$$R = \sqrt{(\mathbf{r}_2 - \mathbf{r}_1)^2 - c^2 (t_2 - t_1)^2} \tag{12.07}$$

the space-like interval between two quasi-simultaneous events. 
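
The classification into events in time sequence and quasi-simultaneous events can be written as a short routine. This is a minimal sketch under the definitions (12.01) to (12.07); the function name `classify`, the choice of units with c = 1 by default, and the handling of the boundary case (which the text does not discuss) are assumptions made for illustration.

```python
import math


def classify(dt, dr, c=1.0):
    """Classify a pair of events from dt = t2 - t1 and dr = |r2 - r1|.

    Returns (label, interval): the time-like interval T of (12.04) for
    events in time sequence, the space-like interval R of (12.07) for
    quasi-simultaneous events.
    """
    s2 = (c * dt) ** 2 - dr ** 2  # the invariant expression of (12.03) and (12.06)
    if s2 > 0:
        label = "absolutely later" if dt > 0 else "absolutely earlier"
        return label, math.sqrt(s2) / c
    if s2 < 0:
        return "quasi-simultaneous", math.sqrt(-s2)
    # Boundary case: the light signal arrives exactly as the other flash occurs
    return "boundary", 0.0


print(classify(5.0, 3.0))
print(classify(-5.0, 3.0))
print(classify(3.0, 5.0))
```

The label refers to the second event relative to the first, as in the text.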

We shall now show that if two events are quasi-simultaneous it is always 
possible to choose a frame of reference such that the events are simultaneous 
in it, and that if two events are in time sequence a frame can be found in which 
they occur at the same point in space. 

We consider two quasi-simultaneous events with a given value for the time
difference $t_2 - t_1$, which must, of course, satisfy (12.05). We introduce a new,
primed, reference frame moving with velocity $\mathbf{V}$ relative to the first frame.
According to the transformation of time given by (10.30), the value of the
difference $t'_2 - t'_1$ in the new frame will be

$$t'_2 - t'_1 = \frac{(t_2 - t_1) - \dfrac{1}{c^2}\,\mathbf{V}\cdot(\mathbf{r}_2 - \mathbf{r}_1)}{\sqrt{1 - V^2/c^2}} \tag{12.08}$$


The relative velocity $\mathbf{V}$ can be so chosen as to make the two events simultaneous
in the new frame. To do this, we merely have to put

$$\mathbf{V} = \frac{c^2 (t_2 - t_1)}{|\mathbf{r}_2 - \mathbf{r}_1|^2}\,(\mathbf{r}_2 - \mathbf{r}_1) \tag{12.09}$$

By virtue of (12.06) we have

$$V^2 = \frac{c^4 (t_2 - t_1)^2}{|\mathbf{r}_2 - \mathbf{r}_1|^2} < c^2 \tag{12.10}$$

so that the required relative speed is less than the speed of light, as is required
for a Lorentz transformation.

To calculate the difference of the spatial coordinates of the two quasi-simultaneous
events in the system in which they are actually simultaneous we write
the equation (10.32) for the Lorentz transformation in the vectorial form:

$$\mathbf{r}' = \mathbf{r} - \mathbf{V}t + \left(\frac{1}{\sqrt{1 - V^2/c^2}} - 1\right)\frac{\mathbf{V}}{V^2}\left(\mathbf{V}\cdot\mathbf{r} - V^2 t\right) \tag{12.11}$$

Inserting the difference $t_2 - t_1$ for $t$, the difference $\mathbf{r}_2 - \mathbf{r}_1$ for $\mathbf{r}$ and expression
(12.09) for $\mathbf{V}$ we obtain for $\mathbf{r}' = \mathbf{r}'_2 - \mathbf{r}'_1$ the expression

$$\mathbf{r}'_2 - \mathbf{r}'_1 = (\mathbf{r}_2 - \mathbf{r}_1)\sqrt{1 - \frac{c^2 (t_2 - t_1)^2}{(\mathbf{r}_2 - \mathbf{r}_1)^2}} \tag{12.12}$$

In the primed system the spatial distance is thus seen to be

$$|\mathbf{r}'_2 - \mathbf{r}'_1| = R \tag{12.13}$$

where $R$ is the space-like interval defined in (12.07). Equation (12.13) also
follows directly from the invariance of the expression (12.07) for $R$ and the
condition $t'_2 - t'_1 = 0$.

Thus the physical significance of the space-like interval between two quasi-simultaneous
events is that it is their spatial distance in the frame in which
they are simultaneous.
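
This statement can be checked numerically. The sketch below assumes the vectorial transformation in the form (12.08) and (12.11), applied to the differences of the coordinates of two events, and units with c = 1; the helper names `dot` and `boost` and the sample values are hypothetical.

```python
import math


def dot(a, b):
    """Scalar product of two three-component vectors given as lists."""
    return sum(x * y for x, y in zip(a, b))


def boost(dt, dr, V, c=1.0):
    """Apply (12.08) and (12.11) to dt = t2 - t1 and dr = r2 - r1 (V must be non-zero)."""
    V2 = dot(V, V)
    gamma = 1.0 / math.sqrt(1.0 - V2 / c**2)
    dt_p = gamma * (dt - dot(V, dr) / c**2)
    k = (gamma - 1.0) * (dot(V, dr) - V2 * dt) / V2
    dr_p = [r - v * dt + k * v for r, v in zip(dr, V)]
    return dt_p, dr_p


c = 1.0
dt, dr = 3.0, [4.0, 3.0, 0.0]  # |dr| = 5 > c*dt, so the events are quasi-simultaneous
V = [c**2 * dt * x / dot(dr, dr) for x in dr]  # the choice (12.09)
dt_p, dr_p = boost(dt, dr, V, c)
R = math.sqrt(dot(dr, dr) - (c * dt) ** 2)  # the space-like interval (12.07)
print(dt_p, math.sqrt(dot(dr_p, dr_p)), R)
```

Up to rounding, the primed time difference vanishes and the primed distance agrees with R.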

Let us now consider two events in time sequence; for these either (12.01)
or (12.02) must hold. We shall show that here a new, primed, frame can be
introduced in which the spatial coordinates of both events coincide. To see
this it is sufficient to introduce the difference $t_2 - t_1$ instead of $t$, and $\mathbf{r}_2 - \mathbf{r}_1$
instead of $\mathbf{r}$ into (12.11) and to put

$$\mathbf{V} = \frac{\mathbf{r}_2 - \mathbf{r}_1}{t_2 - t_1} \tag{12.14}$$

Then the vector $\mathbf{r}' = \mathbf{r}'_2 - \mathbf{r}'_1$ vanishes, hence

$$\mathbf{r}'_2 = \mathbf{r}'_1 \tag{12.15}$$

Because of the inequality (12.03) the magnitude of the velocity $\mathbf{V}$ is less than
the speed of light

$$V^2 < c^2 \tag{12.16}$$

as it should be.

In the new frame the time difference between the two events is, by (12.08):

$$t'_2 - t'_1 = (t_2 - t_1)\sqrt{1 - \frac{V^2}{c^2}} \tag{12.17}$$

Hence, if $t_2 - t_1 > 0$, we get

$$t'_2 - t'_1 = T \tag{12.18}$$

where $T$ is the time-like interval (12.04).

In this way the physical significance of the time-like interval is put in evidence. 
It is the time elapsed between two events in time sequence, measured in that 
frame in which both events occur at the same point in space. This reference 
frame can be very directly visualized. It consists of, say, a clock moving 
uniformly in a straight line from the place of the first event to that of the second 
with such a speed as to be exactly at the first point when the event there occurs
and likewise at the second point at the time of its event. 
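
The moving clock just described gives a direct way of computing the time-like interval. The following sketch assumes one space dimension, units with c = 1 by default, and a positive time difference; the function name `proper_interval` is hypothetical.

```python
import math


def proper_interval(dt, dr, c=1.0):
    """Time shown by a clock carried uniformly from event 1 to event 2 (dt > 0 assumed)."""
    V = dr / dt  # the choice (12.14) in one space dimension
    if abs(V) >= c:
        raise ValueError("the events are not in time sequence")
    return dt * math.sqrt(1.0 - (V / c) ** 2)  # equation (12.17)


dt, dr, c = 5.0, 3.0, 1.0
T = math.sqrt((c * dt) ** 2 - dr ** 2) / c  # the time-like interval (12.04)
print(proper_interval(dt, dr, c), T)
```

The two printed numbers agree, which is the content of (12.18).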

One should note that the relation of being in time sequence is transitive:
given that a second event occurs absolutely later than a first and a third absolutely
later than the second, it follows that the third is absolutely later than
the first. This property is physically obvious, but can also be proved as follows:
The two inequalities

$$t_2 - t_1 > \frac{1}{c}\,|\mathbf{r}_2 - \mathbf{r}_1|; \quad t_3 - t_2 > \frac{1}{c}\,|\mathbf{r}_3 - \mathbf{r}_2| \tag{12.19}$$

lead by addition to the third,

$$t_3 - t_1 > \frac{1}{c}\,|\mathbf{r}_3 - \mathbf{r}_1| \tag{12.20}$$

since the sum of two sides of a triangle is greater than the third side. On the 
other hand the relation of being quasi-simultaneous is not transitive. If two 
events are quasi-simultaneous and a third is quasi-simultaneous with one of 
them, it may also be quasi-simultaneous with the other, but it could also be in 
time sequence with it, i.e. happen absolutely earlier or absolutely later. 
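
A concrete triple of events shows the failure of transitivity. This is only an illustrative sketch in one space dimension with c = 1; the function name `relation` and the three sample events are hypothetical, and the boundary case of a light signal that just arrives is lumped with the quasi-simultaneous case for brevity.

```python
def relation(e1, e2, c=1.0):
    """Return the relation between two events given as (t, x) pairs."""
    dt = e2[0] - e1[0]
    dx = e2[1] - e1[1]
    # Time sequence requires c^2 dt^2 - dx^2 > 0, see (12.03)
    return "time sequence" if (c * dt) ** 2 > dx ** 2 else "quasi-simultaneous"


A = (0.0, 0.0)
B = (0.0, 10.0)
C = (5.0, 0.0)
print(relation(A, B), relation(B, C), relation(A, C))
```

Here B is quasi-simultaneous with both A and C, while A and C are in time sequence with each other.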

Let us summarize. In the theory of relativity one divides events according 
to their instants of occurrence into quasi-simultaneous events and events in 
time sequence; this distinction does not depend on any particular reference 
frame. Two events in time sequence possess an invariant time-like interval, 
equal to their time difference in a certain reference frame. Two quasi-simultaneous
events possess an invariant space-like interval equal to their spatial
distance in a certain frame. 

This division in Relativity of pairs of events into two classes agrees with the 
notion of causality and makes it more precise. Given two quasi-simultaneous 
events neither can be the immediate cause or effect of the other. (Of course, 
they may both have as a common cause some third event that occurs absolutely 
earlier than either.) Only pairs of events in time sequence can be directly causally
related; the one that occurs absolutely earlier may (but, of course, need not)
be the cause of the other. 

The concepts we have introduced are natural generalizations of the notions 
of pre-relativistic physics. The older ideas were not in accord with the fact 
that a finite limit exists for the speed of propagation of all kinds of action, the 
finite speed of light. They were not defined precisely enough and therefore gave 
rise to paradoxes. The new scheme of ideas incorporates the finiteness of the 
speed of light and so removes the paradoxes. 

In older physics the idea of simultaneity was introduced a priori, without 
experimental justification. We shall consider one of the paradoxes to which 
the idea led. Assume we have two reference frames in uniform rectilinear 
relative motion and such that at a certain instant their origins coincide. Let a 
flash of light occur at that instant at the common origin. We consider the 
motion of the wave front from the point of view of the two frames. Seen from 
the first frame the wave front at any instant of time is a sphere centred at the 
origin of this frame ; but the same can be asserted with reference to the second 
frame—at any moment the wave front is a sphere centred at the other origin. 
The two origins only coincide at one instant; later they move apart, but a sphere
can only have one centre. This discussion uses, on the one hand, the Principle
of Relativity, i.e. that in two frames in uniform rectilinear relative motion all 
events occur in the same manner, on the other the principle that the speed of 
light is independent of the motion of the source—a consequence of the idea of 
a field. Both these principles are quite incontrovertible. How then is the 
paradox resolved ? 

The resolution is to be found in the fact that the words “ at any instant of 
time ” have a different significance when applied to one or the other frame. 
The paradox arises only if the phrase is taken to mean “ at any instant of a 
unique absolute time, common to both frames ”. The existence of such a 
unique absolute time was taken for granted in pre-relativistic physics; it is
denied in Relativity. We have learnt that in Relativity time in one frame is
not the same as time in another. In each frame the wave front is defined as
the locus of all points reached by the light disturbance simultaneously with
respect to the given frame. Since events that are simultaneous in one frame are
not so in any other, the wave front in the one frame will consist of points different
from those of the front in the other frame. If we make the picture that the wave
propagation takes place in some rarefied medium like a gas the wave fronts in
the two systems will cover different sets of particles. Since we have two 
different wave fronts, it is not surprising that they have different centres. 
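
The resolution can be made concrete with two points of the wave front. The sketch below assumes the one-dimensional Lorentz transformation for motion along the x-axis and units with c = 1; the function name `lorentz` and the sample values are hypothetical.

```python
import math


def lorentz(t, x, v, c=1.0):
    """One-dimensional Lorentz transformation: return (t', x')."""
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return g * (t - v * x / c**2), g * (x - v * t)


c, v, t = 1.0, 0.6, 1.0
# Two points of the front, simultaneous in the first frame
front = [(t, +c * t), (t, -c * t)]
for tp, xp in (lorentz(te, xe, v, c) for te, xe in front):
    # Each event still lies on the light front |x'| = c t' of the second frame,
    # but the two events carry different values of t'
    print(tp, xp, abs(xp) - c * tp)
```

Both events satisfy the wave-front equation in the second frame, yet they belong to fronts taken at different primed times, so there is no single sphere with two centres.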

This resolution of the paradox shows up vividly the logical necessity of 
renouncing the ideas of “ absolute time ” and “ absolute simultaneity ”. 

Half a century ago when the Theory of Relativity was just emerging, this 
renunciation appeared to many as wellnigh unacceptable and it required great 
scientific boldness on the part of Einstein, who created the theory, to convince 
himself of its necessity. Today we accept the renunciation much more readily. The 
everyday notion of simultaneity is covered by the concept of quasi-simultaneity 
while in scientific matters, where we deal with great distances or great speeds, the 
necessity for a more precise definition of time appears quite natural. 

13. Comparison of Time Differences in Moving Reference Frames. The Doppler Effect

Let us assume that a certain frame of reference or base is given, which is 
taken to be at rest. We observe from this base the development of some process 
occurring in a moving system. As such a process we could take the uniform 
running of a clock attached to a moving frame of reference. To follow the 
behaviour of the moving clock one could arrange for it to emit a light signal 
every second, or one might be able simply to observe its dial from a distance. 
In addition the base needs to be equipped with some means, say radar, to 
record continually the distance of the clock from base. We can then observe 
with a clock at the base the times of arrival of the light flashes from the moving 
clock and correct each of these recorded times for the time of travel of the 
signal. In this way we can determine the times $t_n$ at which the supposedly
second-by-second flashes were emitted. The correction mentioned is very easy
to calculate.

Let $r(t)$ be the distance to the moving clock as measured from the base,
expressed as a function of base time. Then $t_n$ is obtained from the directly
observed arrival time $t_n^*$ by the equation

$$t_n + \frac{r(t_n)}{c} = t_n^* \tag{13.01}$$

The question is now whether the times $t_n$ will be spaced at intervals of just one
second. In other words, will the calculated times $t_n$ coincide with the seconds
of a clock at the base identical with the moving clock under observation?

The equations of the Lorentz transformation answer this question. Assume
that the clock is moving away from the base along a straight line passing
through the origin of the base coordinates. (We take it that the signals are
being received at the origin.) Taking this straight line as the x-axis and
assuming that the moving clock was at the origin at $t = 0$, the position of the
clock is given by the equations

$$x = vt; \quad y = 0; \quad z = 0 \tag{13.02}$$

and $x$ is just the distance of the clock from the base. Using equation (13.01)
we see that the signal received at the base at time $t^*$ was emitted at the instant

$$t = \frac{t^*}{1 + v/c} \tag{13.03}$$

reckoned by the base clock. Using the equations of the Lorentz transformation
we now introduce a reference frame fixed to the moving clock:

$$x' = \frac{x - vt}{\sqrt{1 - v^2/c^2}}; \quad t' = \frac{t - vx/c^2}{\sqrt{1 - v^2/c^2}} \tag{13.04}$$

As we know, the equations inverse to these are:

$$x = \frac{x' + vt'}{\sqrt{1 - v^2/c^2}}; \quad t = \frac{t' + vx'/c^2}{\sqrt{1 - v^2/c^2}} \tag{13.05}$$


In the primed system the position of the clock that emits the signals will be
$x' = 0$, $y' = 0$, $z' = 0$ and the instants of signal emission are

$$t'_n = n\tau \tag{13.06}$$

where the constant $\tau$ depends only on the structure of the clock, not on its
motion; it equals one second in the example of the text. Therefore we have

$$t_n = \frac{n\tau}{\sqrt{1 - v^2/c^2}} \tag{13.07}$$

On the other hand an identical clock at the base would record the times $n\tau$.
Therefore if the moving clock is observed from the base it would appear to be
running slow compared to the base clock. The emission times that are observed at
base will be not $\tau$ (one second) apart, but less frequent, the spacing being given by

$$\Delta t = \frac{\tau}{\sqrt{1 - v^2/c^2}} \tag{13.08}$$


One should remember that this expression applies after correcting for the finite
speed of the signal with the aid of (13.01). The actual instants at which the
signals are observed to arrive at the base are given by

$$t_n^* = \left(1 + \frac{v}{c}\right) t_n \tag{13.09}$$

so that their difference is

$$\Delta t^* = \tau \sqrt{\frac{c + v}{c - v}} \tag{13.10}$$
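
The chain of steps leading to (13.10) can be followed numerically. This is a minimal sketch for a clock receding along the x-axis, assuming units with c = 1 and sample values chosen only for illustration; the function name `arrival_times` is hypothetical.

```python
import math


def arrival_times(n_max, tau, v, c=1.0):
    """Arrival times at the base of the signals n = 0..n_max from a receding clock."""
    root = math.sqrt(1.0 - (v / c) ** 2)
    times = []
    for n in range(n_max + 1):
        t_n = n * tau / root  # emission time in base time, equation (13.07)
        times.append(t_n + v * t_n / c)  # add the travel time x/c, equation (13.01)
    return times


tau, v, c = 1.0, 0.6, 1.0
ts = arrival_times(4, tau, v, c)
spacings = [b - a for a, b in zip(ts, ts[1:])]
print(spacings, tau * math.sqrt((c + v) / (c - v)))
```

The spacing of the arrivals exceeds the corrected spacing (13.08), which in turn exceeds the period of the clock in its own frame.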


To make the discussion simple we have dealt with the case of the clock moving
directly away from the observer (i.e. from the device registering the light
signals). We now consider the more general case that the trajectory of the
clock does not pass through the base. Let the receiving device still be situated
at the origin, but let the moving clock coordinates now be given by the equations

$$x = vt; \quad y = y_0; \quad z = z_0 \tag{13.11}$$

in the base frame. We may still put

$$x' = 0; \quad y' = 0; \quad z' = 0 \tag{13.12}$$

in the moving frame.

As before one passes from one frame to the other by means of the Lorentz
transformation (13.04) or (13.05), the only difference being a shift $y - y' = y_0$,
$z - z' = z_0$ along the axes of $y$ and $z$. Therefore, the connection between $t'$ and $t$
and thus also (13.08), giving the rate of the moving clock, remain unchanged. A
change appears only in the connection between $t^*$ and $t$, i.e. between the instant
of reception and the instant of emission. We shall now have instead of (13.09)

$$t_n^* = t_n + \frac{1}{c}\sqrt{v^2 t_n^2 + y_0^2 + z_0^2} \tag{13.13}$$


If we take the time difference $\tau$ to be small we can put

$$\Delta t^* = \frac{dt^*}{dt}\,\Delta t = \left(1 + \frac{v_r}{c}\right)\Delta t \tag{13.14}$$

where

$$v_r = \frac{dr}{dt} = \frac{v^2 t}{\sqrt{v^2 t^2 + y_0^2 + z_0^2}} \tag{13.15}$$

is the radial component of the clock velocity at the instant the signal is emitted.
From (13.08) and (13.14) we get

$$\Delta t^* = \tau\,\frac{c + v_r}{\sqrt{c^2 - v^2}} \tag{13.16}$$

This replaces (13.10) in the general case.

The fact that $\Delta t^*$ and $\tau$ are different is an expression of the well-known
phenomenon of the Doppler effect, which is the following: if a periodic process
of period $\tau$ takes place in a moving frame then the period $\Delta t^*$ registered at some
point in the rest frame proves to be greater than $\tau$, if the motion of the system
is away from the point and less if the motion is towards it. Equation (13.16)
gives the relativistic expression for the Doppler effect in the case of light
propagating in vacuo. The classical Doppler formula is obtained if no distinction
is made between $t'$ and $t$ and $\Delta t$ is replaced by $\tau$ in (13.14). This
introduces a relative error of order $v^2/c^2$.
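
The relativistic and classical expressions can be compared numerically. The sketch below assumes the trajectory (13.11), units with c = 1 by default, and sample values chosen only for illustration; the function names `observed_period` and `classical_period` are hypothetical.

```python
import math


def observed_period(tau, v, t, y0, z0, c=1.0):
    """Period registered at the origin according to (13.16).

    t is the emission time in the base frame; the clock is then at (v*t, y0, z0),
    which must not coincide with the origin.
    """
    r = math.sqrt((v * t) ** 2 + y0 ** 2 + z0 ** 2)
    v_r = v * v * t / r  # radial velocity, equation (13.15)
    return tau * (c + v_r) / math.sqrt(c**2 - v**2)


def classical_period(tau, v, t, y0, z0, c=1.0):
    """Classical formula: (13.14) with the corrected spacing replaced by tau."""
    r = math.sqrt((v * t) ** 2 + y0 ** 2 + z0 ** 2)
    return tau * (1.0 + v * v * t / (r * c))


v = 0.1
rel = observed_period(1.0, v, 5.0, 1.0, 0.0)
cls = classical_period(1.0, v, 5.0, 1.0, 0.0)
print(rel, cls, rel / cls - 1.0, 0.5 * v**2)
```

The ratio of the two periods is 1/sqrt(1 - v²/c²) whatever the direction of motion, so the relative error of the classical formula is about v²/(2c²) for small v.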

One might remark that historically the first determination of the speed of 
light in empty space was performed in 1675 by Olaf Roemer using a method 
essentially based on the Doppler effect. The system of Jupiter’s satellites 
served as the moving clock and the periodic process observed consisted of their 
eclipses. During roughly half of its orbital motion the Earth approaches
Jupiter, so that $v_r < 0$, and for the other half of the time it moves away and
$v_r > 0$. Using the simplified expression (13.14) instead of (13.16) one can take
for $\Delta t$ the annual mean of the observed time between eclipses. On that part of
the Earth’s orbit on which $v_r < 0$, the time $\Delta t^*$ will be less than $\Delta t$ and the
eclipses will be observed ahead of the expected time, while it will be greater
on that part where $v_r > 0$ and the eclipses will then come late. Thus the relation
(13.14) allows a calculation of the speed of light $c$. In this way Roemer found a
value of $3.1 \times 10^{10}$ cm sec$^{-1}$, very close to the now accepted value of $3.0 \times 10^{10}$
cm sec$^{-1}$, which is obtained by much more accurate methods.

Let us stress once again that in the theory of the Doppler effect two considerations
are taken into account: firstly, the connection between the instant
$t^*$ when the light signal arrives, and the instant $t$ of its emission, and, secondly,
the relation between the time $t$ in the frame in which the signal is registered
and the time $t'$ in the frame moving with the emitting body. The first of these
considerations, passing from $t^*$ to $t$, introduces a factor depending on the radial
component of the velocity; it was already known in pre-relativistic theory.
The second consideration, transforming from $t$ to $t'$, introduces a factor depending
on the magnitude of the velocity; it arises from the Lorentz transformation
and is specific to the Theory of Relativity.

14. Comparison of Clock Readings in Moving Reference Frames 

In the preceding section we examined a method of comparing time differences 
in different reference frames which relied on the use of light signals and took 
due account of their time of travel. However, it is also possible, in principle, to 
use another method in which one only compares the readings of clocks when, 
in the course of their motion, they are in immediate proximity of each other. 

We imagine a row of clocks spaced out along a straight line and all belonging 
to the same frame. They are to be at rest in this frame and are supposed 
synchronized from the beginning. Suppose a clock A moves past the row of 
clocks. To obtain the rate of clock A it is sufficient to compare at any instant 
the reading of A with the reading of that base clock which A happens to be 
passing at the moment in question. Clearly it will be enough to consider two 
clocks in the base, so that we can think of the latter as formed of two clocks 
joined by a rigid bar. 

Referred to the base, the coordinates of the two clocks at rest and of the
moving clock A will be

$$\begin{aligned} x &= a && \text{(first base clock)} \\ x &= b && \text{(second base clock)} \\ x &= vt && \text{(clock A)} \end{aligned} \tag{14.01}$$

Using the Lorentz transformation formulae (13.04) and (13.05) we can introduce
a reference frame connected with clock A. In it the coordinates of the same
three clocks will be

$$\begin{aligned} x' &= -vt' + a' && \text{(first base clock)} \\ x' &= -vt' + b' && \text{(second base clock)} \\ x' &= 0 && \text{(clock A)} \end{aligned} \tag{14.02}$$

with the constants $a'$ and $b'$ related to the $a$ and $b$ of (14.01) by

$$a' = a\sqrt{1 - v^2/c^2}; \quad b' = b\sqrt{1 - v^2/c^2} \tag{14.03}$$

Without making any choice between the two reference frames we can register
the following objective facts: first, the readings of clock A and of the first
base clock at the instant when they are coincident and, second, the readings of A
and the second base clock when in turn they coincide. We denote the two
described readings of A by $t'_1$ and $t'_2$ and by $t_1$ and $t_2$ the corresponding readings
of the first and second base clocks respectively. Then we get, using (14.01)
and (14.02)

$$t_1 = \frac{a}{v}; \quad t_2 = \frac{b}{v}; \quad t'_1 = \frac{a'}{v}; \quad t'_2 = \frac{b'}{v} \tag{14.04}$$

whereby, owing to (14.03)

$$t'_1 = t_1\sqrt{1 - v^2/c^2}; \quad t'_2 = t_2\sqrt{1 - v^2/c^2} \tag{14.05}$$


These results enable us to check, on the one hand, the rate of clock A observed 
from the base, and on the other hand, the rate of the base clocks observed from 
a frame fixed to A. We mean by the rate of a periodic process as observed from 
a particular frame, the rate expressed in terms of that time variable which is 
defined by the synchronization appropriate to the frame. 

The quantities $t_1$ and $t_2$ are readings of different base clocks, but as the two
clocks are synchronized in the base frame, we can maintain that when the
second clock read $t_2$, so did the first, using the word “when” in the sense of
that particular synchronization. Therefore the difference $t_2 - t_1$ is just the time
elapsed in the base frame reckoning during which the reading of clock A
advanced by $t'_2 - t'_1$. Thus the rate of clock A observed from the base is
determined by the equation

$$t'_2 - t'_1 = (t_2 - t_1)\sqrt{1 - v^2/c^2} \tag{14.06}$$

Since $|t'_2 - t'_1| < |t_2 - t_1|$, clock A will be slow to an observer at the base.
Putting $t'_2 - t'_1 = \tau$ and $t_2 - t_1 = \Delta t$, our equation coincides with (13.08).

We now consider the rate of a base clock observed from a frame attached to A.
To check this rate it is now essential to observe the readings of one and the same
clock at base, either the first or the second. For definiteness let us consider the
second. We have only one direct reading for it, namely the one for the instant
when it was coincident with clock A, and then it read $t_2$, while clock A read $t'_2$.
The other reading of the second clock must be calculated from the available data.
We therefore ask: where was the second clock and what was its reading when
the first clock was coincident with clock A? It is essential to remember that
the words “where” and “when” in the above must now be used in the sense
appropriate to the frame of A, the primed frame. The question is readily
answered. When the first clock and A coincided, the latter indicated the time
$t'_1 = a'/v$, and then, according to (14.02) the second clock was at the point
$x' = b' - a'$. The reading of the second clock is then obtained by inserting
$t' = t'_1$ and $x' = b' - a'$ into the Lorentz transformation (13.05).
We get

$$t = \frac{t'_1 + (v/c^2)(b' - a')}{\sqrt{1 - v^2/c^2}} = t_1 + \frac{v}{c^2}(b - a) \tag{14.07}$$

or

$$t = t_1 + (t_2 - t_1)\frac{v^2}{c^2} \tag{14.08}$$

(The reading of the second clock now does not agree with the reading of the
first, which is $t_1$, because now simultaneity is understood in the frame of A
and not of the base.) From the reading of the second clock, as given by (14.08),
together with the directly observed reading $t_2$, one finds the rate of the second
clock in the frame of A. We have

$$t_2 - t = (t_2 - t_1)\left(1 - \frac{v^2}{c^2}\right) \tag{14.09}$$

whence, finally,

$$t_2 - t = (t'_2 - t'_1)\sqrt{1 - \frac{v^2}{c^2}} \tag{14.10}$$

Thus in the frame of A the clocks at base are again slow ; we have complete 
reciprocity between the two frames. 
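
The reciprocity can be verified with sample numbers. The following sketch retraces the steps (14.03) to (14.10), assuming units with c = 1 and arbitrarily chosen positions of the base clocks; all variable names are hypothetical.

```python
import math

c, v = 1.0, 0.6
a, b = 3.0, 9.0  # positions of the two base clocks, (14.01)
root = math.sqrt(1.0 - (v / c) ** 2)
a_p, b_p = a * root, b * root  # the constants of (14.03)

t1, t2 = a / v, b / v  # base-clock readings at the two meetings, (14.04)
t1_p, t2_p = a_p / v, b_p / v  # readings of clock A at the same meetings

# Reading of the second base clock that is simultaneous, in the frame of A,
# with the first meeting: insert t' = t1', x' = b' - a' into (13.05)
t = (t1_p + v * (b_p - a_p) / c**2) / root  # equation (14.07)

rate_A_seen_from_base = (t2_p - t1_p) / (t2 - t1)  # equation (14.06)
rate_base_seen_from_A = (t2 - t) / (t2_p - t1_p)  # equation (14.10)
print(rate_A_seen_from_base, rate_base_seen_from_A, root)
```

Both rates come out equal to sqrt(1 - v²/c²): each frame finds the clocks of the other to be slow by the same factor.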

It is sometimes said that time passes more slowly in the moving frame than 
in the frame at rest. As the Principle of Relativity allows one to interchange 
the roles of the two frames, this formulation is contradictory. 

The nature of the kind of misunderstanding here involved can best be clarified 
by means of a mathematical illustration which has a direct bearing on our 
question. We saw in Section 10 that in a Lorentz transformation

$$\frac{\partial t'}{\partial t} = a_{00} = \frac{1}{\sqrt{1 - v^2/c^2}} > 1 \tag{14.11}$$

but that for the reverse transformation also

$$\frac{\partial t}{\partial t'} = a_{00} = \frac{1}{\sqrt{1 - v^2/c^2}} > 1 \tag{14.12}$$

If one forgets that in (14.11) the differentiation takes place at constant $x$, $y$, $z$
and in (14.12) at constant $x'$, $y'$ and $z'$, it seems strange that $\partial t/\partial t'$ is not the
reciprocal of $\partial t'/\partial t$ but equal to it. It is clear, however, that there is in fact
no paradox.

Returning to the physical side of the matter one can say that the problem
in question is not concerned with “passage of time” in different frames but
with the description of some localized process within different frames. Let the
process be localized at a point at rest in the unprimed system so that $x$, $y$ and $z$
are constant. Then we conclude from $\partial t'/\partial t > 1$ that the duration, or period
of the process in “its own”, unprimed frame will be less than in any other
(primed) frame that is in motion relative to the former. If, however, the process
is localized at a point with constant coordinates $x'$, $y'$, $z'$ then its “own” or
“proper” system will be the primed one and we have $\partial t/\partial t' > 1$, but the
conclusion is the same.

If the duration of the process in its “proper” system was $d\tau$, then it will be
$dt > d\tau$ in another system moving with speed $V$ relative to the first, where

$$d\tau = \sqrt{1 - \frac{V^2}{c^2}}\,dt \tag{14.13}$$

Here $\mathbf{V}$ is the velocity that enters the Lorentz transformation which relates the
two frames. The magnitude of $\mathbf{V}$ is equal to that of $\mathbf{v}$, the velocity of motion of
the point at which the process is localized. The components of $\mathbf{v}$ are

$$v_x = \frac{dx}{dt}; \quad v_y = \frac{dy}{dt}; \quad v_z = \frac{dz}{dt} \tag{14.14}$$

We can therefore write, instead of (14.13)

$$d\tau = \sqrt{1 - \frac{v^2}{c^2}}\,dt = \frac{1}{c}\sqrt{c^2\,dt^2 - (dx^2 + dy^2 + dz^2)} \tag{14.15}$$


It is easy to see that this expression is invariant under Lorentz transformations.
The quantity $d\tau$ can be thought of as the differential of a “proper time” $\tau$
given by

$$\tau = \int_0^t \sqrt{1 - \frac{v^2}{c^2}}\,dt \tag{14.16}$$

where we have assumed that $\tau = 0$ at $t = 0$. If the velocity $\mathbf{v}$ is constant
then $\tau$ is the duration as measured in its “proper” frame of the process associated
with the moving point; hence the name “proper time”. If, however, $\mathbf{v}$
is a variable velocity the physical meaning of $\tau$ is not obvious. Even so it can
be looked upon as an auxiliary mathematical quantity which it is convenient
to use because of its invariance under Lorentz transformations. One still retains
the name “proper time” for it in the case of variable velocity $\mathbf{v}$.
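
The integral (14.16) is easily evaluated numerically for a prescribed motion. This is a minimal sketch assuming motion along a straight line, units with c = 1 by default, and a simple midpoint rule; the function name `proper_time` and the sample velocity laws are hypothetical.

```python
import math


def proper_time(v_of_t, t_end, c=1.0, steps=10_000):
    """Approximate the integral of sqrt(1 - v(t)^2/c^2) from 0 to t_end (midpoint rule)."""
    h = t_end / steps
    total = 0.0
    for k in range(steps):
        v = v_of_t((k + 0.5) * h)  # velocity at the midpoint of the k-th step
        total += math.sqrt(1.0 - (v / c) ** 2) * h
    return total


print(proper_time(lambda t: 0.6, 10.0))  # constant velocity
print(proper_time(lambda t: 0.8 * math.sin(t), 10.0))  # variable velocity
```

For constant velocity the result reduces to t·sqrt(1 - v²/c²); for variable velocity the sketch yields only the auxiliary mathematical quantity discussed above, whose physical interpretation requires the further hypothesis mentioned in the text.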

If a moving body, a clock, is in a gravitational field which is the cause of its 
acceleration, one can give a different expression for the time shown by this clock, 
an expression which in addition to the velocity v, also involves the gravitational 
potential (see (62.03)). The expression (14.16) will not be correct in this case. 
The correct expression may be justified in Einstein’s theory of gravitation, 
although it is not a direct consequence of that theory (see Section 62). As for 
the ordinary theory of relativity, it allows one to draw conclusions in a general 
form (i.e. without going deeply into the nature of the processes occurring) only 
in relation to unaccelerated motion. In general, however, no theory is capable 
of predicting how a clock will behave when subjected to impacts or arbitrary 
accelerations without entering into the details of the clock’s construction. This 
cannot be achieved by the theory of gravitation either; the expression given in 
Section 62 for the time shown by a clock in accelerated motion relates to the 
case when the acceleration is caused by the gravitational field. 

One could also introduce the notion of clocks nearly or, in the limit, entirely 
insensitive to impacts and accelerations (for instance atomic systems with large 
internal frequencies). One could then propose to measure proper time $\tau$ by the
readings of such an ideally insensitive clock, this being just the physical meaning 
of proper time. But one should note that such a proposal, although not in 
contradiction with the theory of relativity, does not follow from it and represents 
a special hypothesis. We shall return to this question in Sections 61 and 62.

15. Comparison of Distances and Lengths in Moving Reference Frames


If an object is at rest in a given frame of reference the determination of its 
geometrical shape or dimensions need not be performed instantaneously. 
An object at rest may be measured gradually, by noting the positions of various 
points one by one. On the other hand, if one has to measure the dimensions 
and shape of a moving object it is absolutely necessary that the observations of 
all its points should relate to the same instant, otherwise one obtains a distorted 
picture. 

It is thus clear that the concepts of dimensions and shape of a moving object 
are closely related to the concept of simultaneity. We know already that the 
notion of simultaneity is not something absolute but depends on the reference 
frame. Therefore we must expect that the shape and size of an object are also 
not absolute, but must be stated in relation to a definite frame. 

As the simplest example, let us consider the length of a rod measured in 
different frames of reference. 

Let two frames be in relative motion in the direction of their common x-axis. 
Their space and time coordinates are then connected by a Lorentz transformation 
which we restate: 

$$x' = \frac{x - vt}{\sqrt{1 - v^2/c^2}}, \qquad y' = y, \qquad z' = z, \qquad t' = \frac{t - vx/c^2}{\sqrt{1 - v^2/c^2}} \tag{15.01}$$

Let the direction of the rod be the same as the direction of the relative velocity 
of the two frames, i.e. as the x-axis, and let the rod be at rest in the unprimed 
system. At all times the coordinates of the ends of the rod in that system will be 

$$\begin{aligned} x &= a, & y &= 0, & z &= 0 && \text{(first end of rod)} \\ x &= b, & y &= 0, & z &= 0 && \text{(second end of rod)} \end{aligned} \tag{15.02}$$

and if $b > a$ the length of the rod is 

$$l = b - a \tag{15.03}$$

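In the primed frame, which moves relative to the rod, the coordinates of its 
ends will be 

$$\begin{aligned} x' &= -vt' + a', & y' &= 0, & z' &= 0 && \text{(first end)} \\ x' &= -vt' + b', & y' &= 0, & z' &= 0 && \text{(second end)} \end{aligned} \tag{15.04}$$

where 

$$a' = a\sqrt{1 - v^2/c^2}, \qquad b' = b\sqrt{1 - v^2/c^2} \tag{15.05}$$

as follows on solving (15.01) for $x'$ in terms of $x$ and $t'$. 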


The length of the rod is the distance between simultaneous positions of its ends. 
In the primed system simultaneity means equal values of $t'$ and distance is 
expressed in the usual way as a difference of primed coordinates. Therefore 
the length of the rod in the primed frame will be 

$$l' = b' - a' \tag{15.06}$$

Hence 

$$l' = l\sqrt{1 - v^2/c^2} \tag{15.07}$$

Thus, in the frame in which the rod has a velocity $v$ in the direction of its length 
it appears shortened; its length $l'$ is less than the length $l$ obtained for the 
rod at rest. 
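
As a numerical illustration, the following Python sketch repeats the argument leading to (15.07) event by event: each end of the rod is followed to the instant at which the primed clock shows a prescribed $t'$, and the difference of the primed coordinates is taken. The function names, the choice of units with $c = 1$ and the sample values are assumptions made for this illustration only.

```python
import math

def gamma(v, c=1.0):
    """Lorentz factor 1 / sqrt(1 - v^2/c^2)."""
    return 1.0 / math.sqrt(1.0 - (v / c) ** 2)

def lorentz(x, t, v, c=1.0):
    """Transform one event (x, t) to (x', t') according to (15.01)."""
    g = gamma(v, c)
    return g * (x - v * t), g * (t - v * x / c ** 2)

def end_at_primed_time(x_rest, t_primed, v, c=1.0):
    """x' of a rod end fixed at x = x_rest, read off at the primed time t'."""
    # Invert t' = gamma (t - v x / c^2) to find the unprimed time of the event.
    t = t_primed / gamma(v, c) + v * x_rest / c ** 2
    x_primed, _ = lorentz(x_rest, t, v, c)
    return x_primed

def measured_length(a, b, v, t_primed=0.0, c=1.0):
    """Length in the primed frame: both ends taken at the same t'."""
    return (end_at_primed_time(b, t_primed, v, c)
            - end_at_primed_time(a, t_primed, v, c))

rest_length = 1.0
l_primed = measured_length(0.0, rest_length, v=0.6)
# Compare with the right-hand side of (15.07).
print(l_primed, rest_length * math.sqrt(1.0 - 0.6 ** 2))
```

The two printed numbers should agree to rounding error, and the result does not depend on the value of $t'$ at which the ends are observed.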
If we were to take into consideration the transverse dimensions of the rod 
(those in the directions of the y and z-axes and perpendicular to the velocity), 
we could verify that they do not change. Consequently the volume of the rod 
decreases proportionally with its longitudinal dimension. The same result is 
true for a body of any shape. If $V$ is the volume of a body in the system in which 
it is at rest, its volume $V'$ in a frame relative to which it moves with speed $v$ 
will be 

$$V' = V\sqrt{1 - v^2/c^2} \tag{15.08}$$

Assume we have two identical rods parallel to each other with a relative velocity 
directed along their axes. Then, in the frame connected with the first rod the 
second will appear shortened, and vice versa. This state of affairs is the same 
as in the clock example and, as before, there is no paradox. The difference in 
the measured lengths comes from the different definitions of simultaneity. 
The positions of the ends of the rod that were simultaneous in one frame are 
not simultaneous, but only quasi-simultaneous in the other. Let $x_a$ be the 
position of the end $A$ at the instant $t_a$ and $x_b$ the position of the end $B$ at $t_b$. 
The corresponding quantities in the other frame will be denoted by the 
corresponding primed symbols. By the invariance of the space-like interval we have 

$$(x_a - x_b)^2 - c^2 (t_a - t_b)^2 = (x'_a - x'_b)^2 - c^2 (t'_a - t'_b)^2 \tag{15.09}$$

If we here put $t_a = t_b$ we shall have $t'_a \neq t'_b$ and hence 

$$|x_a - x_b| < |x'_a - x'_b| \tag{15.10}$$

On the left we have the length of the rod in the unprimed system; the quantity 
on the right will be the length of the rod in the primed frame provided it is at 
rest in it, for then it is immaterial whether $x'_a(t')$ and $x'_b(t')$ refer to the same 
time $t'$ or not and we can take $x'_a = x'_a(t'_a)$ and $x'_b = x'_b(t'_b)$ with $t'_a \neq t'_b$. 

If we put $t'_a = t'_b$ we have $t_a \neq t_b$ and hence 

$$|x_a - x_b| > |x'_a - x'_b| \tag{15.11}$$

but now the left-hand side is the length of the rod in the frame in which it rests 
whereas the right-hand side is the length in an arbitrary primed frame. 

In these considerations the symmetry of the equations with respect to 
primed and unprimed frames is evident from the beginning and, therefore, the 
considerations themselves may appear trivial. We have given them in order 
to emphasize once again the close relation between the concepts of simultaneity 
and of length. 
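
The two cases (15.10) and (15.11) can be checked on concrete pairs of events. The short Python sketch below is only an illustration: the helper names, the relative speed 0.6 (in units with $c = 1$) and the event coordinates are assumed values, not taken from the text.

```python
import math

def lorentz(x, t, v, c=1.0):
    """Transform one event (x, t) to (x', t') according to (15.01)."""
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return g * (x - v * t), g * (t - v * x / c ** 2)

def interval_squared(event_a, event_b, c=1.0):
    """(x_a - x_b)^2 - c^2 (t_a - t_b)^2, the quantity appearing in (15.09)."""
    (xa, ta), (xb, tb) = event_a, event_b
    return (xa - xb) ** 2 - c ** 2 * (ta - tb) ** 2

v = 0.6  # relative speed of the two frames, in units of c (assumed)

# Case (15.10): the two events are simultaneous in the unprimed frame.
A, B = (0.0, 0.0), (1.0, 0.0)
A1, B1 = lorentz(*A, v), lorentz(*B, v)

# Case (15.11): the two events are simultaneous in the primed frame,
# which requires t_b - t_a = v (x_b - x_a) / c^2 in the unprimed frame.
P, Q = (0.0, 0.0), (1.0, v * 1.0)
P1, Q1 = lorentz(*P, v), lorentz(*Q, v)

print(interval_squared(A, B), interval_squared(A1, B1))   # equal, by (15.09)
print(abs(A[0] - B[0]), abs(A1[0] - B1[0]))               # left < right
print(abs(P[0] - Q[0]), abs(P1[0] - Q1[0]))               # left > right
```

In both cases the interval is the same in the two frames; only the way it is shared between the space and time differences changes.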

16. Relative Velocity 

In pre-relativistic mechanics the relative velocity of two bodies was defined 
as the difference of their velocities. Let the velocities of two bodies, both 
measured in the same frame of reference, be u and v respectively. Then the 
velocity of the second body relative to the first used to be defined as w = v — u. 
This definition is invariant with respect to Galileo transformations but not 
Lorentz transformations. Therefore it is not suitable in the Theory of 
Relativity and must be replaced by another. The fact that w = v — u has no 
physical meaning becomes evident by examining the following example. Let 
the velocities u and v have opposite directions and have magnitudes near to 
the speed of light or equal to it. Then the “ velocity ” w will have a magnitude 
near or equal to twice the speed of light, which is evidently absurd. 

We shall, therefore, give a new definition of relative velocity which is in accord 
with the requirements of Relativity and has a direct physical meaning. Let the 
velocities of two bodies in some frame of reference be u and v as before. We 
can introduce a primed frame of reference in which the velocity of one, say the 
first, body vanishes. Then we can interpret the velocity v' of the second body 
in this frame as the relative velocity of the two bodies. We shall see that the 
magnitude of v' will depend symmetrically on u and v, so that the so defined 
relative velocity of two bodies does not depend upon which body was chosen 
to be at rest in the new frame. 

To illustrate the physical significance of our definition we consider an example. 
Imagine we are observing two aeroplanes from the ground and let their velocities 
be u and v respectively. Assume that the first plane has radar equipment 
permitting a measurement of the speed of the other plane relative to itself. 
The velocity so measured will be the relative velocity of our definition. 

We must express this relative velocity in terms of the components of the 
velocities u and v of the two planes, as observed from the ground. For this 
purpose we write down the general formulae for a Lorentz transformation 
deduced in Section 10. These are 

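$$\mathbf{r}' = \mathbf{r} - \mathbf{V}t + (a_{00} - 1)\frac{\mathbf{V}}{V^2}\{(\mathbf{V}\cdot\mathbf{r}) - V^2 t\}, \qquad t' = a_{00}\left(t - \frac{(\mathbf{V}\cdot\mathbf{r})}{c^2}\right) \tag{16.01}$$

and their inverse is 

$$\mathbf{r} = \mathbf{r}' + \mathbf{V}t' + (a_{00} - 1)\frac{\mathbf{V}}{V^2}\{(\mathbf{V}\cdot\mathbf{r}') + V^2 t'\}, \qquad t = a_{00}\left(t' + \frac{(\mathbf{V}\cdot\mathbf{r}')}{c^2}\right) \tag{16.02}$$

where we have used the notation of (10.15), putting 

$$a_{00} = \frac{1}{\sqrt{1 - V^2/c^2}} \tag{16.03}$$

To have the first plane (the one of velocity $\mathbf{u}$, relative to the ground) at rest in 
the primed system we must choose 

$$V_x = u_x; \qquad V_y = u_y; \qquad V_z = u_z \tag{16.04}$$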


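The velocity of the second plane measured from the ground is 

$$\mathbf{v} = \frac{d\mathbf{r}}{dt} \tag{16.05}$$

while its velocity measured from the other plane is 

$$\mathbf{v}' = \frac{d\mathbf{r}'}{dt'} \tag{16.06}$$

The relation between these two quantities is found by differentiating equations 
(16.01) and (16.02). We have 

$$\mathbf{v}' = \frac{\mathbf{v} - \mathbf{u} + (a_{00} - 1)(\mathbf{u}/u^2)\{(\mathbf{u}\cdot\mathbf{v}) - u^2\}}{a_{00}\,(1 - \mathbf{u}\cdot\mathbf{v}/c^2)} \tag{16.07}$$

and 

$$\mathbf{v} = \frac{\mathbf{v}' + \mathbf{u} + (a_{00} - 1)(\mathbf{u}/u^2)\{(\mathbf{u}\cdot\mathbf{v}') + u^2\}}{a_{00}\,(1 + \mathbf{u}\cdot\mathbf{v}'/c^2)} \tag{16.08}$$

where, by (16.03) and (16.04) 

$$a_{00} = \frac{1}{\sqrt{1 - u^2/c^2}} \tag{16.09}$$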

Equation (16.08) is the solution of (16.07) for $\mathbf{v}$. As the Lorentz transformation 
is linear with respect to space and time coordinates the primed components 
$v'_x$, $v'_y$ and $v'_z$ are linear fractional functions of the unprimed components $v_x$, 
$v_y$ and $v_z$. Let us calculate the square of the vector $\mathbf{v}'$, i.e. the square of the 
relative velocity of the two planes. We have 

$$v'^2 = \frac{(\mathbf{u} - \mathbf{v})^2 - (1/c^2)[\mathbf{u} \times \mathbf{v}]^2}{(1 - \mathbf{u}\cdot\mathbf{v}/c^2)^2} \tag{16.10}$$


As we have already noted, this expression is symmetrical with respect to $\mathbf{u}$ 
and $\mathbf{v}$. We shall now verify that the inequalities $u^2 < c^2$ and $v^2 < c^2$ imply 
$v'^2 < c^2$ whatever the direction of the velocity vectors $\mathbf{u}$ and $\mathbf{v}$. Indeed we have 

$$1 - \frac{v'^2}{c^2} = \frac{(1 - u^2/c^2)(1 - v^2/c^2)}{(1 - \mathbf{u}\cdot\mathbf{v}/c^2)^2} \tag{16.11}$$

Because $u^2 < c^2$ and $v^2 < c^2$ the right-hand side is always positive, so the 
left-hand side must also be positive and hence $v'^2 < c^2$. In the limit when one 
or other of the velocities $\mathbf{u}$ and $\mathbf{v}$ is equal to light velocity the relative velocity 
$\mathbf{v}'$ will also be equal to it. This agrees with our basic assumption on which the 
whole of Relativity Theory is based. 

It should be noted that the square of the relative velocity is invariant under 
Lorentz transformations. This means that if in (16.10) we replace the quantities 
u and v by the velocities u" and v" of the two aeroplanes relative to some other 
frame, say a third plane, the value (16.10) will not be affected by the change. 
It is evident that this must be so by the meaning of $v'^2$ as the square of a relative 
velocity. 
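
The statements just made about (16.07), (16.10) and (16.11) lend themselves to a direct numerical check. The Python sketch below is an illustration under stated assumptions: units with $c = 1$, arbitrarily chosen sample velocities, and helper names (`relative_velocity`, `relative_speed_squared`) that are introduced here and are not notation of the text.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def relative_velocity(u, v, c=1.0):
    """Velocity v' of the second body in the rest frame of the first, eq. (16.07)."""
    usq = dot(u, u)
    if usq == 0.0:
        return tuple(v)                              # first body already at rest
    a00 = 1.0 / math.sqrt(1.0 - usq / c ** 2)        # (16.09)
    k = (a00 - 1.0) * (dot(u, v) - usq) / usq        # coefficient of u in the numerator
    denom = a00 * (1.0 - dot(u, v) / c ** 2)
    return tuple((vi - ui + k * ui) / denom for ui, vi in zip(u, v))

def relative_speed_squared(u, v, c=1.0):
    """Square of the relative velocity, eq. (16.10)."""
    d = tuple(ui - vi for ui, vi in zip(u, v))
    w = cross(u, v)
    return (dot(d, d) - dot(w, w) / c ** 2) / (1.0 - dot(u, v) / c ** 2) ** 2

u = (0.6, 0.1, -0.2)      # sample velocities (assumed values)
v = (-0.3, 0.5, 0.4)
third = (0.2, -0.4, 0.5)  # velocity of a third observer (assumed value)

vp = relative_velocity(u, v)
# (16.07) squared, (16.10), and (16.10) with u and v interchanged.
print(dot(vp, vp), relative_speed_squared(u, v), relative_speed_squared(v, u))

# Invariance: first describe both bodies from the third observer's frame.
u_w, v_w = relative_velocity(third, u), relative_velocity(third, v)
print(relative_speed_squared(u_w, v_w))

# Opposite velocities of 0.9c: the pre-relativistic difference would be 1.8c.
print(math.sqrt(relative_speed_squared((0.9, 0.0, 0.0), (-0.9, 0.0, 0.0))))
```

All four values of $v'^2$ printed in the first two lines should coincide to rounding error, and the last line gives a speed below that of light.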

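Equation (16.07) simplifies if in the given frame the two velocities $\mathbf{u}$ and $\mathbf{v}$ 
are either parallel or perpendicular. In the first case we get 

$$\mathbf{v}' = \frac{\mathbf{v} - \mathbf{u}}{1 - \mathbf{u}\cdot\mathbf{v}/c^2} \qquad \{\text{for } [\mathbf{u} \times \mathbf{v}] = 0\} \tag{16.12}$$

and in the second 

$$\mathbf{v}' = \mathbf{v}\sqrt{1 - u^2/c^2} - \mathbf{u} \qquad \{\text{for } (\mathbf{u}\cdot\mathbf{v}) = 0\} \tag{16.13}$$

Equation (16.12) with the opposite sign before $\mathbf{u}$ is usually called Einstein’s 
Addition Theorem for Velocities. 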

In the general case one can assert the following : If “ velocity space ” is 
considered as a Lobachevsky space the relativistic addition theorem for 
velocities coincides with the vector addition theorem in Lobachevsky geometry. 
This will be proved in the following section. 


17. The Lobachevsky-Einstein Velocity Space 

Let us consider the relative velocity of two bodies moving with infinitesimally 
differing velocities $\mathbf{v}$ and $\mathbf{v} + d\mathbf{v}$. Let $ds$ be the magnitude of this relative velocity, 
divided by $c$. Inserting $\mathbf{u} = \mathbf{v} + d\mathbf{v}$ into (16.10) and dividing by $c^2$ we get 

$$ds^2 = \frac{c^2 (d\mathbf{v})^2 - [\mathbf{v} \times d\mathbf{v}]^2}{(c^2 - v^2)^2} \tag{17.01}$$

or 

$$ds^2 = \frac{(c^2 - v^2)(d\mathbf{v})^2 + (\mathbf{v}\cdot d\mathbf{v})^2}{(c^2 - v^2)^2} \tag{17.02}$$


By the physical meaning of (17.01) as being proportional to the square of an 
infinitesimal relative velocity it must be invariant under Lorentz 
transformations. This can be directly verified, by expressing $v_x$, $v_y$ and $v_z$ by (16.08) as 
fractional linear functions of $v'_x$, $v'_y$ and $v'_z$. This makes $ds^2$ the same function 
of $(v'_x, v'_y, v'_z)$ and $(dv'_x, dv'_y, dv'_z)$ as it was of $(v_x, v_y, v_z)$ and $(dv_x, dv_y, dv_z)$. 

We shall consider (17.01) and (17.02) as the squared element of length in a 
certain velocity space. This is the space in which the velocity hodograph of 
ordinary mechanics is constructed. This space possesses all the properties of 
Lobachevsky space, and the velocity components v x , v y , v z divided by c are 
the so-called Beltrami coordinates in it (see V. F. Kagan [9], eq. CVIII p. 453). 
The properties of Lobachevsky space can be derived by examining the 
expression (17.01). 

A curve in Lobachevsky space can be given by stating $v_x$, $v_y$ and $v_z$ as 
functions of some parameter $p$. If the values $p_1$ and $p_2$ of $p$ correspond to the 
end points of the curve, the arc length along the curve is given by the formula 

$$s = \int_{p_1}^{p_2} \sqrt{2F}\, dp \tag{17.03}$$

where 

$$F = \frac{1}{2}\,\frac{\dot{\mathbf{v}}^2}{c^2 - v^2} + \frac{1}{2}\,\frac{(\mathbf{v}\cdot\dot{\mathbf{v}})^2}{(c^2 - v^2)^2} \tag{17.04}$$

differentiation with respect to $p$ being denoted by a dot. We determine the 
equation of a Lobachevsky straight line, i.e. of the shortest curve joining the 
points $p_1$ and $p_2$. This is done by equating to zero the variation of the integral 
(17.03), or, in other words, by writing down Lagrange’s equations for the 
Lagrangian 

$$L = \sqrt{2F} \tag{17.05}$$

These equations are 

$$\frac{d}{dp}\frac{\partial L}{\partial \dot v_x} - \frac{\partial L}{\partial v_x} = 0, \quad \text{etc.} \tag{17.06}$$

or 

$$\frac{d}{dp}\left(\frac{1}{\sqrt{2F}}\frac{\partial F}{\partial \dot v_x}\right) - \frac{1}{\sqrt{2F}}\frac{\partial F}{\partial v_x} = 0, \quad \text{etc.} \tag{17.07}$$

Up to now the parameter $p$ was arbitrary. We now choose it so that 

$$\frac{dF}{dp} = 0, \qquad F = \text{const.} \tag{17.08}$$

With this choice equations (17.07) are equivalent to 

$$\frac{d}{dp}\frac{\partial F}{\partial \dot v_x} - \frac{\partial F}{\partial v_x} = 0, \quad \text{etc.} \tag{17.09}$$

Since $F$ does not contain $p$ explicitly, we have 

$$\dot v_x \frac{\partial F}{\partial \dot v_x} + \dot v_y \frac{\partial F}{\partial \dot v_y} + \dot v_z \frac{\partial F}{\partial \dot v_z} - F = \text{const.} \tag{17.10}$$

i.e. the quantity (17.10) is an integral of the Lagrange equations. But $F$ is a 
homogeneous quadratic function of $\dot v_x$, $\dot v_y$ and $\dot v_z$, so that (17.10) is equal to $F$ 
and thus condition (17.08) is a consequence of the Lagrange equations (17.09). 

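We now proceed to write equations (17.09) in more detail and to find their 
integrals. We have 

$$\frac{\partial F}{\partial \dot v_x} = \frac{\dot v_x}{c^2 - v^2} + v_x \frac{(\mathbf{v}\cdot\dot{\mathbf{v}})}{(c^2 - v^2)^2} \tag{17.11}$$

$$\frac{\partial F}{\partial v_x} = v_x \left\{ \frac{\dot{\mathbf{v}}^2}{(c^2 - v^2)^2} + \frac{2(\mathbf{v}\cdot\dot{\mathbf{v}})^2}{(c^2 - v^2)^3} \right\} + \dot v_x \frac{(\mathbf{v}\cdot\dot{\mathbf{v}})}{(c^2 - v^2)^2} \tag{17.12}$$

We introduce a vector $\mathbf{w}$ with components 

$$w_x = \frac{\dot v_x}{c^2 - v^2}, \qquad w_y = \frac{\dot v_y}{c^2 - v^2}, \qquad w_z = \frac{\dot v_z}{c^2 - v^2} \tag{17.13}$$

Then equations (17.11) and (17.12) become 

$$\frac{\partial F}{\partial \dot v_x} = w_x + v_x \frac{(\mathbf{v}\cdot\mathbf{w})}{c^2 - v^2} \tag{17.14}$$

$$\frac{\partial F}{\partial v_x} = v_x \left\{ w^2 + \frac{2(\mathbf{v}\cdot\mathbf{w})^2}{c^2 - v^2} \right\} + w_x (\mathbf{v}\cdot\mathbf{w}) \tag{17.15}$$

Differentiating (17.14) with respect to the parameter $p$ and expressing $\dot{\mathbf{v}}$ in 
terms of $\mathbf{w}$ we get 

$$\frac{d}{dp}\frac{\partial F}{\partial \dot v_x} = \dot w_x + v_x \frac{(\mathbf{v}\cdot\dot{\mathbf{w}})}{c^2 - v^2} + v_x \left\{ w^2 + \frac{2(\mathbf{v}\cdot\mathbf{w})^2}{c^2 - v^2} \right\} + w_x (\mathbf{v}\cdot\mathbf{w}) \tag{17.16}$$

Inserting (17.15) and (17.16) into Lagrange’s equations (17.09) we see that 
they take on the form 

$$\dot w_x + v_x \frac{(\mathbf{v}\cdot\dot{\mathbf{w}})}{c^2 - v^2} = 0, \quad \text{etc.} \tag{17.17}$$

It is easily seen that as an algebraic consequence of these equations (17.17) 
we have $(\mathbf{v}\cdot\dot{\mathbf{w}}) = 0$ whence it follows that (17.17) is equivalent to 

$$\dot w_x = 0, \qquad \dot w_y = 0, \qquad \dot w_z = 0 \tag{17.18}$$

or, vectorially, 

$$\dot{\mathbf{w}} = 0 \tag{17.19}$$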
Thus Lagrange’s equations amount to the requirement that the vector $\mathbf{w}$ 
(defined by (17.13)) should be constant. But $\mathbf{w}$ is proportional to $\dot{\mathbf{v}}$; if it is 
constant, so is the vector product 

$$[\mathbf{w} \times \mathbf{v}] = \text{const.} \tag{17.20}$$


This gives two further linearly independent integrals of Lagrange’s equations. 

From the integrals found we see that the equation of a Lobachevsky straight 
line in velocity space is a set of linear relations between the velocity 
components $v_x$, $v_y$ and $v_z$. We know that Lorentz transformations lead to a 
fractional linear substitution for the velocity components. It is, therefore, evident 
that after a transformation the linear relations remain linear. 

We now determine the length along a Lobachevsky straight line between 
the points $\mathbf{v} = \mathbf{v}_1$ and $\mathbf{v} = \mathbf{v}_2$, and also establish a connection between this 
length and the relative velocity of two bodies of velocities $\mathbf{v}_1$ and $\mathbf{v}_2$ respectively. 
As the coordinates of points on the straight line are given by linear relations 
they can be represented parametrically in the form 

$$\mathbf{v} = \mathbf{v}_1 + \mu(\mathbf{v}_2 - \mathbf{v}_1) \qquad (0 \le \mu \le 1) \tag{17.21}$$

Inserting this expression for $\mathbf{v}$ into (17.04) we get 

$$2F = \frac{c^2(\mathbf{v}_2 - \mathbf{v}_1)^2 - [\mathbf{v}_1 \times \mathbf{v}_2]^2}{(c^2 - v^2)^2}\,\dot\mu^2 \tag{17.22}$$

Putting for brevity 

$$a = \sqrt{c^2(\mathbf{v}_2 - \mathbf{v}_1)^2 - [\mathbf{v}_1 \times \mathbf{v}_2]^2} \tag{17.23}$$

we obtain from (17.03) an expression for the length along the line from $\mathbf{v}_1$ 
to $\mathbf{v}_2$ 

$$s = \int_0^1 \frac{a\, d\mu}{c^2 - v^2} \tag{17.24}$$

This integral is evaluated most simply by means of the substitution 

$$\mu = \frac{(c^2 - v_1^2)\,\xi}{c^2 - \mathbf{v}_1\cdot\mathbf{v}_2 + (\mathbf{v}_1\cdot\mathbf{v}_2 - v_1^2)\,\xi} \tag{17.25}$$

giving 

$$s = \int_0^1 \frac{ab\, d\xi}{b^2 - a^2\xi^2} = \frac{1}{2}\log\frac{b + a}{b - a} \tag{17.26}$$

where 

$$b = c^2 - \mathbf{v}_1\cdot\mathbf{v}_2 \tag{17.27}$$

The coefficients of the substitution were so chosen as to have 0 and 1 for the 
limits of $\xi$ and so that the denominator in (17.26) does not involve the first 
power of $\xi$. Hence we obtain 


$$\frac{a}{b} = \tanh s \tag{17.28}$$

and, after squaring and multiplying by $c^2$, this can be written as 

$$\frac{(\mathbf{v}_1 - \mathbf{v}_2)^2 - (1/c^2)[\mathbf{v}_1 \times \mathbf{v}_2]^2}{[1 - (\mathbf{v}_1\cdot\mathbf{v}_2)/c^2]^2} = c^2 \tanh^2 s \tag{17.29}$$

Comparing this expression with (16.10) we see that the left-hand side is the 
square of the relative velocity. Thus the relative velocity is related to the 
length of the corresponding segment of the Lobachevsky straight line by 

$$|\mathbf{v}'| = c \tanh s \tag{17.30}$$

Let us assume that the velocities $\mathbf{v}_1$ and $\mathbf{v}_2$ have the same direction. Expressing 
their magnitudes in terms of segments of a Lobachevsky straight line we have 

$$v_1 = c \tanh s_1; \qquad v_2 = c \tanh s_2 \tag{17.31}$$

The length of Lobachevsky line corresponding to the relative velocity will be 
equal to the difference of the lengths $s_2$ and $s_1$. Therefore, if $s_2 > s_1$, the relative 
velocity will be 

$$v' = c \tanh (s_2 - s_1) = c\,\frac{\tanh s_2 - \tanh s_1}{1 - \tanh s_2 \tanh s_1} \tag{17.32}$$

or 

$$v' = \frac{v_2 - v_1}{1 - (v_2 v_1/c^2)} \tag{17.33}$$

This is just the Einstein addition, or rather in our case subtraction, formula. 
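
The chain (17.24) to (17.33) can be followed numerically. In the Python sketch below, which is only an illustration, the integral (17.24) is evaluated by Simpson’s rule and compared with the closed form (17.26); the collinear case is then compared with (17.32) and (17.33). The function names, the sample velocities and the choice $c = 1$ are assumptions of this sketch.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def a_and_b(v1, v2, c=1.0):
    """The quantities a of (17.23) and b of (17.27)."""
    d = tuple(q - p for p, q in zip(v1, v2))
    w = cross(v1, v2)
    a = math.sqrt(c ** 2 * dot(d, d) - dot(w, w))
    b = c ** 2 - dot(v1, v2)
    return a, b

def length_closed_form(v1, v2, c=1.0):
    """s = (1/2) log((b + a) / (b - a)), eq. (17.26)."""
    a, b = a_and_b(v1, v2, c)
    return 0.5 * math.log((b + a) / (b - a))

def length_by_quadrature(v1, v2, c=1.0, n=2000):
    """Simpson's rule for the integral (17.24) over 0 <= mu <= 1 (n must be even)."""
    a, _ = a_and_b(v1, v2, c)

    def integrand(mu):
        v = tuple(p + mu * (q - p) for p, q in zip(v1, v2))   # (17.21)
        return a / (c ** 2 - dot(v, v))

    h = 1.0 / n
    total = integrand(0.0) + integrand(1.0)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * integrand(i * h)
    return total * h / 3.0

v1 = (0.5, 0.1, 0.0)     # sample velocities (assumed values)
v2 = (-0.2, 0.6, 0.3)
print(length_closed_form(v1, v2), length_by_quadrature(v1, v2))

# Collinear velocities: lengths subtract, as stated before (17.32).
s = length_closed_form((0.3, 0.0, 0.0), (0.8, 0.0, 0.0))
print(s, math.atanh(0.8) - math.atanh(0.3))
print(math.tanh(s), (0.8 - 0.3) / (1.0 - 0.8 * 0.3))   # (17.33)
```

Each printed pair should agree to within the accuracy of the quadrature.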

Let us now consider the angle between the relative velocities of two bodies. 
If the velocities are taken relative to a point assumed at rest the cosine of the 
angle is given by the usual formula 

$$\cos(\mathbf{v}_1, \mathbf{v}_2) = \frac{(\mathbf{v}_1\cdot\mathbf{v}_2)}{|\mathbf{v}_1|\,|\mathbf{v}_2|} \tag{17.34}$$

but if the velocities are given relative to a point that is itself in motion with a 
velocity $\mathbf{u}$, the angle between the relative velocities is given by a more 
complicated formula, which, however, can easily be obtained as follows. We make a 
Lorentz transformation to bring to rest the point that was moving with velocity 
$\mathbf{u}$. Then we can apply the usual formula (17.34), getting 

$$\cos\alpha = \cos(\mathbf{v}'_1, \mathbf{v}'_2) = \frac{(\mathbf{v}'_1\cdot\mathbf{v}'_2)}{|\mathbf{v}'_1|\,|\mathbf{v}'_2|} \tag{17.35}$$

with $\mathbf{v}'_1$ and $\mathbf{v}'_2$ denoting the velocities of the two bodies after the Lorentz 
transformation. These velocities are obtained from (16.07), replacing $\mathbf{v}$ by 
$\mathbf{v}_1$ and by $\mathbf{v}_2$. Expressing $\mathbf{v}'_1$ and $\mathbf{v}'_2$ by $\mathbf{v}_1$ and $\mathbf{v}_2$, we obtain after some 
manipulation 

$$\cos\alpha = \frac{(\mathbf{v}_1 - \mathbf{u})\cdot(\mathbf{v}_2 - \mathbf{u}) - (1/c^2)[\mathbf{v}_1 \times \mathbf{u}]\cdot[\mathbf{v}_2 \times \mathbf{u}]}{\sqrt{(\mathbf{v}_1 - \mathbf{u})^2 - (1/c^2)[\mathbf{v}_1 \times \mathbf{u}]^2}\;\sqrt{(\mathbf{v}_2 - \mathbf{u})^2 - (1/c^2)[\mathbf{v}_2 \times \mathbf{u}]^2}} \tag{17.36}$$

This, however, is just the expression for the cosine of the angle of a triangle in 
Lobachevsky space. (It is the angle at $\mathbf{u}$ in a triangle whose vertices are at the 
points $\mathbf{u}$, $\mathbf{v}_1$ and $\mathbf{v}_2$.) 

Indeed our initial expression (17.01) for the square of the element of length 
in Lobachevsky space has the form 

$$ds^2 = \frac{c^2 (d\mathbf{v})^2 - [\mathbf{v} \times d\mathbf{v}]^2}{(c^2 - v^2)^2} \tag{17.37}$$

This expression corresponds to a “ displacement ” $d\mathbf{v}$ from the “ point ” $\mathbf{v}$. 
For the displacement $\delta\mathbf{v}$ from the same point we have 

$$\delta s^2 = \frac{c^2 (\delta\mathbf{v})^2 - [\mathbf{v} \times \delta\mathbf{v}]^2}{(c^2 - v^2)^2} \tag{17.38}$$

The cosine of the angle between the displacements can be stated in an invariant 
manner by means of the equation 

$$ds\,\delta s \cos\alpha = \frac{c^2\, d\mathbf{v}\cdot\delta\mathbf{v} - [\mathbf{v} \times d\mathbf{v}]\cdot[\mathbf{v} \times \delta\mathbf{v}]}{(c^2 - v^2)^2} \tag{17.39}$$

(This formula can be considered as the definition of an angle in Lobachevsky 
geometry.) If now in (17.36) we write $\mathbf{v}$ instead of $\mathbf{u}$ and if we insert into 
(17.39) the displacements 

$$d\mathbf{v} = \varepsilon(\mathbf{v}_1 - \mathbf{v}); \qquad \delta\mathbf{v} = \eta(\mathbf{v}_2 - \mathbf{v}) \tag{17.40}$$

along two sides of a triangle starting from the vertex $\mathbf{v}$, the expression for $\cos\alpha$ 
obtained from (17.39) is the same as in (17.36). 

Thus the angle between relative velocities can be thought of as an angle 
in Lobachevsky geometry. Given three bodies moving with velocities $\mathbf{v}_1$, $\mathbf{v}_2$ 
and $\mathbf{v}_3$ the corresponding triangle has its vertices at the points $\mathbf{v}_1$, $\mathbf{v}_2$ and $\mathbf{v}_3$; 
the relative velocities will correspond to the sides of the triangle and the angles 
between the velocities to the angles of the triangle. A similar construction is 
possible in non-relativistic kinematics, but there the geometry of velocity space 
is Euclidean whereas in the Theory of Relativity it is Lobachevskian. 
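
A numerical sketch may make the triangle construction concrete. The Python code below evaluates the cosine (17.36) directly and compares it with the ordinary cosine (17.34) formed after bringing the vertex to rest by means of (16.07); it then adds up the three angles of a velocity triangle. The helper names, the sample velocities and the units with $c = 1$ are assumptions made for the illustration.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def relative_velocity(u, v, c=1.0):
    """Velocity of the body v in the rest frame of the body u, eq. (16.07)."""
    usq = dot(u, u)
    if usq == 0.0:
        return tuple(v)
    a00 = 1.0 / math.sqrt(1.0 - usq / c ** 2)
    k = (a00 - 1.0) * (dot(u, v) - usq) / usq
    denom = a00 * (1.0 - dot(u, v) / c ** 2)
    return tuple((vi - ui + k * ui) / denom for ui, vi in zip(u, v))

def cos_angle_formula(u, v1, v2, c=1.0):
    """Cosine of the angle at the vertex u, eq. (17.36)."""
    d1, d2 = sub(v1, u), sub(v2, u)
    w1, w2 = cross(v1, u), cross(v2, u)
    num = dot(d1, d2) - dot(w1, w2) / c ** 2
    den = (math.sqrt(dot(d1, d1) - dot(w1, w1) / c ** 2)
           * math.sqrt(dot(d2, d2) - dot(w2, w2) / c ** 2))
    return num / den

def cos_angle_by_boost(u, v1, v2, c=1.0):
    """Ordinary cosine (17.34) between the relative velocities v1' and v2'."""
    p1, p2 = relative_velocity(u, v1, c), relative_velocity(u, v2, c)
    return dot(p1, p2) / math.sqrt(dot(p1, p1) * dot(p2, p2))

v1 = (0.6, 0.1, -0.2)    # sample velocities of three bodies (assumed values)
v2 = (-0.3, 0.5, 0.4)
v3 = (0.4, -0.1, 0.2)

print(cos_angle_formula(v3, v1, v2), cos_angle_by_boost(v3, v1, v2))

angles = [math.acos(cos_angle_formula(v1, v2, v3)),
          math.acos(cos_angle_formula(v2, v3, v1)),
          math.acos(cos_angle_formula(v3, v1, v2))]
print(sum(angles), math.pi)   # the angle sum falls short of two right angles
```

The deficit of the angle sum from $\pi$ is the mark of Lobachevsky geometry; in the Euclidean velocity space of non-relativistic kinematics the sum would equal $\pi$.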

The validity of Lobachevsky geometry in velocity space has its experimental 
verification. We refer to Fizeau’s experiment which measures the speed of 
light in a moving medium and to the phenomenon of astronomical aberration 
discovered by Bradley. 

Fizeau’s experiment aims at comparing the velocity of light propagation in a 
moving and in a stationary medium. The propagation of light in a medium is 
a rather complicated process involving the charges within the medium; a 
velocity $c/n$ can be attributed to the propagation where $n$ is the index of 
refraction of the medium. If the medium itself moves in the direction of the 
propagation with a speed $v$ then $w = c/n$ will be the speed of light propagation 
relative to the medium. The speed $w'$ relative to a frame at rest can then be 
obtained by Einstein’s equation 

$$w' = \frac{w + v}{1 + (wv/c^2)} \tag{17.41}$$

Inserting here $w = c/n$ and retaining terms of first and zero order in $v$ we get 

$$w' = \frac{c}{n} + \left(1 - \frac{1}{n^2}\right) v \tag{17.42}$$


The coefficient of v bears the name Fresnel’s convection coefficient. 

The fact that this coefficient is not unity shows that the addition of velocities 
must be performed according to Einstein’s addition formula, corresponding 
to Lobachevsky geometry, and not according to the pre-relativistic rule based 
on Euclidean geometry. 
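
To give an idea of the magnitudes involved, the following Python sketch compares the exact composition (17.41) with the first-order expression (17.42) and with the pre-relativistic sum. The refractive index and flow speed are assumed, illustrative values for water and are not data from the text; the function names are likewise introduced only for this example.

```python
C = 299_792_458.0  # speed of light in vacuum, m/s

def einstein_sum(w, v, c=C):
    """Composition of collinear velocities, eq. (17.41)."""
    return (w + v) / (1.0 + w * v / c ** 2)

def fresnel_first_order(n, v, c=C):
    """First-order expression (17.42) with Fresnel's coefficient 1 - 1/n^2."""
    return c / n + (1.0 - 1.0 / n ** 2) * v

n_water = 1.333   # approximate refractive index of water (assumed value)
v_flow = 7.0      # speed of the medium in m/s (assumed value)

exact = einstein_sum(C / n_water, v_flow)
approx = fresnel_first_order(n_water, v_flow)
galilean = C / n_water + v_flow          # pre-relativistic rule
drag = 1.0 - 1.0 / n_water ** 2          # Fresnel's convection coefficient

print(exact - C / n_water)     # speed gained according to (17.41)
print(approx - C / n_water)    # speed gained according to (17.42)
print(galilean - C / n_water)  # speed gained according to the old rule
print(drag)
```

For these values the gain predicted by (17.41) and (17.42) is somewhat less than half of the flow speed, whereas the pre-relativistic rule would add the whole of it.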

The phenomenon of astronomical aberration consists basically in the fact 
that in two frames in relative motion the directions to one and the same star 
do not coincide but differ by the aberration. To find this quantity one should 
construct a Lobachevsky triangle with vertices at the points $\mathbf{v}_1$, $\mathbf{v}_2$ and $\mathbf{v}_3 = \mathbf{a}c$ 
where $\mathbf{v}_1$ and $\mathbf{v}_2$ are the velocities of bodies fixed in the two frames and $\mathbf{a}$ is a 
unit vector in the direction of the light wave emitted by the star. In the triangle 
$\mathbf{v}_1$, $\mathbf{v}_2$, $\mathbf{v}_3$ the angle at the vertex $\mathbf{v}_3$ will be zero (see equation (17.44) below), 
and the sum of the two angles at $\mathbf{v}_1$ and $\mathbf{v}_2$ will be less than two right angles by 
the magnitude of the aberration. (If the geometry were Euclidean the sum of 
these angles would be equal to two right angles.) 

The required trigonometrical calculation can easily be performed using the 
formula 

$$\cos\alpha_1 = \frac{(\mathbf{v}_2 - \mathbf{v}_1)\cdot(\mathbf{v}_3 - \mathbf{v}_1) - (1/c^2)[\mathbf{v}_2 \times \mathbf{v}_1]\cdot[\mathbf{v}_3 \times \mathbf{v}_1]}{\sqrt{(\mathbf{v}_2 - \mathbf{v}_1)^2 - (1/c^2)[\mathbf{v}_2 \times \mathbf{v}_1]^2}\;\sqrt{(\mathbf{v}_3 - \mathbf{v}_1)^2 - (1/c^2)[\mathbf{v}_3 \times \mathbf{v}_1]^2}} \tag{17.43}$$

and two other formulae obtained from (17.43) by cyclic permutation of the 
suffixes 1, 2 and 3. 

First of all, since $v_3^2 = c^2$, we have 

$$\cos\alpha_3 = 1; \qquad \alpha_3 = 0 \tag{17.44}$$

We then choose a frame such that 

$$\mathbf{v}_1 + \mathbf{v}_2 = 0 \tag{17.45}$$

and put 

$$|\mathbf{v}_1| = |\mathbf{v}_2| = v; \qquad (\mathbf{a}\cdot\mathbf{v}_1) = -(\mathbf{a}\cdot\mathbf{v}_2) = v\cos\beta \tag{17.46}$$

The relative speed of the two systems will be 

$$v' = \frac{2v}{1 + v^2/c^2} \tag{17.47}$$

From (17.43) we get 

$$\cos\alpha_1 = \frac{v - c\cos\beta}{c - v\cos\beta}; \qquad \sin\alpha_1 = \frac{\sqrt{c^2 - v^2}\,\sin\beta}{c - v\cos\beta} \tag{17.48}$$

and, by analogy, 

$$\cos\alpha_2 = \frac{v + c\cos\beta}{c + v\cos\beta}; \qquad \sin\alpha_2 = \frac{\sqrt{c^2 - v^2}\,\sin\beta}{c + v\cos\beta} \tag{17.49}$$

Denoting the magnitude of aberration by $\delta$ we can put 

$$\delta = \pi - \alpha_1 - \alpha_2 - \alpha_3 = \pi - \alpha_1 - \alpha_2 \tag{17.50}$$

The previous equations give 

$$2\sin^2\tfrac{1}{2}\delta = 1 + \cos(\alpha_1 + \alpha_2) = \frac{2v^2\sin^2\beta}{c^2 - v^2\cos^2\beta} \tag{17.51}$$

whence 

$$\tan\tfrac{1}{2}\delta = \frac{v\sin\beta}{\sqrt{c^2 - v^2}} \tag{17.52}$$

In astronomical observations one compares the apparent position of a star 
at times of different direction of the Earth’s orbital motion (annual aberration). 
Since only the relative velocities of the bodies that receive the starlight enter 
our considerations, the overall motion of the Solar System relative to the stars 
considered will evidently have no effect as long as its velocity is constant during 
the lengths of time involved. It is, therefore, not possible to determine the 
velocity of a star relative to the Earth or the Sun by observing its aberration. 
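
For orientation, the size of the effect can be estimated with a few lines of Python. The sketch below evaluates (17.48), (17.49) and (17.50) and compares the result with (17.52). The value used for the Earth’s orbital speed is an approximate, assumed figure, and the function names are introduced only for this illustration. With $v$ equal to the orbital speed, the two frames correspond to the Earth at opposite points of its orbit as seen from the Sun.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s

def triangle_angles(v, beta, c=C):
    """Angles alpha_1 and alpha_2 from (17.48) and (17.49), for 0 <= beta <= pi."""
    root = math.sqrt(c ** 2 - v ** 2)
    # The denominators c -/+ v cos(beta) are positive and common to sine and
    # cosine, so atan2 of the numerators gives the angle itself.
    alpha_1 = math.atan2(root * math.sin(beta), v - c * math.cos(beta))
    alpha_2 = math.atan2(root * math.sin(beta), v + c * math.cos(beta))
    return alpha_1, alpha_2

def aberration(v, beta, c=C):
    """delta from (17.52): tan(delta / 2) = v sin(beta) / sqrt(c^2 - v^2)."""
    return 2.0 * math.atan(v * math.sin(beta) / math.sqrt(c ** 2 - v ** 2))

v_orbit = 29_780.0    # approximate orbital speed of the Earth, m/s (assumed value)
beta = math.pi / 2.0  # star in a direction perpendicular to the motion

alpha_1, alpha_2 = triangle_angles(v_orbit, beta)
delta_from_angles = math.pi - alpha_1 - alpha_2      # (17.50) with alpha_3 = 0
delta = aberration(v_orbit, beta)
arcsec = math.degrees(delta) * 3600.0

print(delta_from_angles, delta)
print(arcsec)   # about 41 seconds of arc for these values
```

The result is roughly twice the usual constant of annual aberration, as it should be, since the relative speed (17.47) of the two frames is very nearly twice the orbital speed.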

THE THEORY OF RELATIVITY IN TENSOR FORM 

18. Some Remarks on the Covariance of Equations 

In Section 6, when we discussed the basic postulates of the Theory of 
Relativity, we established certain general requirements to which any equations 
must conform, which determine the course of physical processes and do not 
involve initial conditions. These requirements fix the rules according to which 
such equations must transform on passing from one inertial frame to another. 

According to the Principle of Relativity the rules for transforming the 
independent variables and the unknown functions entering the equations 
must be such that the equations stated in one inertial frame are equivalent 
to equations of the same form in any other inertial frame. 

We have already used this requirement in discussing the Lorentz 
transformation. Indeed, we obtained this transformation from the condition that 
the equation for wave front propagation 

$$\frac{1}{c^2}\left(\frac{\partial\omega}{\partial t}\right)^2 - \left[\left(\frac{\partial\omega}{\partial x}\right)^2 + \left(\frac{\partial\omega}{\partial y}\right)^2 + \left(\frac{\partial\omega}{\partial z}\right)^2\right] = 0 \tag{18.01}$$

should preserve its form and also that uniform rectilinear motion should 
remain uniform and rectilinear in all inertial frames. Thus the rules for 
transforming the independent variables, i.e. the space and time coordinates, have 
already been fixed by the above requirements, but we have yet to consider 
the transformation rules for the unknown functions occurring in the equations. 

The simplest transformation rule is simple invariance. For instance, in the 
case of equation (18.01) $\omega$ is the unknown function. It is determined by the 
wave front equation 

$$\omega(x, y, z, t) = 0 \tag{18.02}$$

The transformed function $\omega'$, giving the wave front equation in the new 
variables 

$$\omega'(x', y', z', t') = 0 \tag{18.03}$$

is simply equal to the old function; its form is determined by the condition 

$$\omega'(x', y', z', t') = \omega(x, y, z, t) \tag{18.04}$$

However, in the majority of problems the form of an equation is only 
conserved if the Lorentz transformation on the independent variables is 
accompanied by certain transformations of the unknown functions. If a 
transformation exists such that the new functions, expressed in terms of the 
new variables, satisfy equations of the same form as do the old functions in 
the old variables, the equations are termed covariant. 
The requirement of covariance of equations under Lorentz transformations 
is a necessary consequence of the Principle of Relativity. On the other hand, 
it is clear that not all equations are covariant. We shall have to examine the 
equations used in the physical description of various processes, e.g. the equations 
of electrodynamics and of mechanics, in order to decide whether they are 
covariant or not. If some of them are not, we shall have to alter them to a 
covariant form. 

A preliminary to investigating covariance and to formulating covariant 
equations is the study of the quantities with the simplest transformation 
laws. Only linear laws need be considered. The complete classification of 
quantities according to their transformation laws is a question for Group 
Theory. We cannot here deal with it in its full generality and shall confine 
ourselves to what is essential for the formulation of the basic equations of 
mechanics and electrodynamics. 

19. Definition of a Tensor in Three Dimensions and some Remarks on 
Covariant Quantities 

The equations describing a rotation of the axes of a spatial coordinate 
frame are 

$$x'_i = \sum_{k=1}^{3} \alpha_{ik} x_k, \qquad x_i = \sum_{k=1}^{3} \alpha_{ki} x'_k \qquad (i, k = 1, 2, 3) \tag{19.01}$$

where the $\alpha_{ik}$ are the direction cosines relating the old and the new axes. 
Formally we can define a three-dimensional vector as a set of quantities $(A_1, 
A_2, A_3)$, called components, which transform according to 

$$A'_i = \sum_{k=1}^{3} \alpha_{ik} A_k \tag{19.02}$$

when the axes are rotated. A special case of a vector is the coordinate vector 
for which $A_1 = x_1$, $A_2 = x_2$ and $A_3 = x_3$. 

Let us assume that two vectors are given and that their components $A_i$ 
and $B_i$ are connected by the linear relation 

$$B_i = \sum_{k=1}^{3} T_{ik} A_k \tag{19.03}$$

After a rotation of the axes the new vector components will be connected 
by the analogous relation 

$$B'_i = \sum_{k=1}^{3} T'_{ik} A'_k \tag{19.04}$$

with 

$$T'_{ik} = \sum_{m,l=1}^{3} \alpha_{im}\alpha_{kl} T_{ml} \tag{19.05}$$

For the old and the new components of the vector $B_i$ must be connected by 
the same transformation as those of $A_i$. If first we express the $B'_i$ in terms of 
the $B_i$, and then in (19.03) express the $A_i$ in terms of the $A'_k$, we obtain (19.04) 
and (19.05). 
Thus the set of quantities $T_{ik}$ transforms according to (19.05) when the 
axes are rotated. Such a set is called a tensor, or more precisely a tensor of 
the second rank. (In general the rank of a tensor is equal to the number of 
suffixes attached to it.) 
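
The consistency of (19.02) to (19.05) can be seen on a small numerical example. In the Python sketch below, which is only an illustration, the rotation is taken about the z-axis and the components of $T_{ik}$ and $A_k$ are arbitrary assumed numbers; the function names are introduced here for convenience.

```python
import math

def rotation_z(angle):
    """Direction cosines alpha_ik for a rotation of the axes about the z-axis."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0],
            [-s, c, 0.0],
            [0.0, 0.0, 1.0]]

def transform_vector(alpha, A):
    """A'_i = sum_k alpha_ik A_k, eq. (19.02)."""
    return [sum(alpha[i][k] * A[k] for k in range(3)) for i in range(3)]

def transform_tensor(alpha, T):
    """T'_ik = sum_{m,l} alpha_im alpha_kl T_ml, eq. (19.05)."""
    return [[sum(alpha[i][m] * alpha[k][l] * T[m][l]
                 for m in range(3) for l in range(3))
             for k in range(3)] for i in range(3)]

def apply_tensor(T, A):
    """B_i = sum_k T_ik A_k, eq. (19.03)."""
    return [sum(T[i][k] * A[k] for k in range(3)) for i in range(3)]

alpha = rotation_z(0.7)                                   # assumed angle
T = [[2.0, 1.0, 0.0], [0.0, 3.0, 4.0], [5.0, 0.0, 1.0]]   # assumed components
A = [1.0, -2.0, 0.5]

# Rotate B = T A as a vector ...
B_rotated = transform_vector(alpha, apply_tensor(T, A))
# ... and compare with (19.04), formed from the transformed T and A.
B_from_new_components = apply_tensor(transform_tensor(alpha, T),
                                     transform_vector(alpha, A))
print(B_rotated)
print(B_from_new_components)
```

The two printed lists should agree to rounding error, which is the content of the statement that (19.04) holds with $T'_{ik}$ given by (19.05).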

Examples of tensors occur even in non-relativistic mechanics. Thus the 
components of the angular momentum of a rigid body ($B_i$) are connected 
with the components of its angular velocity ($A_i$) by relations of the form 
(19.03) where $T_{ik}$ is the inertia tensor. Another example occurs in the theory 
of elasticity, where similar relations give the connection between the 
components $(dF_x, dF_y, dF_z)$ of the force on some surface element and the 
projections $(dS_x, dS_y, dS_z)$ of the surface element on to the coordinate planes. 
In this case the set of coefficients is the stress tensor. In both these examples 
the tensor $T_{ik}$ is symmetric in its suffixes, but tensors also exist which do not 
have this property. 

Equation (19.05) shows that the components of a tensor transform like the 
products $x_i \xi_k$ of the coordinates $(x_1, x_2, x_3)$ and $(\xi_1, \xi_2, \xi_3)$ of two points, which 
may be coincident ($x_i = \xi_i$). 

Let us consider an antisymmetric tensor, i.e. one whose components satisfy 
the relation 

$$T_{ik} + T_{ki} = 0 \tag{19.06}$$

An example of such a tensor is the set of antisymmetric combinations formed 
from the products of the coordinates of two points 

$$T_{ik} = x_i \xi_k - \xi_i x_k \tag{19.07}$$

As the suffixes we are here concerned with can only take on the three values 
(1, 2, 3), we can replace any pair of suffixes by the single missing one and put 

$$T_{23} = -T_{32} = T_1, \qquad T_{31} = -T_{13} = T_2, \qquad T_{12} = -T_{21} = T_3 \tag{19.08}$$

We introduce the antisymmetric system of quantities $\varepsilon_{ikl}$ defined as 
$\varepsilon_{ikl} = +1$, if $(i, k, l)$ is an even permutation of (1, 2, 3); 
$\varepsilon_{ikl} = -1$, if $(i, k, l)$ is an odd permutation of (1, 2, 3); 
$\varepsilon_{ikl} = 0$, if any of the suffixes are equal. 

Then (19.08) may be written as 

$$T_l = \frac{1}{2}\sum_{i,k=1}^{3} \varepsilon_{ikl} T_{ik} \tag{19.09}$$

The properties of the $\varepsilon_{ikl}$ ensure that in fact the sum in (19.09) has only two terms 
and if $(i, k, l)$ is an even permutation of (1, 2, 3) the previous equation may be 
written as 

$$T_l = T_{ik} \tag{19.09*}$$

Using the antisymmetry condition we can replace (19.05) by 

$$T'_{ik} = \frac{1}{2}\sum_{p,q=1}^{3} (\alpha_{ip}\alpha_{kq} - \alpha_{iq}\alpha_{kp})\, T_{pq} \tag{19.10}$$

In this sum three of the nine terms vanish, namely those with $p = q$. The 
other six are equal in pairs, so that only three essentially different terms occur. 

Let (i, k, l) and (p, q, r) be even permutations of ( 1 , 2, 3). Using the notation 
(19.09) and putting 

O'ip&kq CC iqCLkp = |^ir (19.11) 

we can rewrite equation (19.10) as 

T\= I MV (19-12) 

r=l 

Let us now consider the determinant 

K 11 a 12 a 13 

A ~ a 21 cc 2 2 ^23 (19.13) 

a 3X a 32 a 33 

By the orthogonality of the transformation A 2 = 1, and, in particular, if the 
transformation is purely a rotation of the axes A = +1, while if the rotation 
is accompanied by a reflexion, i.e. a reversal of one or of three axes, A = — 1. 
In both cases 

pir = A • air (19.14) 

Thus for a simple rotation of the axes we have 

$$T'_l = \sum_{r=1}^{3} \alpha_{lr} T_r \tag{19.15}$$

and for an “ improper orthogonal transformation ”, i.e. a rotation with
reflexion,

$$T'_l = -\sum_{r=1}^{3} \alpha_{lr} T_r \tag{19.16}$$

Equations (19.15) and (19.16) show that in a pure rotation the set of quantities
(19.08) transforms as a vector and in a rotation with reflexion their
transformation is the same as for a vector except for a change of sign of all
components. A set of quantities having this transformation law is conventionally
called an axial vector, as distinct from a polar vector, which is one that obeys
(19.02) in all transformations. The vector product of two polar vectors is easily
seen to be an axial vector. To give physical examples (in terms of
three-dimensional vector algebra), the vector of electric field intensity is polar
and the vector of magnetic field intensity is axial. Equations (19.08) show that an
axial vector is in essence an antisymmetric tensor of the second rank; therefore if
the terms “ vector ” and “ tensor ” are used in the sense of definitions (19.02)
and (19.05) one can do without the term “ axial vector ”.
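The statement that the vector product of two polar vectors is axial can be checked on a concrete case. The sketch below is only an illustration: the two matrices and the vectors `a` and `b` are arbitrary choices, not taken from the text, and the check is the rule "rotate both factors, then multiply" against "multiply, rotate, and apply the factor $\Delta$".

```python
# Sketch: the vector product of two polar vectors behaves as an axial vector.

def matvec(m, v):
    """Apply a 3x3 matrix (list of rows) to a 3-vector."""
    return [sum(m[i][k] * v[k] for k in range(3)) for i in range(3)]

def cross(a, b):
    """Ordinary vector product of two 3-vectors."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def det3(m):
    """Determinant of a 3x3 matrix (the quantity Delta of the text)."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

# Rotation by a right angle about the z axis (Delta = +1).
rotation = [[0, 1, 0], [-1, 0, 0], [0, 0, 1]]
# The same rotation combined with a reversal of the z axis (Delta = -1).
rotoreflexion = [[0, 1, 0], [-1, 0, 0], [0, 0, -1]]

a = [1, 2, 3]     # hypothetical polar vectors
b = [-2, 0, 5]

def axial_law_holds(alpha):
    """Check cross(alpha a, alpha b) == Delta * alpha cross(a, b)."""
    lhs = cross(matvec(alpha, a), matvec(alpha, b))
    rhs = [det3(alpha) * c for c in matvec(alpha, cross(a, b))]
    return lhs == rhs
```

Integer entries are used so that the comparison is exact; `axial_law_holds` returns `True` for both matrices, the second one only because of the extra factor $\Delta = -1$.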

Tensors of higher rank in three-dimensional Euclidean space can be defined
similarly. For instance, a tensor of the third rank is a set of quantities $T_{ijk}$
which transform on rotation according to the law

$$T'_{ijk} = \sum_{l,m,n=1}^{3} \alpha_{il}\alpha_{jm}\alpha_{kn}\, T_{lmn} \tag{19.17}$$

Further, a quantity that does not change when the axes are rotated can also 
be considered to be a tensor, namely a tensor of zero rank. Such a quantity 
is called a scalar or an invariant. 

Given a tensor of the second rank one can find a linear combination of its
components which does not change when the axes are rotated, and thus is a
scalar. The linear combination having this property is the sum of the diagonal
elements of the tensor, i.e. the quantity

$$T = \sum_{i} T_{ii} \tag{19.18}$$

Using the property

$$\sum_{i} \alpha_{ik}\alpha_{il} = \delta_{kl} \tag{19.19}$$

of the coefficients of an orthogonal transformation one verifies easily that
(19.05) leads to

$$T' = T \tag{19.20}$$

so that $T$ is a scalar. Similarly, from the components of a third rank tensor,
one can obtain three vectors, namely

$$A_l = \sum_{m} T_{lmm} ; \qquad B_l = \sum_{m} T_{mlm} ; \qquad C_l = \sum_{m} T_{mml} \tag{19.21}$$
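The invariance (19.20) is easy to confirm numerically. The following sketch assumes that (19.05) has the form $T'_{ik} = \sum_{p,q} \alpha_{ip}\alpha_{kq} T_{pq}$, as implied by (19.10); the rotation angle and the tensor components are arbitrary illustrative values.

```python
import math

def rotation_z(angle):
    """Coefficients alpha_ik for a rotation of the axes about z."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def transform_rank2(alpha, t):
    """T'_ik = sum over p, q of alpha_ip alpha_kq T_pq."""
    n = range(3)
    return [[sum(alpha[i][p] * alpha[k][q] * t[p][q] for p in n for q in n)
             for k in n] for i in n]

def trace(t):
    """Sum of the diagonal elements, the quantity (19.18)."""
    return sum(t[i][i] for i in range(3))

tensor = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 10.0]]
rotated = transform_rank2(rotation_z(0.7), tensor)
```

The individual components of `rotated` differ from those of `tensor`, while `trace(rotated)` and `trace(tensor)` agree to rounding error.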

One can ask whether it is possible to replace tensors by other quantities of 
various ranks in such a way that it is no longer possible to form from compon¬ 
ents of one rank any linear combinations that transform like quantities of 
lower rank. 

This can indeed be done and the resulting quantities have the transformation
properties of solid harmonics. We recall that a solid harmonic $P_{lm}(x, y, z)$
is a harmonic polynomial which, expressed in terms of spherical polar
coordinates

$$x = r \sin\vartheta \cos\varphi ; \qquad y = r \sin\vartheta \sin\varphi ; \qquad z = r \cos\vartheta$$

becomes a product of $r^l$ with a surface harmonic

$$r^l\, Y_{lm}(\vartheta, \varphi) = P_{lm}(x, y, z)$$

The order $l$ is equal to the rank of the corresponding tensor, but the number
of linearly independent “ components ” will be less than for a tensor. If the
coordinate frame is rotated the new polynomial $P_{lm}(x', y', z')$ can be expressed
as a linear combination of the old polynomials $P_{lm}(x, y, z)$, all with the same
$l$, and with $m$ taking on the values $m = -l, -l+1, \ldots, l$. The coefficients
of this linear transformation determine the desired transformation law for
quantities of rank $l$.

One final remark : We have assumed in our discussions that the tensors
or related quantities are, by their physical meaning, completely determined,
including their signs. Correspondingly, the coefficients in our transformation
formulae were single-valued functions of the cosines $\alpha_{ik}$. However, one can
also imagine quantities that are defined only up to a sign, so that only quadratic
expressions in them are fully determined. In this case the coefficients of their
transformations need not be uniquely fixed by the $\alpha_{ik}$ but may instead be
determined only apart from their sign (the same in all coefficients). This
leads to a new type of physical quantity, the so-called spinor, and to a
corresponding generalization of spherical harmonics and their transformation
law. Such quantities are used in quantum mechanics.

Tensors are thus not the only geometrical quantities with a definite
transformation law, but in ordinary, non-quantal applications of the Theory of
Relativity the consideration of tensors is sufficient.


20. Definition of a Four-Dimensional Vector

In the preceding section we summarized the definition of tensors in Euclidean 
space. We must now generalize this definition to apply to the four-dimensional 
space-time manifold. The part played before by rotations of the axes is now 
taken over by Lorentz transformations. An essentially new feature is the 
difference of sign between the space terms and the time term in the expression 

$$ds^2 = c^2\, dt^2 - dx^2 - dy^2 - dz^2 \tag{20.01}$$

which gives the square of an infinitesimal four-dimensional interval. We have
seen that the invariance of (20.01) characterizes Lorentz transformations in
the same way as the invariance of

$$dl^2 = dx^2 + dy^2 + dz^2 \tag{20.02}$$

characterizes rotations of spatial axes. 

The difference in sign is extremely important for the whole theory because 
it reflects the existence of a deep distinction between space and time. 

It would be possible to impose the same sign on all terms of ds 2 by introducing 
imaginary quantities—imaginary space coordinates or imaginary time. This 
course was adopted by Minkowski and by Umov ; but we believe that it is not 
appropriate to introduce a symmetry between space and time into our equations 
in this manner because it obscures the actually existing distinction between 
them and does not have any mathematical advantages. Therefore we shall 
continue to use real space variables and a real time. 

If we put

$$ct = x_0 ; \qquad x = x_1 ; \qquad y = x_2 ; \qquad z = x_3 \tag{20.03}$$

the expression for $ds^2$ becomes

$$ds^2 = dx_0^2 - dx_1^2 - dx_2^2 - dx_3^2 = \sum_{k=0}^{3} e_k\, dx_k^2 \tag{20.04}$$

where the quantities $e_k$ are equal to ±1 (see (8.25)). The direct and inverse
Lorentz transformations can be written as

$$x'_i = \sum_{k=0}^{3} e_k a_{ik} x_k \tag{20.05}$$

and

$$x_i = \sum_{k=0}^{3} e_k a_{ki} x'_k \tag{20.06}$$

Therefore the differentials of old and new coordinates will be connected by

$$dx'_i = \sum_{k=0}^{3} e_k a_{ik}\, dx_k \tag{20.07}$$

and the partial derivatives of any function $\varphi$ with respect to the old and the
new coordinates by

$$\frac{\partial \varphi}{\partial x'_i} = \sum_{k=0}^{3} e_i a_{ik} \frac{\partial \varphi}{\partial x_k} \tag{20.08}$$

If the quadratic form (20.04) were definite, all the numbers $e_k$ would be equal
and so would the coefficients in (20.07) and (20.08). In fact, we have

$$e_0 = 1 ; \qquad e_1 = e_2 = e_3 = -1 \tag{20.09}$$


and some of the corresponding coefficients in the two equations differ in sign.
In contrast to the case of purely spatial rotations, the transformation laws are
different for coordinate differentials and for partial derivatives with respect
to the coordinates. This must be remembered when defining a vector. We
must distinguish between vectors whose components transform like the partial
derivatives with respect to the coordinates and those for which they transform
like coordinate differentials. The former will be called covariant, the latter
contravariant. We shall write the components of a covariant vector with suffixes,
or lower indices, and the components of a contravariant one with superfixes, or
upper indices. Thus the transformation law for a covariant vector is written as

$$A'_i = \sum_{k=0}^{3} e_i a_{ik} A_k \tag{20.10}$$

and for a contravariant one as

$$A'^i = \sum_{k=0}^{3} e_k a_{ik} A^k \tag{20.11}$$

These relations can also be expressed as

$$A'_i = \sum_{k=0}^{3} \frac{\partial x_k}{\partial x'_i} A_k \tag{20.12}$$

and

$$A'^i = \sum_{k=0}^{3} \frac{\partial x'_i}{\partial x_k} A^k \tag{20.13}$$


One should note that the coordinates and their differentials are exceptional
in violating the rule of index positioning. The notation $dx_0$, $dx_1$, $dx_2$, $dx_3$ is
customary although by the general rule one should write $(dx)^0$, $(dx)^1$, $(dx)^2$
and $(dx)^3$.

If a covariant vector $A_i$ is given we can always introduce a corresponding
contravariant vector by putting

$$A^i = e_i A_i \tag{20.14}$$

The two vectors are not essentially different and we shall speak not of two
vectors but of the covariant and contravariant components of the same vector
and denote them by the same letter.

The scalar product of two vectors $A_i$ and $B_i$ is defined as the sum

$$\sum_{i=0}^{3} e_i A_i B_i = \sum_{i=0}^{3} e_i A^i B^i \tag{20.15}$$

It can also be written as

$$\sum_{i=0}^{3} A_i B^i = \sum_{i=0}^{3} A^i B_i \tag{20.16}$$


This quantity is invariant under Lorentz transformations. 
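A small numerical check of this invariance may be useful. The sketch below assumes a transformation without spatial rotation, with relative velocity along $x$; the matrix returned by the hypothetical helper `boost_x` holds the coefficients $e_k a_{ik}$ of (20.11), and the parameter `beta` (standing for $V/c$) and the component values are arbitrary.

```python
import math

E_SIGN = [1.0, -1.0, -1.0, -1.0]   # e_0, e_1, e_2, e_3 of (20.09)

def boost_x(beta):
    """Coefficients e_k a_ik of (20.11) for relative motion along x (assumed case)."""
    g = 1.0 / math.sqrt(1.0 - beta * beta)
    return [[g, -g * beta, 0.0, 0.0],
            [-g * beta, g, 0.0, 0.0],
            [0.0, 0.0, 1.0, 0.0],
            [0.0, 0.0, 0.0, 1.0]]

def transform_contravariant(m, a_up):
    """A'^i = sum over k of (e_k a_ik) A^k."""
    return [sum(m[i][k] * a_up[k] for k in range(4)) for i in range(4)]

def lower(a_up):
    """A_i = e_i A^i, which is (20.14) solved for the covariant components."""
    return [E_SIGN[i] * a_up[i] for i in range(4)]

def scalar_product(a_up, b_up):
    """sum over i of A_i B^i, as in (20.16)."""
    a_dn = lower(a_up)
    return sum(a_dn[i] * b_up[i] for i in range(4))

A = [2.0, 1.0, -0.5, 0.3]          # contravariant components, illustrative
B = [1.5, -0.2, 0.7, 1.1]
m = boost_x(0.6)
before = scalar_product(A, B)
after = scalar_product(transform_contravariant(m, A),
                       transform_contravariant(m, B))
```

The values `before` and `after` coincide up to rounding, although every component of the two vectors with suffix 0 or 1 has changed.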

The scalar product of a vector with itself

$$\sum_{i=0}^{3} A_i A^i = (A_0)^2 - (A_1)^2 - (A_2)^2 - (A_3)^2 \tag{20.17}$$

can be positive or negative. A vector for which (20.17) is positive is called
“ time-like ” and a vector for which this expression is negative is called
“ space-like ”.


As an example of a time-like vector we can mention the four-dimensional
velocity, whose components are defined by the equations

$$u^0 = \frac{c}{\sqrt{1 - v^2/c^2}} ; \qquad u^i = \frac{v_i}{\sqrt{1 - v^2/c^2}} \quad (i = 1, 2, 3) \tag{20.18}$$


The fact that this set of quantities is a vector can be seen as follows : We know
that the quantity

$$c\, d\tau = ds = \sqrt{c^2 - v^2}\; dt \tag{20.19}$$

is invariant; on the other hand the quantities (20.18) can be written in the
form

$$u^0 = c\,\frac{dt}{d\tau} = \frac{dx_0}{d\tau} ; \qquad u^i = \frac{dx_i}{d\tau} \quad (i = 1, 2, 3) \tag{20.20}$$

Hence it is clear that $u^0$, $u^1$, $u^2$ and $u^3$ transform like the differentials of the
coordinates $x_0$, $x_1$, $x_2$, $x_3$. They are thus components of a contravariant vector.
That the velocity vector is time-like follows from the identity

$$(u^0)^2 - (u^1)^2 - (u^2)^2 - (u^3)^2 = c^2 \tag{20.21}$$
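The identity (20.21) can be confirmed for any sample velocity below that of light. In this sketch the function names and the chosen velocity are illustrative assumptions; only the formulas (20.18) and (20.21) come from the text.

```python
import math

C = 299792458.0   # speed of light in metres per second

def four_velocity(v):
    """Components (u^0, u^1, u^2, u^3) of (20.18) for a three-velocity v."""
    v2 = sum(x * x for x in v)
    g = 1.0 / math.sqrt(1.0 - v2 / C**2)
    return [g * C] + [g * x for x in v]

def minkowski_square(u):
    """(u^0)^2 - (u^1)^2 - (u^2)^2 - (u^3)^2, the left side of (20.21)."""
    return u[0]**2 - u[1]**2 - u[2]**2 - u[3]**2

# A sample velocity with v^2 = 0.5 c^2.
u = four_velocity([0.3 * C, -0.4 * C, 0.5 * C])
relative_error = abs(minkowski_square(u) - C**2) / C**2
```

Since the result is the positive constant $c^2$ whatever the velocity, the vector is time-like in the sense of (20.17).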

The four-dimensional vector of acceleration whose components are defined by

$$w^0 = \frac{du^0}{d\tau} ; \qquad w^i = \frac{du^i}{d\tau} \quad (i = 1, 2, 3) \tag{20.22}$$

is an example of a space-like vector. In more detail, we can write

$$w^0 = \frac{1}{\sqrt{1 - v^2/c^2}}\, \frac{d}{dt}\!\left(\frac{c}{\sqrt{1 - v^2/c^2}}\right) ; \qquad w^i = \frac{1}{\sqrt{1 - v^2/c^2}}\, \frac{d}{dt}\!\left(\frac{1}{\sqrt{1 - v^2/c^2}}\, \frac{dx_i}{dt}\right) \quad (i = 1, 2, 3) \tag{20.23}$$

The space-like nature of this vector is proved by differentiating the identity
(20.21) with respect to time, or to $\tau$, which gives

$$u^0 w^0 - u^1 w^1 - u^2 w^2 - u^3 w^3 = 0 \tag{20.24}$$

or

$$c\, w^0 - v_1 w^1 - v_2 w^2 - v_3 w^3 = 0 \tag{20.25}$$

Hence we get the inequality

$$(w^0)^2 \le \frac{v^2}{c^2} \left\{ (w^1)^2 + (w^2)^2 + (w^3)^2 \right\} < (w^1)^2 + (w^2)^2 + (w^3)^2 \tag{20.26}$$

which proves our statement that w is space-like. 
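The step from (20.25) to (20.26) is short but worth writing out. The following sketch uses only the Cauchy-Schwarz inequality for the three-dimensional sum and the fact that $v < c$; the strict inequality at the end presupposes that the spatial components of $w$ do not all vanish.

```latex
\begin{aligned}
c\, w^0 &= v_1 w^1 + v_2 w^2 + v_3 w^3 , \\
(c\, w^0)^2 &\le \left( v_1^2 + v_2^2 + v_3^2 \right)
               \left\{ (w^1)^2 + (w^2)^2 + (w^3)^2 \right\} , \\
(w^0)^2 &\le \frac{v^2}{c^2} \left\{ (w^1)^2 + (w^2)^2 + (w^3)^2 \right\}
          < (w^1)^2 + (w^2)^2 + (w^3)^2 .
\end{aligned}
```

Consequently $(w^0)^2 - (w^1)^2 - (w^2)^2 - (w^3)^2 < 0$, which is the condition for a space-like vector stated after (20.17).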


21. Four-Dimensional Tensors 

One can define tensors of the second and higher ranks in the four-dimensional 
space time manifold just as in three-dimensional space. 

A set of quantities transforming under Lorentz transformations according
to the law

$$T'_{ik} = \sum_{j,l=0}^{3} \frac{\partial x_j}{\partial x'_i}\, \frac{\partial x_l}{\partial x'_k}\, T_{jl} \tag{21.01}$$

or

$$T'_{ik} = \sum_{j,l=0}^{3} e_i e_k a_{ij} a_{kl}\, T_{jl} \tag{21.02}$$

is called a covariant tensor of the second rank. Similarly a set of quantities
transforming according to

$$T'^{ik} = \sum_{j,l=0}^{3} \frac{\partial x'_i}{\partial x_j}\, \frac{\partial x'_k}{\partial x_l}\, T^{jl} \tag{21.03}$$

or

$$T'^{ik} = \sum_{j,l=0}^{3} e_j e_l a_{ij} a_{kl}\, T^{jl} \tag{21.04}$$

is called a contravariant tensor. Lastly, one can also define a mixed tensor of
the second rank as one that is covariant in one index and contravariant in the
other. Its components transform according to

$$T'^{i}_{k} = \sum_{j,l=0}^{3} \frac{\partial x'_i}{\partial x_j}\, \frac{\partial x_l}{\partial x'_k}\, T^{j}_{l} \tag{21.05}$$

or

$$T'^{i}_{k} = \sum_{j,l=0}^{3} e_j e_k a_{ij} a_{kl}\, T^{j}_{l} \tag{21.06}$$

If all four numbers $e_i$ were equal, the coefficients in (21.02), (21.04) and
(21.06) would be the same; in fact there are sign differences between
corresponding coefficients.

The transformation formulae (21.02), (21.04) and (21.06) are not essentially
different, and one can easily pass from quantities transforming according to one
of them to others transforming according to either of the other two. To do
this one simply puts

$$T^{ik} = e_i e_k T_{ik} \tag{21.07}$$

while $T^{i}_{k}$ may be defined either as

$$T^{i}_{k} = e_i T_{ik} \tag{21.08}$$

or as

$$T^{i}_{k} = e_i T_{ki} \tag{21.09}$$

If the tensor is symmetric the quantities (21.08) and (21.09) are the same.
Just as for a vector, we shall not speak of different tensors but of covariant,
contravariant and mixed components of one and the same tensor.

Similarly, tensors of the third and higher ranks may be introduced. For
example a tensor of third rank with three covariant indices transforms
according to the law

$$T'_{ikm} = \sum_{j,l,n=0}^{3} \frac{\partial x_j}{\partial x'_i}\, \frac{\partial x_l}{\partial x'_k}\, \frac{\partial x_n}{\partial x'_m}\, T_{jln} \tag{21.10}$$

or

$$T'_{ikm} = \sum_{j,l,n=0}^{3} e_i e_k e_m a_{ij} a_{kl} a_{mn}\, T_{jln} \tag{21.11}$$

As in the three-dimensional case, one can form a scalar from components of a
second rank tensor, namely

$$T = \sum_{i=0}^{3} e_i T_{ii} \tag{21.12}$$

and three vectors can be constructed from the components of a tensor of the
third rank :

$$A_i = \sum_{k} e_k T_{ikk} ; \qquad B_i = \sum_{k} e_k T_{kik} ; \qquad C_i = \sum_{k} e_k T_{kki} \tag{21.13}$$

k k k 

We shall not dwell on the question touched on at the end of Section 19
concerning quantities other than tensors whose transformation laws are the simplest
in a certain sense. This is because in all non-quantal applications of the Theory
of Relativity the main role is played by tensors.

Given two tensors other tensors can be constructed of a rank equal to the
sum, or to the difference, of the ranks of the given tensors and also tensors
of ranks intermediate between these, but of the same parity as the sum and
difference. We shall show this by examples. In Section 20 we saw that two
vectors $A_i$ and $B_i$ can form the scalar

$$C = \sum_{i=0}^{3} e_i A_i B_i \tag{21.14}$$

Vectors are to be thought of as tensors of the first rank and scalars as tensors
of zero rank. We can also use the same vectors to construct two tensors of
second rank, namely

$$C_{ik} = A_i B_k \qquad \text{and} \qquad C_{ik} = B_i A_k \tag{21.15}$$

Given a tensor of second rank $T_{ik}$ and a vector $A_i$, the quantity

$$B_i = \sum_{k} e_k A_k T_{ik} \tag{21.16}$$

will be a vector and the quantity

$$C_{ikl} = A_i T_{kl} \tag{21.17}$$

a third rank tensor; so will the quantities obtained by interchanging suffixes
on the right of (21.17).

As a last example we take second rank tensors $T_{ik}$ and $U_{ik}$ and form the scalar

$$C = \sum_{i,k=0}^{3} e_i e_k T_{ik} U_{ik} \tag{21.18}$$

the second rank tensor

$$C_{ik} = \sum_{m} e_m T_{im} U_{km} \tag{21.19}$$

and the fourth rank tensor

$$C_{iklm} = T_{ik} U_{lm} \tag{21.20}$$

In this case, evidently

$$C_{ik} = \sum_{m} e_m C_{imkm} \tag{21.21}$$

and

$$C = \sum_{i} e_i C_{ii} \tag{21.22}$$




The tensor character of all these quantities is readily verified from the properties 
of the coefficients of a Lorentz transformation as given by (10.04) and (10.05). 

It should be observed that in all sums containing products of covariant
tensor components these products are multiplied by the sign factors $e_i$, $e_k$, . . .
where i, k, etc., are suffixes over which a summation is performed. Instead one
can write these sums so as to have each index occurring once as the covariant
suffix of one tensor, and the other time as the contravariant superfix of a
second tensor. In this form the sign factors do not appear. Thus, for example,
the scalar product (21.14) can be written in the form (20.16) and the sum
(21.18), using (21.07), as

$$C = \sum_{i,k=0}^{3} T^{ik} U_{ik} \tag{21.23}$$
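The equivalence of the two ways of writing such sums can be seen on a small example. In this sketch the integer components of the two tensors are arbitrary illustrative values, and the function names are not taken from the text.

```python
E_SIGN = [1, -1, -1, -1]   # e_0, e_1, e_2, e_3
R = range(4)

def raise_both(t_dn):
    """T^{ik} = e_i e_k T_ik, equation (21.07)."""
    return [[E_SIGN[i] * E_SIGN[k] * t_dn[i][k] for k in R] for i in R]

def scalar_with_signs(t_dn, u_dn):
    """C = sum of e_i e_k T_ik U_ik, equation (21.18)."""
    return sum(E_SIGN[i] * E_SIGN[k] * t_dn[i][k] * u_dn[i][k]
               for i in R for k in R)

def scalar_mixed_positions(t_up, u_dn):
    """C = sum of T^{ik} U_ik, equation (21.23); no sign factors appear."""
    return sum(t_up[i][k] * u_dn[i][k] for i in R for k in R)

T = [[i + 2 * k for k in R] for i in R]          # covariant components
U = [[(i - k) * (k + 1) for k in R] for i in R]  # covariant components
```

Both `scalar_with_signs(T, U)` and `scalar_mixed_positions(raise_both(T), U)` give the same number; the sign factors have simply been absorbed into the contravariant components.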

Given a tensor, other tensors can be obtained not only by multiplication by
a second tensor but also by differentiation. The operation of differentiating
with respect to a coordinate $x_i$ then appears in place of multiplication by a
covariant component of some vector. For instance, the quantity

$$\sum_{i=0}^{3} e_i \frac{\partial A_i}{\partial x_i} = \sum_{i=0}^{3} \frac{\partial A^i}{\partial x_i} \tag{21.24}$$

is a scalar and the set of quantities

$$C_i = \sum_{k=0}^{3} e_k \frac{\partial T_{ik}}{\partial x_k} \tag{21.25}$$

is a vector (the divergence of the tensor $T_{ik}$). If in particular

$$A_i = \frac{\partial \varphi}{\partial x_i} \tag{21.26}$$

where $\varphi$ is some scalar, the result of inserting $A_i$ into (21.24),

$$\Box \varphi = \sum_{i=0}^{3} e_i \frac{\partial^2 \varphi}{\partial x_i^2} = \frac{\partial^2 \varphi}{\partial x_0^2} - \frac{\partial^2 \varphi}{\partial x_1^2} - \frac{\partial^2 \varphi}{\partial x_2^2} - \frac{\partial^2 \varphi}{\partial x_3^2} \tag{21.27}$$

must also be a scalar. The symbol $\Box$ is the d’Alembert operator.
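As an illustration of (21.27), the d’Alembert operator can be approximated by central differences and applied to a function of $x_0 - x_1$, which should give zero. The step size `h`, the test functions and the sample point are assumptions made for this sketch only.

```python
import math

E_SIGN = [1.0, -1.0, -1.0, -1.0]   # e_0, e_1, e_2, e_3

def dalembert(phi, x, h=1e-3):
    """Approximate the sum of e_i d^2(phi)/dx_i^2 by central differences."""
    total = 0.0
    for i in range(4):
        plus, minus = list(x), list(x)
        plus[i] += h
        minus[i] -= h
        total += E_SIGN[i] * (phi(plus) - 2.0 * phi(x) + phi(minus)) / (h * h)
    return total

def wave(x):
    """A disturbance travelling along x_1 with the speed of light."""
    return math.sin(x[0] - x[1])

def static_parabola(x):
    """A function that is not a wave, included for contrast."""
    return x[1] ** 2

point = [0.3, 1.1, -0.4, 2.0]   # (x_0, x_1, x_2, x_3), arbitrary
```

For `wave` the two non-vanishing second differences cancel, whereas for `static_parabola` the result is close to $-2$, the contribution of the single term with $e_1 = -1$.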


22. Pseudo-Tensors 

In addition to tensors it is convenient to consider other quantities whose
transformation law depends on the sign of the determinant

$$D = \frac{D(x_0, x_1, x_2, x_3)}{D(x'_0, x'_1, x'_2, x'_3)} = \mathrm{Det}\,(e_i a_{ik}) \tag{22.01}$$

of the transformation. The properties of the coefficients in a Lorentz
transformation ensure that the square of $D$ is always unity. $D$ itself will be +1
for proper Lorentz transformations, i.e. those which preserve the sense of time
and also the right- or left-handedness of the spatial axes, but −1 for improper
transformations. Although we agreed to consider proper transformations
only, it is useful for the classification of geometric quantities to know also their
behaviour for improper transformations.

We consider a set of quantities $\varepsilon_{iklm}$, antisymmetric with respect to all
suffixes and with $\varepsilon_{0123} = 1$. It follows that if two or more suffixes are equal
$\varepsilon_{iklm}$ must vanish, that $\varepsilon_{iklm} = +1$ if $(i\,k\,l\,m)$ is an even permutation of
$(0\,1\,2\,3)$ and that $\varepsilon_{iklm} = -1$ if it is an odd permutation.
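The rule just stated translates directly into a short routine. This is a minimal sketch; the function name `epsilon` is an assumption, and the parity is found by counting inversions, which is one of several equivalent ways of doing it.

```python
def epsilon(*suffixes):
    """Completely antisymmetric symbol with epsilon(0, 1, 2, 3) == 1.

    Returns 0 if any suffix is repeated, otherwise +1 or -1 according to
    whether `suffixes` is an even or an odd permutation of the natural order.
    """
    if len(set(suffixes)) != len(suffixes):
        return 0
    sign = 1
    for a in range(len(suffixes)):
        for b in range(a + 1, len(suffixes)):
            if suffixes[a] > suffixes[b]:   # each inversion reverses the sign
                sign = -sign
    return sign
```

For example `epsilon(1, 0, 2, 3)` is −1, since a single interchange of suffixes is an odd permutation, and `epsilon(0, 0, 2, 3)` is 0.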

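The identity

$$\sum_{p,q,r,s=0}^{3} \varepsilon_{pqrs}\, \frac{\partial x_p}{\partial x'_i}\, \frac{\partial x_q}{\partial x'_k}\, \frac{\partial x_r}{\partial x'_l}\, \frac{\partial x_s}{\partial x'_m} = D\, \varepsilon_{iklm} \tag{22.02}$$

is readily verified, for the left-hand side is the expansion of the determinant

$$\begin{vmatrix} \dfrac{\partial x_0}{\partial x'_i} & \dfrac{\partial x_0}{\partial x'_k} & \dfrac{\partial x_0}{\partial x'_l} & \dfrac{\partial x_0}{\partial x'_m} \\ \vdots & \vdots & \vdots & \vdots \\ \dfrac{\partial x_3}{\partial x'_i} & \dfrac{\partial x_3}{\partial x'_k} & \dfrac{\partial x_3}{\partial x'_l} & \dfrac{\partial x_3}{\partial x'_m} \end{vmatrix}$$

which is obtained from $D$ by permuting columns, provided all four suffixes
$(i\,k\,l\,m)$ are different, and which vanishes otherwise.

Equation (22.02) shows that if $D = 1$ the $\varepsilon$ transform like components of a
tensor, but that if $D = -1$ their transformation law differs in sign from the
tensor rule. A set of quantities having such transformation laws is called a
pseudo-tensor.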

We can consider the $\varepsilon_{iklm}$ to be covariant components of an antisymmetric
pseudo-tensor of fourth rank. Its contravariant components are, by the
general rule,

$$\varepsilon^{iklm} = e_i e_k e_l e_m\, \varepsilon_{iklm} \tag{22.03}$$

but since the $\varepsilon$ are only different from zero if all suffixes are different, in which
case $e_i e_k e_l e_m = e_0 e_1 e_2 e_3 = -1$, we have more simply

$$\varepsilon^{iklm} = -\varepsilon_{iklm} \tag{22.04}$$

If $\varphi$ is a scalar the set of quantities

$$\varphi_{iklm} = \varphi\, \varepsilon_{iklm} \tag{22.05}$$

is an antisymmetrical pseudo-tensor of fourth rank. Such a pseudo-tensor
has only one component, just like a scalar. Hence it is customarily known as a
pseudo-scalar.

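To every antisymmetric tensor $A_{ik}$ of the second rank one can adjoin an
antisymmetric pseudo-tensor $\overset{*}{A}{}^{ik}$ of the same rank by the equation

$$\overset{*}{A}{}^{ik} = \frac{1}{2} \sum_{l,m=0}^{3} \varepsilon^{iklm} A_{lm} \tag{22.06}$$

It is called the dual of the given tensor. Only two terms in the sum (22.06)
differ from zero and these terms are equal, therefore we have

$$\begin{aligned} \overset{*}{A}{}^{10} &= A_{23} ; & \overset{*}{A}{}^{20} &= A_{31} ; & \overset{*}{A}{}^{30} &= A_{12} \\ \overset{*}{A}{}^{23} &= A_{10} ; & \overset{*}{A}{}^{31} &= A_{20} ; & \overset{*}{A}{}^{12} &= A_{30} \end{aligned} \tag{22.07}$$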
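The table (22.07) can be reproduced mechanically from (22.06) and (22.03). The sketch below is illustrative: the helper names and the six numerical components are assumptions, and the permutation parity is again found by counting inversions.

```python
from itertools import permutations

R = range(4)
E_SIGN = [1, -1, -1, -1]   # e_0, e_1, e_2, e_3

def parity(p):
    """+1 for an even, -1 for an odd permutation of (0, 1, 2, 3)."""
    sign = 1
    for a in range(len(p)):
        for b in range(a + 1, len(p)):
            if p[a] > p[b]:
                sign = -sign
    return sign

# Non-vanishing contravariant components eps^{iklm}, following (22.03).
EPS_UP = {p: E_SIGN[p[0]] * E_SIGN[p[1]] * E_SIGN[p[2]] * E_SIGN[p[3]] * parity(p)
          for p in permutations(R)}

def dual(a_dn):
    """Dual components: one half of the sum of eps^{iklm} A_lm, as in (22.06)."""
    out = [[0.0] * 4 for _ in R]
    for (i, k, l, m), eps in EPS_UP.items():
        out[i][k] += 0.5 * eps * a_dn[l][m]
    return out

def antisymmetric(a10, a20, a30, a23, a31, a12):
    """Build an antisymmetric A_ik from its six independent components."""
    a = [[0.0] * 4 for _ in R]
    entries = {(1, 0): a10, (2, 0): a20, (3, 0): a30,
               (2, 3): a23, (3, 1): a31, (1, 2): a12}
    for (i, k), value in entries.items():
        a[i][k] = value
        a[k][i] = -value
    return a

A = antisymmetric(1.0, 2.0, 3.0, 4.0, 5.0, 6.0)
A_star = dual(A)
```

With these values `A_star[1][0]` equals `A[2][3]`, `A_star[2][3]` equals `A[1][0]`, and so on through the six relations of (22.07).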


Similarly there corresponds to any antisymmetric tensor of the third rank a
pseudo-vector, i.e. a pseudo-tensor of the first rank, given by the equation

$$\overset{*}{A}{}^{i} = \frac{1}{6} \sum_{k,l,m=0}^{3} \varepsilon^{iklm} A_{klm} \tag{22.08}$$

There are six non-vanishing terms in the sum, but all six are equal. Therefore
we obtain the explicit form

$$\overset{*}{A}{}^{0} = -A_{123} ; \qquad \overset{*}{A}{}^{1} = A_{230} ; \qquad \overset{*}{A}{}^{2} = A_{310} ; \qquad \overset{*}{A}{}^{3} = A_{120} \tag{22.09}$$

for the components of the pseudo-vector.

We shall refer for short to the “ product of a tensor with a tensor ”, meaning
a tensor obtained by multiplying the components of two given tensors
according to the rules of Section 21. Then we can say that the product of two
pseudo-tensors, like the product of two tensors, is a tensor, but the product of a
tensor with a pseudo-tensor is a pseudo-tensor. We have, in fact, already used
this rule in forming the pseudo-tensor $\overset{*}{A}{}^{ik}$ and the pseudo-vector $\overset{*}{A}{}^{i}$.


23. Infinitesimal Lorentz Transformations 

The general Lorentz transformation, including a shift of origin, has the form

$$x'_i = a_i + \sum_{k=0}^{3} e_k a_{ik} x_k \tag{23.01}$$

We examine the case when this transformation is only infinitesimally different
from the identity. In this case the constants $a_i$ are infinitesimal and the
coefficients $a_{ik}$ may be written as

$$a_{ik} = e_k \delta_{ik} + \omega^{ik} \tag{23.02}$$

where the $\omega^{ik}$ are infinitesimal. To stress the fact that the coordinates form a
contravariant vector, we shall, in this section, write them with superfixes.

$$x^0 = ct ; \qquad x^1 = x , \quad x^2 = y , \quad x^3 = z \tag{23.03}$$

We shall likewise write the infinitesimal constants describing the shift of origin
as $a^i$ instead of $a_i$. In this notation the result of inserting (23.02) into (23.01)
is

$$x'^i = x^i + a^i + \sum_{k=0}^{3} e_k \omega^{ik} x^k \tag{23.04}$$

For an infinitesimal transformation the distinction between the given and the
transformed frame is not important and therefore the differences

$$\Delta x^i = x'^i - x^i \tag{23.05}$$

can be treated as the components of a vector referred to either system, say to
the first one. So the expressions

$$\Delta x^i = a^i + \sum_{k=0}^{3} e_k \omega^{ik} x^k \tag{23.06}$$

represent an infinitesimal contravariant vector. Hence the constants $a^i$
themselves form such a vector and the quantities $\omega^{ik}$ a contravariant tensor. This
tensor is antisymmetric, for on inserting (23.02) into the orthogonality
conditions for the coefficients of a Lorentz transformation

$$\sum_{i=0}^{3} e_i a_{ik} a_{il} = e_k \delta_{kl} \tag{23.07}$$

and neglecting terms of second order in the infinitesimals we get

$$\omega^{ik} + \omega^{ki} = 0 \tag{23.08}$$

An antisymmetric tensor of rank 2 has six independent components. These
six quantities together with the four components of the infinitesimal
displacement vector $a^i$ are the ten independent parameters that characterize the
infinitesimal Lorentz transformation.
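The role of the antisymmetry can be illustrated numerically. The sketch below assumes the orthogonality conditions in the form $\sum_i e_i a_{ik} a_{il} = e_k \delta_{kl}$ and builds coefficients $a_{ik} = e_k \delta_{ik} + \omega^{ik}$ from a small multiple of a fixed pattern; the patterns and the scale `eps` are arbitrary choices.

```python
E_SIGN = [1.0, -1.0, -1.0, -1.0]   # e_0, e_1, e_2, e_3
R = range(4)

def infinitesimal_coefficients(omega):
    """a_ik = e_k delta_ik + omega^{ik}."""
    return [[(E_SIGN[k] if i == k else 0.0) + omega[i][k] for k in R] for i in R]

def orthogonality_defect(a):
    """Largest deviation of sum_i e_i a_ik a_il from e_k delta_kl."""
    worst = 0.0
    for k in R:
        for l in R:
            s = sum(E_SIGN[i] * a[i][k] * a[i][l] for i in R)
            target = E_SIGN[k] if k == l else 0.0
            worst = max(worst, abs(s - target))
    return worst

def scaled(matrix, eps):
    return [[eps * x for x in row] for row in matrix]

# An antisymmetric pattern with six independent entries ...
ANTISYM = [[0, 1, 2, 3], [-1, 0, 4, 5], [-2, -4, 0, 6], [-3, -5, -6, 0]]
# ... and a symmetric one, which violates the antisymmetry condition.
SYM = [[0, 1, 2, 3], [1, 0, 4, 5], [2, 4, 0, 6], [3, 5, 6, 0]]

eps = 1e-4
defect_antisym = orthogonality_defect(infinitesimal_coefficients(scaled(ANTISYM, eps)))
defect_sym = orthogonality_defect(infinitesimal_coefficients(scaled(SYM, eps)))
```

For the antisymmetric pattern the defect is of second order in `eps`, while for the symmetric pattern it is of first order, which is why only antisymmetric $\omega^{ik}$ are admissible.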

Let us perform two infinitesimal transformations consecutively, one with
the parameters $a^i$, $\omega^{ik}$, the other with parameters $b^i$, $\varphi^{ik}$. It is easy to see
that, neglecting second order terms, the combined result of the transformations
is equivalent to a single transformation whose displacement $c^i$ and
antisymmetric tensor are the sums

$$c^i = a^i + b^i ; \qquad \omega^{ik} + \varphi^{ik} \tag{23.09}$$

This shows in particular that the result of two infinitesimal transformations
does not depend on the order in which they are performed. We can therefore
consider a displacement of origin, a change to a moving frame and an infinitesimal
rotation of the axes separately and in turn.

Putting

$$a^0 = c\tau ; \qquad a^1 = a_x ; \qquad a^2 = a_y ; \qquad a^3 = a_z \tag{23.10}$$

we get a transformation describing a change of time origin by $\tau$ to an earlier
instant and a shift of spatial origin backward along the vector $\mathbf{a}$ by a distance
equal to the magnitude of $\mathbf{a}$. The passage to a frame moving with an
infinitesimal velocity $\mathbf{V}$ is represented by the transformation

$$\begin{aligned} x' - x &= \Delta x = -V_x t \\ y' - y &= \Delta y = -V_y t \\ z' - z &= \Delta z = -V_z t \end{aligned} \tag{23.11}$$

of the space coordinates together with the transformation

$$t' - t = \Delta t = -\frac{1}{c^2}\,(x V_x + y V_y + z V_z) \tag{23.12}$$

of the time.

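Going from three- to four-dimensional symbols and comparing coefficients in
(23.11) and (23.06) we get

$$\omega^{10} = -\frac{V_x}{c} ; \qquad \omega^{20} = -\frac{V_y}{c} ; \qquad \omega^{30} = -\frac{V_z}{c} \tag{23.13}$$

On the other hand, comparing coefficients in (23.12) and (23.06) gives

$$\omega^{01} = \frac{V_x}{c} ; \qquad \omega^{02} = \frac{V_y}{c} ; \qquad \omega^{03} = \frac{V_z}{c} \tag{23.14}$$

a result that is obvious from the antisymmetry of the tensor $\omega^{ik}$.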

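Let us finally consider an infinitesimal rotation of spatial axes. By a
well-known formula in kinematics we have

$$\Delta \mathbf{r} = [\boldsymbol{\omega} \times \mathbf{r}] \tag{23.15}$$

where $\boldsymbol{\omega}$ is the vector of an infinitesimal rotation, having components equal to
the angles of rotation about the axes of x, y and z.

Comparing (23.15) with (23.06) we get

$$\omega^{23} = -\omega^{32} = \omega_x ; \qquad \omega^{31} = -\omega^{13} = \omega_y ; \qquad \omega^{12} = -\omega^{21} = \omega_z \tag{23.16}$$

We also write down the covariant components $\omega_{ik}$. Lowering indices by the
usual rule we get

$$\begin{aligned} \omega_{10} &= \frac{V_x}{c} ; & \omega_{20} &= \frac{V_y}{c} ; & \omega_{30} &= \frac{V_z}{c} \\ \omega_{23} &= \omega_x ; & \omega_{31} &= \omega_y ; & \omega_{12} &= \omega_z \end{aligned} \tag{23.17}$$

with all other components following from antisymmetry.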

In conclusion we remark that it is possible to obtain the equations of finite 
Lorentz transformations from a study of the infinitesimal transformations, but 
we shall not discuss this question here. 


24. The Transformation Laws for the Electromagnetic Field and the 
Covariance of Maxwell’s Equations 

Our derivation of the basic laws of the Theory of Relativity started from the 
propagation law for an electromagnetic wave front. This law is a result of 
Maxwell’s equations as was shown in Section 3. We must now verify that 
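Maxwell’s equations are really covariant with respect to Lorentz transformations.
To do this we must first find the law according to which an electromagnetic
field transforms in going from one reference frame to another.
It is well known that the electromagnetic field can be expressed in terms
of a scalar and a vector potential by the equations

$$\begin{aligned} H_1 &= \frac{\partial A_3}{\partial x_2} - \frac{\partial A_2}{\partial x_3} ; & E_1 &= -\frac{1}{c} \frac{\partial A_1}{\partial t} - \frac{\partial \Phi}{\partial x_1} \\ H_2 &= \frac{\partial A_1}{\partial x_3} - \frac{\partial A_3}{\partial x_1} ; & E_2 &= -\frac{1}{c} \frac{\partial A_2}{\partial t} - \frac{\partial \Phi}{\partial x_2} \\ H_3 &= \frac{\partial A_2}{\partial x_1} - \frac{\partial A_1}{\partial x_2} ; & E_3 &= -\frac{1}{c} \frac{\partial A_3}{\partial t} - \frac{\partial \Phi}{\partial x_3} \end{aligned} \tag{24.01}$$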

The field becomes zero when, and only when, the linear differential form

$$\delta\varphi = -c\Phi\, dt + A_1\, dx_1 + A_2\, dx_2 + A_3\, dx_3 \tag{24.02}$$

is a total differential. We assume that this linear form is invariant or scalar.
This is so if $A_1$, $A_2$ and $A_3$ are the spatial (covariant) components of a
four-dimensional vector whose fourth component is

$$A_0 = -\Phi \tag{24.03}$$

Bearing in mind that $ct = x_0$ we can write $\delta\varphi$ in the form

$$\delta\varphi = \sum_{i=0}^{3} A_i\, dx_i \tag{24.04}$$

The condition

$$\operatorname{div} \mathbf{A} + \frac{1}{c} \frac{\partial \Phi}{\partial t} = 0 \tag{24.05}$$

usually imposed on the potentials is invariant under Lorentz transformations
because it can be written as

$$\sum_{i=0}^{3} e_i \frac{\partial A_i}{\partial x_i} = 0 \tag{24.06}$$

or, with the introduction of contravariant components, by

$$A^0 = A_0 = -\Phi ; \qquad A^1 = -A_1 ; \qquad A^2 = -A_2 ; \qquad A^3 = -A_3 \tag{24.07}$$

as

$$\sum_{i=0}^{3} \frac{\partial A^i}{\partial x_i} = 0 \tag{24.08}$$

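Inserting $x_0 = ct$ and $A_0 = -\Phi$ into (24.01) we rewrite the expressions for
the fields in the new notation. For the electric field we get

$$\begin{aligned} E_1 &= \frac{\partial A_0}{\partial x_1} - \frac{\partial A_1}{\partial x_0} \\ E_2 &= \frac{\partial A_0}{\partial x_2} - \frac{\partial A_2}{\partial x_0} \\ E_3 &= \frac{\partial A_0}{\partial x_3} - \frac{\partial A_3}{\partial x_0} \end{aligned} \tag{24.09}$$

while the expressions for the magnetic field remain unchanged. All the
equations can be combined in the short form

$$F_{ik} = \frac{\partial A_k}{\partial x_i} - \frac{\partial A_i}{\partial x_k} \tag{24.10}$$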





where we have

$$\begin{aligned} H_1 &= F_{23} & E_1 &= F_{10} \\ H_2 &= F_{31} & E_2 &= F_{20} \\ H_3 &= F_{12} & E_3 &= F_{30} \end{aligned} \tag{24.11}$$

If $(A_i)$ is a covariant vector, then $(F_{ik})$ is an antisymmetric tensor of second
rank. Thus our assumption as to the vectorial nature of the potentials leads
to the conclusion that the electromagnetic field is an antisymmetric tensor in
the four-dimensional space-time manifold. From this it is easy to verify that
Maxwell’s equations are covariant for Lorentz transformations. We construct
from derivatives of field components the third rank tensor

$$F_{ikl} = \frac{\partial F_{ik}}{\partial x_l} + \frac{\partial F_{kl}}{\partial x_i} + \frac{\partial F_{li}}{\partial x_k} \tag{24.12}$$

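It is evidently antisymmetric. By (22.09) we can adjoin to it the pseudo-vector

$$\overset{*}{F}{}^{0} = -F_{123} ; \qquad \overset{*}{F}{}^{1} = F_{230} ; \qquad \overset{*}{F}{}^{2} = F_{310} ; \qquad \overset{*}{F}{}^{3} = F_{120} \tag{24.13}$$

We calculate the components of this pseudo-vector and find

$$\begin{aligned} \overset{*}{F}{}^{0} &= -\left( \frac{\partial F_{23}}{\partial x_1} + \frac{\partial F_{31}}{\partial x_2} + \frac{\partial F_{12}}{\partial x_3} \right) = -\left( \frac{\partial H_1}{\partial x_1} + \frac{\partial H_2}{\partial x_2} + \frac{\partial H_3}{\partial x_3} \right) = -\operatorname{div} \mathbf{H} \\ \overset{*}{F}{}^{1} &= \frac{\partial F_{23}}{\partial x_0} + \frac{\partial F_{02}}{\partial x_3} + \frac{\partial F_{30}}{\partial x_2} = \frac{1}{c} \frac{\partial H_1}{\partial t} - \frac{\partial E_2}{\partial x_3} + \frac{\partial E_3}{\partial x_2} \\ \overset{*}{F}{}^{2} &= \frac{\partial F_{31}}{\partial x_0} + \frac{\partial F_{03}}{\partial x_1} + \frac{\partial F_{10}}{\partial x_3} = \frac{1}{c} \frac{\partial H_2}{\partial t} - \frac{\partial E_3}{\partial x_1} + \frac{\partial E_1}{\partial x_3} \\ \overset{*}{F}{}^{3} &= \frac{\partial F_{12}}{\partial x_0} + \frac{\partial F_{01}}{\partial x_2} + \frac{\partial F_{20}}{\partial x_1} = \frac{1}{c} \frac{\partial H_3}{\partial t} - \frac{\partial E_1}{\partial x_2} + \frac{\partial E_2}{\partial x_1} \end{aligned} \tag{24.14}$$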


For ease of interpretation we have replaced $x_0$ by $ct$ on the right-hand sides.
By Maxwell’s equations

$$\operatorname{curl} \mathbf{E} + \frac{1}{c} \frac{\partial \mathbf{H}}{\partial t} = 0 ; \qquad \operatorname{div} \mathbf{H} = 0 \tag{24.15}$$

so that the right-hand sides of all equations (24.14) are zero.

Therefore the first set of Maxwell’s equations can be written in the covariant
form

$$\overset{*}{F}{}^{i} = 0 \tag{24.16}$$

or

$$F_{ikl} = 0 \tag{24.17}$$

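We now turn to the second set of Maxwell’s equations, passing first from
covariant field components to contravariant ones:

$$\begin{aligned} F^{23} &= H_1 & F^{10} &= -E_1 \\ F^{31} &= H_2 & F^{20} &= -E_2 \\ F^{12} &= H_3 & F^{30} &= -E_3 \end{aligned} \tag{24.18}$$

We form the contravariant vector

$$s^i = \sum_{k=0}^{3} \frac{\partial F^{ik}}{\partial x_k} \tag{24.19}$$

Written out, its components are

$$\begin{aligned} s^0 &= 0 + \frac{\partial F^{01}}{\partial x_1} + \frac{\partial F^{02}}{\partial x_2} + \frac{\partial F^{03}}{\partial x_3} = \frac{\partial E_1}{\partial x_1} + \frac{\partial E_2}{\partial x_2} + \frac{\partial E_3}{\partial x_3} \\ s^1 &= \frac{\partial F^{10}}{\partial x_0} + 0 + \frac{\partial F^{12}}{\partial x_2} + \frac{\partial F^{13}}{\partial x_3} = -\frac{1}{c} \frac{\partial E_1}{\partial t} + \frac{\partial H_3}{\partial x_2} - \frac{\partial H_2}{\partial x_3} \\ s^2 &= \frac{\partial F^{20}}{\partial x_0} + \frac{\partial F^{21}}{\partial x_1} + 0 + \frac{\partial F^{23}}{\partial x_3} = -\frac{1}{c} \frac{\partial E_2}{\partial t} - \frac{\partial H_3}{\partial x_1} + \frac{\partial H_1}{\partial x_3} \\ s^3 &= \frac{\partial F^{30}}{\partial x_0} + \frac{\partial F^{31}}{\partial x_1} + \frac{\partial F^{32}}{\partial x_2} + 0 = -\frac{1}{c} \frac{\partial E_3}{\partial t} + \frac{\partial H_2}{\partial x_1} - \frac{\partial H_1}{\partial x_2} \end{aligned} \tag{24.20}$$

so that the zero component of $s^i$ is

$$s^0 = \operatorname{div} \mathbf{E} \tag{24.21}$$

while the spatial components are equal to the components of the
three-dimensional vector

$$(s^1, s^2, s^3) = -\frac{1}{c} \frac{\partial \mathbf{E}}{\partial t} + \operatorname{curl} \mathbf{H} \tag{24.22}$$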
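The passage from the covariant components (24.11) to the contravariant ones (24.18) is a direct application of (21.07) and can be verified mechanically. In this sketch the field values are arbitrary integers and the helper names are assumptions.

```python
E_SIGN = [1, -1, -1, -1]   # e_0, e_1, e_2, e_3
R = range(4)

def field_tensor(E, H):
    """Covariant F_ik built from (24.11): H_1 = F_23, ..., E_1 = F_10, ..."""
    f = [[0] * 4 for _ in R]
    entries = {(2, 3): H[0], (3, 1): H[1], (1, 2): H[2],
               (1, 0): E[0], (2, 0): E[1], (3, 0): E[2]}
    for (i, k), value in entries.items():
        f[i][k] = value
        f[k][i] = -value          # antisymmetry of the field tensor
    return f

def raise_indices(f_dn):
    """F^{ik} = e_i e_k F_ik, the rule (21.07)."""
    return [[E_SIGN[i] * E_SIGN[k] * f_dn[i][k] for k in R] for i in R]

E = [1, 2, 3]   # hypothetical electric field components
H = [4, 5, 6]   # hypothetical magnetic field components
F_up = raise_indices(field_tensor(E, H))
```

The purely spatial components keep their sign, since $e_i e_k = +1$ for two spatial suffixes, while those with one suffix equal to 0 change sign; this is exactly the pattern of (24.18).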


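In free space the right-hand sides of these equations are zero and therefore
the second set of Maxwell’s equations in free space can be stated in the form

$$s^i = \sum_{k=0}^{3} \frac{\partial F^{ik}}{\partial x_k} = 0 \tag{24.23}$$

In a space occupied by charge of density $\rho$ the Maxwell-Lorentz equations
have the form

$$\operatorname{div} \mathbf{E} = 4\pi\rho ; \qquad \operatorname{curl} \mathbf{H} - \frac{1}{c} \frac{\partial \mathbf{E}}{\partial t} = \frac{4\pi}{c}\, \mathbf{j} \tag{24.24}$$

where $\mathbf{j}$ is the current density. The covariance of Maxwell’s equations is
therefore assured if the quantities

$$s^0 = 4\pi\rho ; \qquad s^i = \frac{4\pi}{c}\, j_i \quad (i = 1, 2, 3) \tag{24.25}$$

represent the contravariant components of a four-dimensional vector. By
the identity

$$\sum_{i,k=0}^{3} \frac{\partial^2 F^{ik}}{\partial x_i\, \partial x_k} = 0 \tag{24.26}$$

we have

$$\sum_{i=0}^{3} \frac{\partial s^i}{\partial x_i} = 0 \tag{24.27}$$

whence

$$\frac{\partial \rho}{\partial t} + \operatorname{div} \mathbf{j} = 0 \tag{24.28}$$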


As is well known, this equation expresses the law of conservation of charge. 
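The chain of steps leading from (24.26) to (24.28) can be written out as follows. This is a sketch of the intermediate algebra only; it uses the antisymmetry of $F^{ik}$, the definition (24.19), the values (24.25) and $x_0 = ct$.

```latex
\begin{aligned}
\sum_{i,k=0}^{3} \frac{\partial^2 F^{ik}}{\partial x_i\,\partial x_k}
  &= -\sum_{i,k=0}^{3} \frac{\partial^2 F^{ki}}{\partial x_k\,\partial x_i} = 0
  && \text{(antisymmetric } F^{ik}\text{, symmetric second derivative)} \\
\sum_{i=0}^{3} \frac{\partial s^i}{\partial x_i}
  &= \sum_{i,k=0}^{3} \frac{\partial^2 F^{ik}}{\partial x_i\,\partial x_k} = 0 \\
\frac{\partial s^0}{\partial x_0} + \sum_{i=1}^{3} \frac{\partial s^i}{\partial x_i}
  &= \frac{4\pi}{c} \left( \frac{\partial \rho}{\partial t}
     + \operatorname{div} \mathbf{j} \right) = 0
\end{aligned}
```

The first line holds because relabelling the summation suffixes turns the sum into its own negative.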

The fact that the quantities (24.25) form a four-dimensional vector is
perfectly consistent with the physical interpretation given to them by Lorentz.
According to him $\mathbf{j} = \rho\mathbf{v}$ where $\rho$ is the density of the charge and $\mathbf{v}$ its velocity.
The four-dimensional current vector

$$\rho c , \quad \rho v_1 , \quad \rho v_2 , \quad \rho v_3 \tag{24.29}$$

is proportional to the four-dimensional velocity vector

$$u^0 = \frac{c}{\sqrt{1 - v^2/c^2}} ; \quad u^1 = \frac{v_1}{\sqrt{1 - v^2/c^2}} ; \quad u^2 = \frac{v_2}{\sqrt{1 - v^2/c^2}} ; \quad u^3 = \frac{v_3}{\sqrt{1 - v^2/c^2}} \tag{24.30}$$

(see (20.18)) with the coefficient of proportionality

$$\rho^* = \rho \sqrt{1 - v^2/c^2} \tag{24.31}$$

This quantity is an invariant which has the physical significance of the charge
density in that reference frame in which the charge is momentarily at rest.

We have verified the covariance of Maxwell’s equations and have established
that the transformations undergone by the electromagnetic field when going
from one reference frame to another are those of an antisymmetric tensor of
the second rank. We shall now state the transformation laws in an explicit
form, introducing for the general coefficients of the Lorentz transformation
their values for the case that the transformation does not involve a spatial
rotation. We then have

$$a_{00} = \frac{1}{\sqrt{1 - V^2/c^2}} ; \qquad a_{0i} = \frac{a_{00}}{c}\, V_i ; \qquad a_{i0} = -\frac{a_{00}}{c}\, V_i \quad (i = 1, 2, 3) \tag{24.32}$$

$$a_{ik} = -\delta_{ik} - (a_{00} - 1)\, \frac{V_i V_k}{V^2} \quad (i, k = 1, 2, 3) \tag{24.33}$$

Inserting these values into the general expression

$$F'_{ik} = e_i e_k \sum_{j,l=0}^{3} a_{ij} a_{kl} F_{jl}$$

and using the antisymmetry of $F_{ik}$ we get

$$F'_{23} = a_{00} F_{23} - (a_{00} - 1)\, \frac{V_1}{V^2}\, (V_1 F_{23} + V_2 F_{31} + V_3 F_{12}) - \frac{a_{00}}{c}\, (V_2 F_{30} - V_3 F_{20}) \tag{24.34}$$

and two similar expressions obtained from (24.34) by cyclic permutation of the
suffixes 1, 2 and 3. We also get

$$F'_{10} = a_{00} F_{10} - (a_{00} - 1)\, \frac{V_1}{V^2}\, (V_1 F_{10} + V_2 F_{20} + V_3 F_{30}) + \frac{a_{00}}{c}\, (V_2 F_{12} - V_3 F_{31}) \tag{24.35}$$

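and two expressions obtained by cyclic permutation. Using (24.11) to pass to
the usual notation for the electric and the magnetic fields and using
three-dimensional vector algebra, we may write

$$\begin{aligned} E'_i &= a_{00} E_i - (a_{00} - 1)\, \frac{V_i}{V^2}\, (\mathbf{V} \cdot \mathbf{E}) + \frac{a_{00}}{c}\, [\mathbf{V} \times \mathbf{H}]_i \\ H'_i &= a_{00} H_i - (a_{00} - 1)\, \frac{V_i}{V^2}\, (\mathbf{V} \cdot \mathbf{H}) - \frac{a_{00}}{c}\, [\mathbf{V} \times \mathbf{E}]_i \end{aligned} \tag{24.36}$$

Passing from the components to the vectors, replacing $a_{00}$ by its value and
regrouping terms a little, we obtain the final expressions

$$\mathbf{E}' = \frac{\mathbf{V}}{V^2}\, (\mathbf{V} \cdot \mathbf{E}) + \frac{1}{\sqrt{1 - V^2/c^2}} \left( \mathbf{E} - \frac{\mathbf{V}}{V^2}\, (\mathbf{V} \cdot \mathbf{E}) + \frac{1}{c}\, [\mathbf{V} \times \mathbf{H}] \right) \tag{24.37}$$

and

$$\mathbf{H}' = \frac{\mathbf{V}}{V^2}\, (\mathbf{V} \cdot \mathbf{H}) + \frac{1}{\sqrt{1 - V^2/c^2}} \left( \mathbf{H} - \frac{\mathbf{V}}{V^2}\, (\mathbf{V} \cdot \mathbf{H}) - \frac{1}{c}\, [\mathbf{V} \times \mathbf{E}] \right) \tag{24.38}$$

For comparison we shall also state the transformation formulae for a
four-dimensional covariant vector, using the usual vector notation for its space
components.

$$A'_0 = \frac{1}{\sqrt{1 - V^2/c^2}} \left( A_0 + \frac{1}{c}\, (\mathbf{V} \cdot \mathbf{A}) \right) \tag{24.39}$$

and

$$\mathbf{A}' = \mathbf{A} - \frac{\mathbf{V}}{V^2}\, (\mathbf{V} \cdot \mathbf{A}) + \frac{1}{\sqrt{1 - V^2/c^2}}\, \frac{\mathbf{V}}{V^2} \left( (\mathbf{V} \cdot \mathbf{A}) + \frac{V^2}{c}\, A_0 \right) \tag{24.40}$$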

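We note that the equations for the transformation of the fields give

$$\mathbf{E}' \cdot \mathbf{V} = \mathbf{E} \cdot \mathbf{V} ; \qquad \mathbf{H}' \cdot \mathbf{V} = \mathbf{H} \cdot \mathbf{V} \tag{24.41}$$

This means that those parts of the electric and the magnetic field which are
parallel to the velocity $\mathbf{V}$ remain unchanged. On the other hand for the spatial
part of a four-dimensional vector the part perpendicular to the velocity remains
invariant, for we have

$$\mathbf{A}' - \frac{\mathbf{V}}{V^2}\, (\mathbf{V} \cdot \mathbf{A}') = \mathbf{A} - \frac{\mathbf{V}}{V^2}\, (\mathbf{V} \cdot \mathbf{A}) \tag{24.42}$$

If we consider the particular Lorentz transformation with

$$V_x = V ; \qquad V_y = V_z = 0 \tag{24.43}$$

the transformation equations simplify, giving

$$\begin{aligned} E'_x &= E_x ; & E'_y &= \frac{1}{\sqrt{1 - V^2/c^2}} \left( E_y - \frac{V}{c} H_z \right) ; & E'_z &= \frac{1}{\sqrt{1 - V^2/c^2}} \left( E_z + \frac{V}{c} H_y \right) \\ H'_x &= H_x ; & H'_y &= \frac{1}{\sqrt{1 - V^2/c^2}} \left( H_y + \frac{V}{c} E_z \right) ; & H'_z &= \frac{1}{\sqrt{1 - V^2/c^2}} \left( H_z - \frac{V}{c} E_y \right) \end{aligned} \tag{24.44}$$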

One can construct two combinations from the field components that are 
invariant under Lorentz transformations. We have 

$$\mathbf{E}'^2 - \mathbf{H}'^2 = \mathbf{E}^2 - \mathbf{H}^2 \tag{24.45}$$

and 

$$\mathbf{E}'\cdot\mathbf{H}' = \mathbf{E}\cdot\mathbf{H} \tag{24.46}$$

The first of these quantities is also invariant under improper Lorentz 
transformations (see Section 22) and is a scalar. The second changes sign in 
improper transformations, and therefore is a pseudo-scalar. This can easily be 
seen in tensor notation. We have 

$$\mathbf{E}^2 - \mathbf{H}^2 = -\frac{1}{2}\sum_{i,k=0}^{3} F^{ik}F_{ik} \tag{24.47}$$

and 

$$\mathbf{E}\cdot\mathbf{H} = \frac{1}{4}\sum_{i,k=0}^{3} \overset{*}{F}{}^{ik}F_{ik} \tag{24.48}$$

where $\overset{*}{F}{}^{ik}$ is an antisymmetric pseudo-tensor related to the field tensor by 
equations analogous to (22.06) and (22.07). We note that for a plane wave 
the quantities (24.45) and (24.46) vanish (in all frames). 
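
The invariance statements (24.41), (24.45) and (24.46) can be checked numerically. The following is a minimal sketch, assuming Gaussian-type units with c = 1 and arbitrary illustrative field values that do not come from the text; the helper names `dot`, `cross` and `boost_fields` are hypothetical.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def boost_fields(E, H, V, c=1.0):
    """Transform E and H by (24.37) and (24.38); V must be non-zero, |V| < c."""
    V2 = dot(V, V)
    gamma = 1.0 / math.sqrt(1.0 - V2 / c**2)
    VxH, VxE = cross(V, H), cross(V, E)
    e_par, h_par = dot(V, E) / V2, dot(V, H) / V2
    E_new = tuple(V[i] * e_par + gamma * (E[i] - V[i] * e_par + VxH[i] / c)
                  for i in range(3))
    H_new = tuple(V[i] * h_par + gamma * (H[i] - V[i] * h_par - VxE[i] / c)
                  for i in range(3))
    return E_new, H_new

# Arbitrary illustrative values (an assumption of this sketch), with c = 1.
E, H, V = (0.3, -1.2, 0.7), (1.1, 0.4, -0.5), (0.2, -0.3, 0.5)
E_new, H_new = boost_fields(E, H, V)

# Changes in E^2 - H^2, in E.H and in the component of E along V.
scalar_change = (dot(E_new, E_new) - dot(H_new, H_new)) - (dot(E, E) - dot(H, H))
pseudo_change = dot(E_new, H_new) - dot(E, H)
parallel_change = dot(E_new, V) - dot(E, V)
```

All three differences should vanish up to rounding error, whatever values are chosen for the fields and for the velocity.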

As do Maxwell’s equations themselves, the transformation equations for the 
field reveal a very close relation between the electric and the magnetic field. 
For instance, a field due to charges at rest in some frame will be an electrostatic 
one in that frame; but in another reference frame which moves relative to the 
first, a magnetic field is found in addition to the electric one. This is readily 
understood since a moving charge represents a current. Similarly a field that is 
purely magnetic in one frame will appear in another frame as a superposition 
of a magnetic and an electric field. 


25. The Motion of a Charged Mass-Point in a given External Field 

We now consider the motion of a particle of mass m and charge e in a given 
external field. As we already saw (Eq. (20.23)) the four-dimensional vector 
of acceleration has the components 

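$$w^0 = \frac{1}{\sqrt{1 - v^2/c^2}}\,\frac{d}{dt}\left(\frac{c}{\sqrt{1 - v^2/c^2}}\right); \qquad w^i = \frac{1}{\sqrt{1 - v^2/c^2}}\,\frac{d}{dt}\left(\frac{1}{\sqrt{1 - v^2/c^2}}\,\frac{dx_i}{dt}\right) \quad (i = 1, 2, 3) \tag{25.01}$$

The quantities here being differentiated with respect to time are the 
components of four-dimensional velocity 

$$u^0 = \frac{c}{\sqrt{1 - v^2/c^2}}; \qquad u^i = \frac{v_i}{\sqrt{1 - v^2/c^2}} \quad (i = 1, 2, 3) \tag{25.02}$$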


(25.03) 


We introduce a primed reference frame such that in it the instantaneous 
velocity of the particle is zero. We then make the assumption that in this 
frame, which is instantaneously travelling with the particle, the acceleration 
as usually defined is proportional to the electric field 

$$\frac{d^2\mathbf{r}'}{dt'^2} = \frac{e}{m}\mathbf{E}' \tag{25.04}$$

This equation must now be stated in the original frame. 

In the primed frame the four-dimensional velocity and acceleration are 
$$u'^0 = c, \qquad u'^1 = u'^2 = u'^3 = 0 \tag{25.05}$$

and 

$$w'^0 = 0, \qquad w'^i = \frac{d^2 x'_i}{dt'^2} \tag{25.06}$$

Thus in the primed frame the relation between the four-dimensional 
acceleration and the field is 

$$w'^0 = 0, \qquad w'^i = \frac{e}{m}E'_i \tag{25.07}$$

Using the transformation rules for a contravariant vector (which are the same 
as the transformation rules for the coordinates) we find that 

$$w^i = \sum_{k=0}^{3} e_k a_{ki} w'^k \tag{25.08}$$

Inserting into the right-hand side the values for $w'^k$ from (25.07) we get for the 
four-dimensional acceleration in the original system 

$$w^i = -\frac{e}{m}\left(a_{1i}E'_1 + a_{2i}E'_2 + a_{3i}E'_3\right) \tag{25.09}$$

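The values of the $a_{ki}$ must be inserted into this from (24.32) and (24.33), giving 

$$w^0 = \frac{e}{mc}\,\frac{(\mathbf{V}\cdot\mathbf{E}')}{\sqrt{1 - V^2/c^2}} \tag{25.10}$$

and 

$$w^i = \frac{e}{m}\left\{E'_i + \left(\frac{1}{\sqrt{1 - V^2/c^2}} - 1\right)\frac{V_i}{V^2}(\mathbf{V}\cdot\mathbf{E}')\right\} \tag{25.11}$$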


Finally E' must be expressed in terms of E and H. Since V · E' = V · E, 
equation (24.36), with $a_{00}$ replaced by its explicit value, gives 

$$w^0 = \frac{e}{m}\,\frac{1}{\sqrt{c^2 - V^2}}\,(\mathbf{V}\cdot\mathbf{E}) \tag{25.12}$$

and 

$$w^i = \frac{e}{m}\,\frac{c}{\sqrt{c^2 - V^2}}\left(E_i + \frac{1}{c}[\mathbf{V}\times\mathbf{H}]_i\right) \tag{25.13}$$

Here V is the velocity of the frame moving with the particle; at the instant 
under consideration V coincides with v, the velocity of the particle. Therefore 
we can rewrite the previous equations as 

$$w^0 = \frac{e}{m}\,\frac{1}{\sqrt{c^2 - v^2}}\,(\mathbf{v}\cdot\mathbf{E}) \tag{25.14}$$

and 

$$w^i = \frac{e}{m}\,\frac{c}{\sqrt{c^2 - v^2}}\left(E_i + \frac{1}{c}[\mathbf{v}\times\mathbf{H}]_i\right) \tag{25.15}$$


Now we can abandon the moving frame and take it that the equations just 
obtained are valid at any instant of time. They are thus the equations of motion 
of the particle. However, the form of the equations (25.14) and (25.15) is 
inconvenient owing to the mixed notation used; on the left we have a 
four-dimensional vector while the right-hand side involves three-dimensional 
quantities. We now make the notation uniform, by first stating the equation 
entirely in four-dimensional and then entirely in three-dimensional terms. 

Using the equation (25.02) for the four-dimensional velocity and equation 
(24.11) for the field, we can write equations (25.14) and (25.15) in the form 


$$w^0 = \frac{e}{mc}\left(u^1 F_{10} + u^2 F_{20} + u^3 F_{30}\right) \tag{25.16}$$

and 

$$w^i = \frac{e}{mc}\left(u^0 F_{i0} + u^1 F_{i1} + u^2 F_{i2} + u^3 F_{i3}\right) \quad (i = 1, 2, 3) \tag{25.17}$$

Because of the antisymmetry of the tensor $F_{ik}$ the right-hand sides of all 
equations (25.17) contain in fact three and not four terms. Passing to the 
covariant vector of acceleration 

$$w_0 = w^0; \qquad w_1 = -w^1; \qquad w_2 = -w^2; \qquad w_3 = -w^3 \tag{25.18}$$

we have for all four values of i 

$$w_i = -\frac{e}{mc}\sum_{k=0}^{3} u^k F_{ik} \tag{25.19}$$

On either side of this equation we have a covariant vector so that it is obvious 
that the equations of motion, written in this form, are valid in any reference 
frame, as it should be. It is easy to verify that 

$$\sum_{i=0}^{3} u^i w_i = 0 \tag{25.20}$$

in agreement with (20.24). This means that the first equation of motion is a 
consequence of the three others. 

In view of the covariance of equations (25.19) it would have been sufficient 
for proof of their validity, to verify that in the primed frame they reduce to 
the form (25.07). 
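
As a numerical illustration of (25.19) and (25.20), the sketch below builds the field tensor from E and H using the assignments $F_{10} = E_x$, $F_{23} = H_x$ and their cyclic counterparts implied by the passage from (24.34) and (24.35) to (24.36), evaluates the covariant acceleration, and compares it with the three-dimensional forms (25.14) and (25.15). The function name `four_acceleration` and the numerical values are assumptions made only for this example, with e = m = c = 1.

```python
import math

def four_acceleration(v, E, H, e=1.0, m=1.0, c=1.0):
    """Return (u, w_cov, w_con) for a charge with velocity v, using (25.02) and (25.19)."""
    gamma = 1.0 / math.sqrt(1.0 - sum(x * x for x in v) / c**2)
    u = [gamma * c, gamma * v[0], gamma * v[1], gamma * v[2]]  # contravariant velocity
    # Antisymmetric covariant field tensor F[i][k] = F_ik.
    F = [[0.0] * 4 for _ in range(4)]
    for i in range(3):
        F[i + 1][0], F[0][i + 1] = E[i], -E[i]      # F_10 = E_x, etc.
    F[2][3], F[3][2] = H[0], -H[0]                  # F_23 = H_x
    F[3][1], F[1][3] = H[1], -H[1]                  # F_31 = H_y
    F[1][2], F[2][1] = H[2], -H[2]                  # F_12 = H_z
    k = e / (m * c)
    w_cov = [-k * sum(u[j] * F[i][j] for j in range(4)) for i in range(4)]  # (25.19)
    w_con = [w_cov[0]] + [-x for x in w_cov[1:]]                            # (25.18)
    return u, w_cov, w_con

# Illustrative values only.
v, E, H = (0.3, -0.2, 0.4), (0.5, 1.0, -0.7), (-0.2, 0.9, 0.6)
u, w_cov, w_con = four_acceleration(v, E, H)

contraction = sum(u[i] * w_cov[i] for i in range(4))        # left side of (25.20)

# Three-dimensional forms (25.14) and (25.15) with e = m = c = 1.
root = math.sqrt(1.0 - sum(x * x for x in v))
vxH = (v[1] * H[2] - v[2] * H[1], v[2] * H[0] - v[0] * H[2], v[0] * H[1] - v[1] * H[0])
w0_expected = sum(a * b for a, b in zip(v, E)) / root
w_expected = [(E[i] + vxH[i]) / root for i in range(3)]
```

The contraction vanishes because $F_{ik}$ is antisymmetric, which is the numerical counterpart of the remark that the first equation of motion follows from the other three.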

We now pass to the three-dimensional statement of the equations. Using 
the expression (25.01) for the four-dimensional vector of acceleration and 
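introducing three-dimensional vector notation we have 

$$\frac{d}{dt}\left(\frac{mc^2}{\sqrt{1 - v^2/c^2}}\right) = e(\mathbf{v}\cdot\mathbf{E}) \tag{25.21}$$

and 

$$\frac{d}{dt}\left(\frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right) = e\left(\mathbf{E} + \frac{1}{c}[\mathbf{v}\times\mathbf{H}]\right) \tag{25.22}$$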


In (25.21) the right-hand side is the power, or rate of work done by the field 
on the particle ; by general mechanical principles one should therefore interpret 
the left-hand side as the rate of increase of the kinetic energy of the particle. 
In (25.22) the right-hand side is the Lorentz force on a moving charge and the 
left-hand side the rate of increase of particle momentum. 

Thus the kinetic energy and the momentum of the particle can differ only 
by constants from the expressions 


$$W = \frac{mc^2}{\sqrt{1 - v^2/c^2}} \tag{25.23}$$

and 

$$\mathbf{P} = \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}} \tag{25.24}$$


whose time derivatives occur in the equations of motion. 

The values of the constants can be determined by a covariance requirement; 
one must postulate that the energy and the momentum of a particle† form a 
four-dimensional vector. 

Now, the quantities (25.23) and (25.24) are themselves a four vector, for 
they are respectively proportional to the temporal and spatial components of 
the four-dimensional velocity. Therefore the constants must vanish.‡ 
Consequently the quantities (25.23) and (25.24) are the energy and momentum 
of the particle. 

† More accurately the energy divided by c and the momentum. 

If we take v = 0 then, in agreement with older ideas, P = 0, the momentum 
of a body at rest is zero; but for v = 0 the energy W takes on a value different 
from zero, namely 

$$W_0 = mc^2 \tag{25.25}$$

This result disagrees with older notions, but is fully confirmed by experiment: 
any body of mass m represents a store of energy $W_0$ (see Section 34, below). 
A similar relation 

$$W = Mc^2 \tag{25.26}$$

between the mass and the energy of a body holds also at arbitrary velocity 
if M is understood to mean the quantity 

$$M = \frac{m}{\sqrt{1 - v^2/c^2}} \tag{25.27}$$


which is itself a function of the speed. The quantity M must be taken as the 
rational generalization of the concept of mass. It enters as the multiplier of 
the velocity in expression (25.24) for the momentum and so describes the inertia, 
or inertial mass, of the body. M is usually called simply the mass while m 
is termed the rest-mass. Correspondingly W is referred to as energy and W 0 
as rest energy. 

The rest-mass m is a constant independent of the state of motion of the 
particle as a whole, which may, however, depend on the internal state of the 
particle, if the latter has some complex structure and some internal degrees 
of freedom. The rest-mass can be expressed in terms of the invariant of the 
four-dimensional vector of energy-momentum. Indeed, this invariant is 
independent of velocity and equal to 


$$\frac{W^2}{c^2} - P^2 = m^2c^2 \tag{25.28}$$


The mass M depends on velocity in such a way that if the particle approaches 
the speed of light its mass increases without limit. Consequently no body 
having a rest-mass different from zero can attain the speed of light in any 
frame of reference. This fact confirms strikingly the limiting property of the 
speed of light. 

If the speed of a particle is small compared to the speed of light the difference 

$$W - mc^2 = \frac{1}{2}mv^2 + \frac{3}{8}m\frac{v^4}{c^2} + \dots \tag{25.29}$$

will only differ by small quantities from the usual expression for the kinetic 
energy. 
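
The size of the correction in (25.29) and the invariant (25.28) can be illustrated with a few lines of arithmetic. This is only a sketch: the values m = 1, c = 1 and v = 0.1 are arbitrary choices, and the function names are hypothetical.

```python
import math

def energy(m, v, c):
    """Energy W of (25.23) for speed v."""
    return m * c**2 / math.sqrt(1.0 - (v / c)**2)

def momentum(m, v, c):
    """Magnitude of the momentum P of (25.24) for speed v."""
    return m * v / math.sqrt(1.0 - (v / c)**2)

def kinetic_series(m, v, c):
    """First two terms on the right-hand side of (25.29)."""
    return 0.5 * m * v**2 + 0.375 * m * v**4 / c**2

m, c, v = 1.0, 1.0, 0.1            # illustrative values only
W, P = energy(m, v, c), momentum(m, v, c)

exact_kinetic = W - m * c**2       # left-hand side of (25.29)
series_error = exact_kinetic - kinetic_series(m, v, c)
invariant = W**2 / c**2 - P**2     # should equal m^2 c^2 by (25.28)
```

For these values the two-term series differs from the exact expression only at the order of $m v^6 / c^4$, while the Newtonian term alone is in error at the order of $m v^4 / c^2$.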

The foregoing relations were derived for the equations of motion of a charged 
particle but they have more general validity. The relation (25.26) between 
mass and energy holds not only for the present case of a particle of invariable 
rest-mass, but also for any complex body or system of bodies in which internal 
processes can effect changes of rest-mass. The relation expresses the 
fundamental law of proportionality of mass and energy. We shall return to a 
discussion of it in Section 34. 

‡ A more detailed argument goes as follows: the constants must on the one hand form a 
vector, on the other they must be independent of the state of motion and, hence, of the frame 
of reference. However, a constant vector different from zero, with components independent 
of the frame, does not exist. 

26. Approximate Description of a System of Moving Point Charges 

The problem of the motion of a system of charged mass points requires a 
simultaneous determination of the motions of the charges and of the 
electromagnetic field in which they move; only an external field can be considered 
as predetermined. The field transmits the interactions between the charges 
with a finite velocity, namely c. Consequently the force acting on a given 
charge will depend not on the momentary state but rather on the past state 
of motion of all the other charges. The field arising in the accelerated motion 
of charges not only carries the interaction but also radiates into outside space, 
therefore the energy of the charges will be spent in part on radiation and the 
system of charges is not a conservative one. Also it must be borne in mind 
that the field possesses an infinite number of degrees of freedom and therefore 
the system of charges with their field is strictly speaking one with infinite 
degrees of freedom and thus not a purely mechanical one. 

Nevertheless the problem of the motion of charges can be approximately 
formulated in mechanical terms, involving only the degrees of freedom of the 
charges themselves. To do this one must express the field, or rather the 
potentials, of each charge in terms of its position and velocity and, further, 
one must find an approximate way of relating quantities at an earlier to 
quantities at a later instant. The potentials at a given place and given time 
are in this way expressed approximately in terms of the positions and velocities 
of the particles at the same instead of at an earlier time. An approximate 
Lagrangian for a system of charged particles may thus be constructed. 

We find first the Lagrangian form of the equations of motion of a single 
charged particle. It is easy to verify that equations (25.22) can be derived 
from the Lagrangian 

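$$L = -mc^2\sqrt{1 - \frac{v^2}{c^2}} - e\Phi + \frac{e}{c}(\mathbf{A}\cdot\mathbf{v}) \tag{26.01}$$

where $\Phi = -A_0$ and $A_i$ are the scalar and vector potentials respectively. For, 
since $\dot{x}_i = v_i$ we have 

$$\frac{\partial L}{\partial \dot{x}_i} = \frac{m v_i}{\sqrt{1 - v^2/c^2}} + \frac{e}{c}A_i; \qquad \frac{\partial L}{\partial x_i} = -e\frac{\partial \Phi}{\partial x_i} + \frac{e}{c}\sum_{k=1}^{3}\frac{\partial A_k}{\partial x_i}v_k \tag{26.02}$$

Inserting these expressions into Lagrange's equations 

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{x}_i}\right) - \frac{\partial L}{\partial x_i} = 0 \tag{26.03}$$

and remembering how the fields are expressed in terms of the potentials by 
(24.01) we get 

$$\frac{d}{dt}\left(\frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right) = e\left(\mathbf{E} + \frac{1}{c}[\mathbf{v}\times\mathbf{H}]\right) \tag{26.04}$$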

which is the same as equation (25.22) and of which we also know (25.21) to be 
a consequence. 
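
The first relation in (26.02) can be checked by differentiating the Lagrangian (26.01) numerically. The sketch below assumes, purely for illustration, a uniform electric field along x and a uniform magnetic field along z, represented by the hypothetical potentials `phi` and `vec_A`, with e = m = c = 1; none of these choices come from the text.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def phi(r, E0=0.7):
    """Scalar potential of a uniform field E0 along x (illustrative assumption)."""
    return -E0 * r[0]

def vec_A(r, H0=1.3):
    """Vector potential of a uniform field H0 along z (illustrative assumption)."""
    return (-0.5 * H0 * r[1], 0.5 * H0 * r[0], 0.0)

def lagrangian(r, v, e=1.0, m=1.0, c=1.0):
    """Lagrangian (26.01) of a single charge."""
    return (-m * c**2 * math.sqrt(1.0 - dot(v, v) / c**2)
            - e * phi(r) + (e / c) * dot(vec_A(r), v))

def canonical_momentum(r, v, e=1.0, m=1.0, c=1.0):
    """Closed form of dL/dv_i, first relation of (26.02)."""
    root = math.sqrt(1.0 - dot(v, v) / c**2)
    A = vec_A(r)
    return tuple(m * v[i] / root + (e / c) * A[i] for i in range(3))

def numerical_dL_dv(r, v, h=1e-6):
    """Central-difference derivative of L with respect to each velocity component."""
    grad = []
    for i in range(3):
        vp, vm = list(v), list(v)
        vp[i] += h
        vm[i] -= h
        grad.append((lagrangian(r, vp) - lagrangian(r, vm)) / (2.0 * h))
    return tuple(grad)

r, v = (0.4, -1.0, 2.0), (0.3, 0.1, -0.2)
p_formula = canonical_momentum(r, v)
p_numeric = numerical_dL_dv(r, v)
max_diff = max(abs(a - b) for a, b in zip(p_formula, p_numeric))
```

The agreement is limited only by the finite-difference step, so the tolerance used for comparison should be much looser than machine precision.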

We now pass to the case of a system of particles. We seek an approximate 
expression for the fields of particles of known motion. Considering first a 
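spatial distribution of charge we can write down the equations for the 
potentials as 

$$\Delta\Phi - \frac{1}{c^2}\frac{\partial^2 \Phi}{\partial t^2} = -4\pi\rho \tag{26.05}$$

and 

$$\Delta\mathbf{A} - \frac{1}{c^2}\frac{\partial^2 \mathbf{A}}{\partial t^2} = -\frac{4\pi}{c}\mathbf{j} \tag{26.06}$$

The solution of (26.05) that is required is the retarded potential 

$$\Phi = \int \frac{[\rho]}{|\mathbf{r} - \mathbf{r}'|}\,dV' \tag{26.07}$$

where 

$$[\rho] = \rho(\mathbf{r}', t'); \qquad t' = t - \frac{1}{c}|\mathbf{r} - \mathbf{r}'| \tag{26.08}$$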

If the motion of the particle is sufficiently slow and sufficiently smooth† then 
for not too great distances the expression for $[\rho]$ may be replaced by the first 
few terms in an expansion in inverse powers of c 

$$[\rho] = \rho(\mathbf{r}', t) - \frac{1}{c}|\mathbf{r} - \mathbf{r}'|\frac{\partial}{\partial t}\rho(\mathbf{r}', t) + \frac{1}{2c^2}|\mathbf{r} - \mathbf{r}'|^2\frac{\partial^2}{\partial t^2}\rho(\mathbf{r}', t) + \dots \tag{26.09}$$

Inserting this expansion into (26.07) and using the fact that the integral 

$$\int \rho(\mathbf{r}', t)\,dV' = e_a \tag{26.10}$$

is the charge of the particle and therefore independent of time, we get 

$$\Phi = \int \frac{\rho(\mathbf{r}', t)\,dV'}{|\mathbf{r} - \mathbf{r}'|} + \frac{1}{2c^2}\frac{\partial^2}{\partial t^2}\int \rho(\mathbf{r}', t)\,|\mathbf{r} - \mathbf{r}'|\,dV' + \dots \tag{26.11}$$

If we now assume that the charge is concentrated near the point‡ 

$$\mathbf{r}' = \mathbf{r}_a = \mathbf{r}_a(t) \tag{26.12}$$

we obtain from (26.11) 

$$\Phi = \frac{e_a}{|\mathbf{r} - \mathbf{r}_a|} + \frac{e_a}{2c^2}\frac{\partial^2}{\partial t^2}|\mathbf{r} - \mathbf{r}_a| \tag{26.13}$$

† These requirements impose a limitation on the speed of the particle ($v^2 \ll c^2$), on its 
acceleration and on higher time derivatives. 

‡ The suffix a in $e_a$, $\mathbf{r}_a$, $\mathbf{v}_a$, etc., now denotes the number of the particle. 

Putting 

$$\dot{\mathbf{r}}_a(t) = \mathbf{v}_a \tag{26.14}$$

and performing one of the time differentiations in (26.13) we have 

$$\Phi = \frac{e_a}{|\mathbf{r} - \mathbf{r}_a|} - \frac{e_a}{2c^2}\frac{\partial}{\partial t}\,\frac{\mathbf{v}_a\cdot(\mathbf{r} - \mathbf{r}_a)}{|\mathbf{r} - \mathbf{r}_a|} \tag{26.15}$$

Here the first term gives the Coulomb potential, as for a charge at rest, while 
the second is a retardation correction. 

In the expression for the vector potential we need not include any correction 
terms§ and we may write it as 

$$A_i = \frac{e_a v_{ai}}{c\,|\mathbf{r} - \mathbf{r}_a|} \tag{26.16}$$

(This expression for the vector potential is obtained if in (26.06) second 
derivatives with respect to time are neglected, if the current density j is replaced 
by $\rho\mathbf{v}_a$ and if the transition to a concentrated charge is made.) 

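The form (26.13) of the scalar potential is inconvenient because it involves 
not only the velocity $\mathbf{v}_a$ but also the acceleration $\dot{\mathbf{v}}_a$ of the particle which 
produces the field. This inconvenience can easily be removed by using the fact 
that the fields are not changed by a "gauge transformation" of the potentials, 
i.e. by the substitution 

$$\Phi \to \Phi + \frac{1}{c}\frac{\partial \chi}{\partial t}; \qquad A_i \to A_i - \frac{\partial \chi}{\partial x_i} \tag{26.17}$$

where $\chi$ is an arbitrary function of space and time coordinates. We choose 

$$\chi = \frac{e_a}{2c}\,\frac{\mathbf{v}_a\cdot(\mathbf{r} - \mathbf{r}_a)}{|\mathbf{r} - \mathbf{r}_a|} \tag{26.18}$$

and thereby cancel the second term in (26.15) at the expense of complicating 
the expression for $A_i$ a little. The result is 

$$\Phi = \frac{e_a}{|\mathbf{r} - \mathbf{r}_a|} \tag{26.19}$$

and 

$$A_i = \frac{e_a}{2c}\left(\frac{v_{ai}}{|\mathbf{r} - \mathbf{r}_a|} + \frac{(x_i - x_{ai})\sum_{k=1}^{3} v_{ak}(x_k - x_{ak})}{|\mathbf{r} - \mathbf{r}_a|^3}\right) \tag{26.20}$$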

It is easy to verify that the new expression for the vector potential has vanishing 
divergence: 

$$\sum_{i=1}^{3}\frac{\partial A_i}{\partial x_i} = 0 \tag{26.21}$$
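
The vanishing divergence (26.21) can be confirmed with central differences. The sketch below is only illustrative: the field point, the particle position and velocity, and the choice $e_a = c = 1$ are assumptions, and the function names are hypothetical. For contrast it also evaluates the divergence of the uncorrected potential (26.16), which does not vanish.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def potential_gauged(r, r_a, v_a, e_a=1.0, c=1.0):
    """Vector potential (26.20), after the gauge transformation."""
    R = [r[i] - r_a[i] for i in range(3)]
    dist = math.sqrt(dot(R, R))
    vR = dot(v_a, R)
    return [e_a / (2.0 * c) * (v_a[i] / dist + R[i] * vR / dist**3) for i in range(3)]

def potential_plain(r, r_a, v_a, e_a=1.0, c=1.0):
    """Vector potential (26.16), before the gauge transformation."""
    R = [r[i] - r_a[i] for i in range(3)]
    dist = math.sqrt(dot(R, R))
    return [e_a * v_a[i] / (c * dist) for i in range(3)]

def divergence(field, r, r_a, v_a, h=1e-5):
    """Central-difference estimate of the divergence of `field` at the point r."""
    total = 0.0
    for i in range(3):
        rp, rm = list(r), list(r)
        rp[i] += h
        rm[i] -= h
        total += (field(rp, r_a, v_a)[i] - field(rm, r_a, v_a)[i]) / (2.0 * h)
    return total

r = (1.0, 0.5, -0.3)           # field point (illustrative)
r_a = (0.0, 0.0, 0.0)          # particle position (illustrative)
v_a = (0.1, -0.2, 0.05)        # particle velocity (illustrative)

div_gauged = divergence(potential_gauged, r, r_a, v_a)
div_plain = divergence(potential_plain, r, r_a, v_a)
```

The first result is zero to within the accuracy of the finite differences, while the second equals $-e_a\,\mathbf{v}_a\cdot(\mathbf{r} - \mathbf{r}_a)/(c\,|\mathbf{r} - \mathbf{r}_a|^3)$.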

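Insertion of these values for the potentials into the last two terms of the 
Lagrangian (26.01) gives 

$$-e\Phi + \frac{e}{c}(\mathbf{A}\cdot\mathbf{v}) = -\frac{e e_a}{|\mathbf{r} - \mathbf{r}_a|} + \frac{e e_a}{2c^2}\left(\frac{(\mathbf{v}\cdot\mathbf{v}_a)}{|\mathbf{r} - \mathbf{r}_a|} + \frac{\{\mathbf{v}_a\cdot(\mathbf{r} - \mathbf{r}_a)\}\{\mathbf{v}\cdot(\mathbf{r} - \mathbf{r}_a)\}}{|\mathbf{r} - \mathbf{r}_a|^3}\right) \tag{26.22}$$

§ In the Lagrangian and in the equations of motion the terms in the vector potential are 
of the same order as the terms involving corrections to the scalar potential. 

This expression is symmetric in the particle producing the field and the particle 
being acted on. We can consider it to be an approximate statement of the 
interaction law for two particles. It allows us to write the Lagrangian of a 
system of particles as 

$$L = -\sum_a m_a c^2\sqrt{1 - \frac{v_a^2}{c^2}} - \frac{1}{2}\sum_{\substack{a,b \\ (a \ne b)}}\frac{e_a e_b}{|\mathbf{r}_a - \mathbf{r}_b|} + \frac{1}{4c^2}\sum_{\substack{a,b \\ (a \ne b)}} e_a e_b\left(\frac{(\mathbf{v}_a\cdot\mathbf{v}_b)}{|\mathbf{r}_a - \mathbf{r}_b|} + \frac{\{\mathbf{v}_a\cdot(\mathbf{r}_a - \mathbf{r}_b)\}\{\mathbf{v}_b\cdot(\mathbf{r}_a - \mathbf{r}_b)\}}{|\mathbf{r}_a - \mathbf{r}_b|^3}\right) \tag{26.23}$$

Incidentally, since quantities of order $v^2/c^2$ are assumed small in the interaction 
law it will be more consistent to neglect higher terms in this quantity 
everywhere. Therefore taking only the first terms in the expansion of the square 
root we can replace (26.23) by 

$$L = -W_0 + \sum_a\left(\frac{1}{2}m_a v_a^2 + \frac{1}{8c^2}m_a v_a^4\right) - \frac{1}{2}\sum_{\substack{a,b \\ (a \ne b)}}\frac{e_a e_b}{|\mathbf{r}_a - \mathbf{r}_b|} + \frac{1}{4c^2}\sum_{\substack{a,b \\ (a \ne b)}} e_a e_b\left(\frac{(\mathbf{v}_a\cdot\mathbf{v}_b)}{|\mathbf{r}_a - \mathbf{r}_b|} + \frac{\{\mathbf{v}_a\cdot(\mathbf{r}_a - \mathbf{r}_b)\}\{\mathbf{v}_b\cdot(\mathbf{r}_a - \mathbf{r}_b)\}}{|\mathbf{r}_a - \mathbf{r}_b|^3}\right) \tag{26.24}$$

where 

$$W_0 = c^2\sum_a m_a \tag{26.25}$$

is the sum of the rest energies of all the particles. 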

We now consider the question of Lorentz transformations in the problem 
of the motion of a system of particles. Assume the problem to be solved in 
some frame of reference. This means that, in the given frame the spatial 
coordinates of each particle are known as functions of the time in the frame: 

$$\mathbf{r}_a = \mathbf{r}_a(t) \tag{26.26}$$

We now perform a Lorentz transformation and seek to express the new space 
coordinates $x'_a$, $y'_a$, $z'_a$ as functions of the new time $t'$. The equations of the 
Lorentz transformation have the form 

$$\mathbf{r}' = \mathbf{r} - \mathbf{V}t + \left(\frac{1}{\sqrt{1 - V^2/c^2}} - 1\right)\frac{\mathbf{V}}{V^2}\{(\mathbf{V}\cdot\mathbf{r}) - V^2 t\} \tag{26.27}$$

and 

$$t' = \frac{1}{\sqrt{1 - V^2/c^2}}\left(t - \frac{1}{c^2}(\mathbf{V}\cdot\mathbf{r})\right) \tag{26.28}$$

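Therefore the new functions of the new time can be obtained by eliminating 
the variable t from the equations 

$$\mathbf{r}'_a = \mathbf{r}_a(t) - \mathbf{V}t + \left(\frac{1}{\sqrt{1 - V^2/c^2}} - 1\right)\frac{\mathbf{V}}{V^2}\left(\{\mathbf{V}\cdot\mathbf{r}_a(t)\} - V^2 t\right) \tag{26.29}$$

and 

$$t' = \frac{1}{\sqrt{1 - V^2/c^2}}\left(t - \frac{1}{c^2}\{\mathbf{V}\cdot\mathbf{r}_a(t)\}\right) \tag{26.30}$$

In other words, the solution of (26.30) for t must be found and inserted in 
(26.29). This can only be done approximately taking $V^2/c^2$ to be small 
compared with unity. The approximate solution of (26.30) is 

$$t = t' + \frac{1}{c^2}\left(-\frac{1}{2}V^2 t' + \{\mathbf{V}\cdot\mathbf{r}_a(t')\}\right) \tag{26.31}$$

Before inserting this into (26.29) we write the latter in the simplified form 

$$\mathbf{r}'_a = \mathbf{r}_a(t) - \mathbf{V}t + \frac{\mathbf{V}}{2c^2}\left(\{\mathbf{V}\cdot\mathbf{r}_a(t)\} - V^2 t\right) \tag{26.32}$$

Inserting now, we get approximately 

$$\mathbf{r}'_a(t') = \mathbf{r}_a(t') - \mathbf{V}t' + \frac{1}{c^2}\mathbf{v}_a(t')\left(-\frac{1}{2}V^2 t' + \{\mathbf{V}\cdot\mathbf{r}_a(t')\}\right) - \frac{\mathbf{V}}{2c^2}\{\mathbf{V}\cdot\mathbf{r}_a(t')\} \tag{26.33}$$

where 

$$\mathbf{v}_a(t') = \frac{d\mathbf{r}_a(t')}{dt'} \tag{26.34}$$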


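Differentiating (26.33) with respect to $t'$ we obtain the corresponding expression 
for the velocity 

$$\mathbf{v}'_a(t') = \mathbf{v}_a(t') - \mathbf{V} + \frac{1}{c^2}\mathbf{v}_a(t')\left(-\frac{1}{2}V^2 + \{\mathbf{V}\cdot\mathbf{v}_a(t')\}\right) - \frac{\mathbf{V}}{2c^2}\{\mathbf{V}\cdot\mathbf{v}_a(t')\} + \frac{1}{c^2}\dot{\mathbf{v}}_a(t')\left(-\frac{1}{2}V^2 t' + \{\mathbf{V}\cdot\mathbf{r}_a(t')\}\right) \tag{26.35}$$

where $\dot{\mathbf{v}}_a$ is the derivative of $\mathbf{v}_a$ with respect to its argument. 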

We note that the relation between t and t' depends on the number a of the 
particle; this is understandable because the transition to the new definition 
of simultaneity requires the introduction of different corrections for different 
particles. The old definition of simultaneity used in the original frame involved 
taking the same value of t for all particles (so that the t' values were different), 
but with the definition of simultaneity used in the new frame the values of t' 
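for all particles are taken equal, so that the t are different. We can 
therefore omit the suffix a on $t'$. We can also replace $t'$ by the unprimed symbol t 
and omit this argument from all functions in (26.33) and (26.35). We can then 
write these equations more briefly as 

$$\mathbf{r}'_a = \mathbf{r}_a - \mathbf{V}t + \frac{1}{c^2}\mathbf{v}_a\left(-\frac{1}{2}V^2 t + (\mathbf{V}\cdot\mathbf{r}_a)\right) - \frac{\mathbf{V}}{2c^2}(\mathbf{V}\cdot\mathbf{r}_a) \tag{26.36}$$

and 

$$\mathbf{v}'_a = \mathbf{v}_a - \mathbf{V} + \frac{1}{c^2}\mathbf{v}_a\left(-\frac{1}{2}V^2 + (\mathbf{V}\cdot\mathbf{v}_a)\right) - \frac{\mathbf{V}}{2c^2}(\mathbf{V}\cdot\mathbf{v}_a) + \frac{1}{c^2}\dot{\mathbf{v}}_a\left(-\frac{1}{2}V^2 t + (\mathbf{V}\cdot\mathbf{r}_a)\right) \tag{26.37}$$

This is the form of the new functions $\mathbf{r}'_a$ and $\mathbf{v}'_a$ expressed as functions of the 
new independent variable which, however, we have denoted by the same 
symbol as the old one. 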

The equations derived are approximate but on the assumption that the 
speeds V and v a are of the same order the terms omitted are of the same order 
as those that are inevitably neglected in the construction of the retarded 
Lagrangian. Thus the omissions are in conformity with the nature of the 
problem. 
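
The quality of the approximation (26.36) can be judged on a case where the elimination of t from (26.29) and (26.30) is possible exactly, namely uniform motion $\mathbf{r}_a(t) = \mathbf{r}_0 + \mathbf{v}t$ with vanishing acceleration. The sketch below compares the exact result, the approximation (26.36) and the Galilean formula; the numerical values, the choice c = 1 and the function names are assumptions made for this illustration only.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def exact_new_position(r0, v, V, t_new, c=1.0):
    """Eliminate t exactly from (26.29) and (26.30) for r_a(t) = r0 + v t."""
    V2 = dot(V, V)
    gamma = 1.0 / math.sqrt(1.0 - V2 / c**2)
    # (26.30) is linear in t for uniform motion, so it can be solved in closed form.
    t = (t_new / gamma + dot(V, r0) / c**2) / (1.0 - dot(V, v) / c**2)
    r = [r0[i] + v[i] * t for i in range(3)]
    k = (gamma - 1.0) / V2 * (dot(V, r) - V2 * t)
    return tuple(r[i] - V[i] * t + k * V[i] for i in range(3))

def approx_new_position(r0, v, V, t_new, c=1.0):
    """Approximation (26.36), with all functions taken at the new time."""
    r = [r0[i] + v[i] * t_new for i in range(3)]
    s = (-0.5 * dot(V, V) * t_new + dot(V, r)) / c**2
    return tuple(r[i] - V[i] * t_new + v[i] * s - V[i] * dot(V, r) / (2.0 * c**2)
                 for i in range(3))

def galilean_new_position(r0, v, V, t_new):
    """Zero-order (Galilean) transformation, for comparison."""
    return tuple(r0[i] + v[i] * t_new - V[i] * t_new for i in range(3))

# Slow motion in units with c = 1 (illustrative values).
r0, v, V, t_new = (1.0, 2.0, 3.0), (0.0, 0.02, -0.01), (0.01, 0.0, 0.005), 5.0

exact = exact_new_position(r0, v, V, t_new)
approx = approx_new_position(r0, v, V, t_new)
galilean = galilean_new_position(r0, v, V, t_new)

err_approx = max(abs(a - b) for a, b in zip(approx, exact))
err_galilean = max(abs(a - b) for a, b in zip(galilean, exact))
```

For such slow motions the error of (26.36) is smaller than the Galilean error by several orders of magnitude, in line with the remark that the omitted terms are of higher order in $1/c^2$.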

It is well known that equations of motion are obtained by varying the action 

$$S = \int_{(1)}^{(2)} L\,dt \tag{26.38}$$

and that in the usual case that the Lagrangian depends only on the coordinates 
and the velocities, and possibly the time, the equations of motion are 

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{x}_a} - \frac{\partial L}{\partial x_a} = 0 \tag{26.39}$$

If the accelerations also enter the Lagrangian the equations of motion take 
on the form 

$$-\frac{d^2}{dt^2}\frac{\partial L}{\partial \ddot{x}_a} + \frac{d}{dt}\frac{\partial L}{\partial \dot{x}_a} - \frac{\partial L}{\partial x_a} = 0, \quad \text{etc.} \tag{26.40}$$

where $\ddot{x}_a$, $\ddot{y}_a$ and $\ddot{z}_a$ are the components of acceleration of the a-th particle. 

If the action integral S is invariant with respect to a Lorentz transformation 
the equations of motion will obviously be covariant. 

Let us consider, for example, the equation of motion of a single particle, as 
derived from the Lagrangian (26.01). In this case the action integral can be 
written as 

$$S = -mc^2\int d\tau + \frac{e}{c}\int\sum_{k=0}^{3} A_k\,dx_k \tag{26.41}$$

(with $x_0 = ct$). Its invariance follows from the invariance of the differential 
of proper time and of the differential form (24.02) in the potentials. 

However, the invariance of the action integral is not a necessary condition 
for the invariance of the equations of motion; it is sufficient that a Lorentz 
transformation should leave the variation of the action integral unchanged. 
This can be seen for example in (26.41) if one subjects the potentials to a gauge 
transformation of the type (26.17) with some function $\chi$. Then S changes into 

$$S' = S - \frac{e}{c}\left(\chi^{(2)} - \chi^{(1)}\right) \tag{26.42}$$

where $\chi^{(1)}$ and $\chi^{(2)}$ are the values of $\chi$ at the two limits of the integration. 
Since the variations vanish at the limits, we have 

$$\delta S' = \delta S \tag{26.43}$$

and the equations of motion do not change, as it should be, since a gauge 
transformation of the potentials does not affect the fields. 

We apply these considerations to our case of the motion of interacting 
charges and to the Lagrangian (26.23). We denote by L'(t') the expression 
obtained for the Lagrangian $L(t)$ by replacing $\mathbf{r}_a(t)$ by $\mathbf{r}'_a(t')$ and $\mathbf{v}_a(t)$ by $\mathbf{v}'_a(t')$ 
from (26.33) and (26.35). If the original action integral was 

$$S = \int_{(1)}^{(2)} L(t)\,dt \tag{26.44}$$

the transformed integral will be 

$$S' = \int_{(1)}^{(2)} L'(t')\,dt' \tag{26.45}$$

and the change in the action integral due to the Lorentz transformation is 

$$S' - S = \int_{(1)}^{(2)} L'(t')\,dt' - \int_{(1)}^{(2)} L(t)\,dt \tag{26.46}$$

or, if we call the integration variable t in both integrals 

$$S' - S = \int_{(1)}^{(2)} L'(t)\,dt - \int_{(1)}^{(2)} L(t)\,dt \tag{26.47}$$

Here $L'(t)$ is the expression obtained from $L(t)$ by replacing $\mathbf{r}_a$ by $\mathbf{r}'_a$ and $\mathbf{v}_a$ 
by $\mathbf{v}'_a$ according to (26.36) and (26.37). 

It follows from (26.47) that in order to ensure the covariance of the equations 
of motion it is sufficient that the difference 

$$\{L'(t) - L(t)\}\,dt = dF \tag{26.48}$$

is a total differential of some function F. For then 

$$S' - S = F^{(2)} - F^{(1)} \tag{26.49}$$

where $F^{(2)}$ and $F^{(1)}$ are the values of the function at the limits, so that the 
variations of the two integrals will be equal. 

By performing the calculation we can verify that (26.48) really holds with the 
function F given by 


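$$F = \sum_a\left(-m_a + \frac{m_a v_a^2}{2c^2} - \frac{e_a}{2c^2}\sum_{\substack{b \\ (b \ne a)}}\frac{e_b}{|\mathbf{r}_a - \mathbf{r}_b|}\right)\left((\mathbf{V}\cdot\mathbf{r}_a) - \frac{1}{2}V^2 t\right) - \frac{1}{c^2}\sum_a m_a(\mathbf{V}\cdot\mathbf{v}_a)\left((\mathbf{V}\cdot\mathbf{r}_a) - \frac{1}{2}V^2 t\right) + \frac{V^4}{8c^2}\,t\sum_a m_a \tag{26.50}$$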

This also proves that the equations of motion derived from the Lagrangian 
(26.23) or (26.24) are indeed Lorentz covariant to the approximation stated. 


27. Derivation of the Conservation Laws in the Mechanics of Point 
Systems 

We now turn to the derivation of integrals of the equations of motion. To 
this end we consider the infinitesimal Lorentz transformations studied in Section 
23 in the form involving all ten parameters. We recall that a transformation 
with these parameters expressed the following: a change $\tau$ of time origin, a 
displacement a of the origin of the spatial coordinates, a transition to a new 
reference frame moving with velocity V relative to the original frame and an 
infinitesimal rotation of the spatial axes characterized by the quantity $\boldsymbol{\omega}$. 
We have to calculate the change, resulting from such transformations, in the 
functions of time that represent the coordinates of each particle. This variation 
$\delta\mathbf{r}_b$ has two causes, the first arising from the vectorial nature of $\mathbf{r}_b$ and the 
second from the change in the argument t. 

According to (23.06) and (23.17) the former variation is 

$$\Delta\mathbf{r}_b = \mathbf{a} - \mathbf{V}t + [\boldsymbol{\omega}\times\mathbf{r}_b] \tag{27.01}$$

The argument t appearing in quantities that refer to the b-th particle is varied by 

$$\Delta t_b = \tau - \frac{1}{c^2}(\mathbf{V}\cdot\mathbf{r}_b) \tag{27.02}$$

where the first term comes from the change in the time origin and the second 
from the transition to the moving frame. (Equation (27.02) is one of the 
equations of (23.06).) 

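The variation $\Delta^*\mathbf{r}_b$ due to the change of the argument of $\mathbf{r}_b(t)$ is obtained 
from the equation 

$$\mathbf{r}_b(t) = \mathbf{r}'_b(t') \qquad (t' = t + \Delta t_b) \tag{27.03}$$

whence 

$$\mathbf{r}'_b(t) = \mathbf{r}_b(t - \Delta t_b) = \mathbf{r}_b(t) - \mathbf{v}_b\,\Delta t_b \tag{27.04}$$

and, therefore, 

$$\Delta^*\mathbf{r}_b = \mathbf{r}'_b(t) - \mathbf{r}_b(t) = -\mathbf{v}_b\,\Delta t_b \tag{27.05}$$

Thus the total variation in the form of the function $\mathbf{r}_b(t)$ is 

$$\delta\mathbf{r}_b = \Delta\mathbf{r}_b + \Delta^*\mathbf{r}_b = \Delta\mathbf{r}_b - \mathbf{v}_b\,\Delta t_b \tag{27.06}$$

Using (27.01) and (27.02) the total variation $\delta\mathbf{r}_b$ can now be expressed in terms 
of the parameters of the infinitesimal Lorentz transformation as follows: 

$$\delta\mathbf{r}_b = -\mathbf{v}_b\tau + \mathbf{a} - \mathbf{V}t + \frac{1}{c^2}\mathbf{v}_b(\mathbf{V}\cdot\mathbf{r}_b) + [\boldsymbol{\omega}\times\mathbf{r}_b] \tag{27.07}$$

and evidently the total variation $\delta\mathbf{v}_b$ of the velocity $\mathbf{v}_b$ will be 

$$\delta\mathbf{v}_b = \frac{d}{dt}\,\delta\mathbf{r}_b \tag{27.08}$$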

In the expression for $\delta\mathbf{r}_b$ the terms containing V are the same as the terms of 
first order in V in equation (26.36), as was to be expected. 

We now calculate by two methods the variation of the Lagrangian in an 
infinitesimal Lorentz transformation. The first method is based on the use of 
the equations of motion, the second on direct transformation and the use of 
(26.48) and (26.50). We have 

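$$\delta L = \sum_a\left(\frac{\partial L}{\partial \mathbf{v}_a}\cdot\delta\mathbf{v}_a + \frac{\partial L}{\partial \mathbf{r}_a}\cdot\delta\mathbf{r}_a\right) \tag{27.09}$$

since the Lagrangian does not involve time explicitly. 

Here $\partial L/\partial\mathbf{r}_a$ is to be understood to mean the three-dimensional vector with the 
components $\partial L/\partial x_a$, $\partial L/\partial y_a$ and $\partial L/\partial z_a$, and $\partial L/\partial\mathbf{v}_a$ is defined correspondingly. 
Using the equations of motion and also the relation (27.08) we can write 

$$\delta L = \frac{d}{dt}\sum_a\frac{\partial L}{\partial \mathbf{v}_a}\cdot\delta\mathbf{r}_a \tag{27.10}$$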

On the other hand, as mentioned before, we can evaluate the variation in the 
Lagrangian L without using the equations of motion. Evidently L does not 
vary when the spatial origin is displaced or when the spatial axes are rotated. 
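If the zero of time is changed L varies by $-\tau\,dL/dt$ for if $L'(t') = L(t)$ where 
$t' = t + \tau$, then 

$$L'(t) = L(t - \tau) = L(t) - \tau\frac{dL}{dt} \tag{27.11}$$

The change in L as a result of passing to a moving frame is given by (26.48) 
where it is sufficient to take for F the terms of (26.50) which are of first order 
in V, namely 

$$F = \sum_a\left(-m_a + \frac{m_a v_a^2}{2c^2} - \frac{e_a}{2c^2}\sum_{\substack{b \\ (b \ne a)}}\frac{e_b}{|\mathbf{r}_a - \mathbf{r}_b|}\right)(\mathbf{V}\cdot\mathbf{r}_a) \tag{27.12}$$

Thus 

$$\delta L = -\tau\frac{dL}{dt} + \frac{dF}{dt} = \frac{d}{dt}(-\tau L + F) \tag{27.13}$$

Equating (27.10) and (27.13) we get 

$$\frac{d}{dt}\left(\sum_a\frac{\partial L}{\partial \mathbf{v}_a}\cdot\delta\mathbf{r}_a + \tau L - F\right) = 0 \tag{27.14}$$

whence 

$$\sum_a\frac{\partial L}{\partial \mathbf{v}_a}\cdot\delta\mathbf{r}_a + \tau L - F = \text{const.} \tag{27.15}$$

The left-hand side of this expression is a linear function of the ten parameters 
of the Lorentz transformation. Inserting into (27.15) the expressions (27.07) 
for $\delta\mathbf{r}_a$ and (27.12) for F we can put 

$$I \equiv -\tau W + \mathbf{a}\cdot\mathbf{P} + \mathbf{V}\cdot\mathbf{K} + \boldsymbol{\omega}\cdot\mathbf{M} = \text{const.} \tag{27.16}$$

where W, P, K and M do not depend on the parameters and are given by 

$$W = \sum_a\left(\mathbf{v}_a\cdot\frac{\partial L}{\partial \mathbf{v}_a}\right) - L \tag{27.17}$$

$$\mathbf{P} = \sum_a\frac{\partial L}{\partial \mathbf{v}_a} \tag{27.18}$$

$$\mathbf{K} = \sum_a\left\{\mathbf{r}_a\left(m_a - \frac{m_a v_a^2}{2c^2} + \frac{e_a}{2c^2}\sum_{\substack{b \\ (b \ne a)}}\frac{e_b}{|\mathbf{r}_a - \mathbf{r}_b|} + \frac{1}{c^2}\left(\mathbf{v}_a\cdot\frac{\partial L}{\partial \mathbf{v}_a}\right)\right) - t\frac{\partial L}{\partial \mathbf{v}_a}\right\} \tag{27.19}$$

$$\mathbf{M} = \sum_a\left[\mathbf{r}_a\times\frac{\partial L}{\partial \mathbf{v}_a}\right] \tag{27.20}$$

The left-hand side of (27.16) must be constant whatever the values of the 
parameters $\tau$, a, V and $\boldsymbol{\omega}$. This can only be if the quantities W, P, K and 
M are themselves constant. Thus we have obtained ten integrals each of which 
is connected with a particular one of the ten parameters of the infinitesimal 
Lorentz transformation. The physical meaning of these integrals is easy to 
see. W is the energy, P the momentum, K the integral of mass centre motion 
and M the angular momentum. Thus the ten integrals we have found are the 
classic integrals of the theory of systems of mass points, but with corrections 
for Relativity. 

Let us write down the integrals explicitly. We have 

$$W = c^2\sum_a m_a + \frac{1}{2}\sum_a m_a v_a^2 + \frac{3}{8c^2}\sum_a m_a v_a^4 + \frac{1}{2}\sum_{\substack{a,b \\ (a \ne b)}}\frac{e_a e_b}{|\mathbf{r}_a - \mathbf{r}_b|} + \frac{1}{4c^2}\sum_{\substack{a,b \\ (a \ne b)}}\frac{e_a e_b(\mathbf{v}_a\cdot\mathbf{v}_b)}{|\mathbf{r}_a - \mathbf{r}_b|} + \frac{1}{4c^2}\sum_{\substack{a,b \\ (a \ne b)}}\frac{e_a e_b}{|\mathbf{r}_a - \mathbf{r}_b|^3}\{\mathbf{v}_a\cdot(\mathbf{r}_a - \mathbf{r}_b)\}\{\mathbf{v}_b\cdot(\mathbf{r}_a - \mathbf{r}_b)\} \tag{27.21}$$

This is the energy of the system. 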

If we put 

$$W = c^2\sum_a m_a + E \tag{27.22}$$

the quantity E will be the energy as it is usually defined, namely, so as to 
vanish for zero velocities and infinite distances between particles. Side by side 
with the energy we shall consider the corresponding mass 

$$M = \frac{W}{c^2} = \sum_a m_a + \frac{E}{c^2} \tag{27.23}$$

which may be called the total mass of the system. By the definition just given 
the total mass of a system is equal to the sum of the rest-masses of all the 
separate particles and of the mass corresponding to their kinetic and 
interaction energies. 

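Let us now discuss the other integrals of motion. By (27.18) the integral of 
momentum is 

$$\mathbf{P} = \sum_a m_a\mathbf{v}_a\left(1 + \frac{v_a^2}{2c^2}\right) + \frac{1}{2c^2}\sum_{\substack{a,b \\ (a \ne b)}}\frac{e_a e_b}{|\mathbf{r}_a - \mathbf{r}_b|}\left(\mathbf{v}_b + \frac{(\mathbf{r}_a - \mathbf{r}_b)\{\mathbf{v}_b\cdot(\mathbf{r}_a - \mathbf{r}_b)\}}{|\mathbf{r}_a - \mathbf{r}_b|^2}\right) \tag{27.24}$$

To write the integral of the mass centre motion in an explicit form we introduce 
for brevity 

$$M^*_a = m_a\left(1 + \frac{v_a^2}{2c^2}\right) + \frac{e_a}{2c^2}\sum_{\substack{b \\ (b \ne a)}}\frac{e_b}{|\mathbf{r}_a - \mathbf{r}_b|} \tag{27.25}$$

Then we have 

$$\mathbf{K} = \sum_a M^*_a\mathbf{r}_a - t\mathbf{P} \tag{27.26}$$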

It should be noted that 

$$\sum_a M^*_a = M \tag{27.27}$$

where M is understood to mean the expression (27.23) taken to the same 
approximation as that in which $M^*_a$ is given: this corresponds to the 
non-relativistic approximation for the energy E. 
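
The relation (27.27) is easy to confirm numerically. The following sketch evaluates $M^*_a$ from (27.25) for three particles and compares the sum with $\sum_a m_a + E/c^2$, where E is taken in the non-relativistic approximation (kinetic plus Coulomb energy). All numerical values, the value c = 10 and the function names are assumptions of this illustration.

```python
import math

def effective_masses(m, e, r, v, c):
    """M*_a of (27.25) for every particle a."""
    n = len(m)
    out = []
    for a in range(n):
        v2 = sum(x * x for x in v[a])
        coulomb = sum(e[b] / math.dist(r[a], r[b]) for b in range(n) if b != a)
        out.append(m[a] * (1.0 + v2 / (2.0 * c**2)) + e[a] * coulomb / (2.0 * c**2))
    return out

def nonrelativistic_energy(m, e, r, v):
    """Kinetic plus Coulomb energy; each pair is counted twice, hence the factor 1/2."""
    n = len(m)
    kinetic = sum(0.5 * m[a] * sum(x * x for x in v[a]) for a in range(n))
    potential = 0.5 * sum(e[a] * e[b] / math.dist(r[a], r[b])
                          for a in range(n) for b in range(n) if a != b)
    return kinetic + potential

# Illustrative three-particle configuration (not from the text).
c = 10.0
m = [1.0, 2.0, 0.5]
e = [1.0, -1.0, 0.5]
r = [(0.0, 0.0, 0.0), (1.0, 0.5, 0.0), (-0.5, 1.0, 2.0)]
v = [(0.1, 0.0, 0.2), (-0.3, 0.1, 0.0), (0.0, 0.4, -0.1)]

M_star = effective_masses(m, e, r, v, c)
total_mass = sum(m) + nonrelativistic_energy(m, e, r, v) / c**2   # (27.23)
mass_defect = sum(M_star) - total_mass                            # zero by (27.27)

# Mass centre defined through the weights M*_a.
R = tuple(sum(M_star[a] * r[a][i] for a in range(len(m))) / total_mass for i in range(3))
```

The vector `R` computed at the end is the weighted mean of the particle positions with weights $M^*_a$, so the interaction energy shifts the mass centre slightly away from the value obtained with the rest-masses alone.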

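Introducing the position vector R of the mass centre of the system by the 
relation 

$$M\mathbf{R} = \sum_a M^*_a\mathbf{r}_a \tag{27.28}$$

expression (27.26) for K takes on the form 

$$\mathbf{K} = M\mathbf{R} - t\mathbf{P} \tag{27.29}$$

the fact that K is constant represents the law of motion of the mass centre. 
Finally, the integrals of the angular momentum of the system are 

$$\mathbf{M} = \sum_a m_a\left(1 + \frac{v_a^2}{2c^2}\right)[\mathbf{r}_a\times\mathbf{v}_a] + \frac{1}{2c^2}\sum_{\substack{a,b \\ (a \ne b)}}\frac{e_a e_b}{|\mathbf{r}_a - \mathbf{r}_b|}\left([\mathbf{r}_a\times\mathbf{v}_b] - \frac{[\mathbf{r}_a\times\mathbf{r}_b]\{\mathbf{v}_b\cdot(\mathbf{r}_a - \mathbf{r}_b)\}}{|\mathbf{r}_a - \mathbf{r}_b|^2}\right) \tag{27.30}$$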

28. The Tensor Character of the Integrals of Motion 

We must now investigate the behaviour of the integrals of motion under 
Lorentz transformations. If we knew from the start that 

$$I = -\tau W + \mathbf{a}\cdot\mathbf{P} + \mathbf{V}\cdot\mathbf{K} + \boldsymbol{\omega}\cdot\mathbf{M} \tag{28.01}$$

was an invariant we could decide the tensor character of W, P, K and M from 
the properties of the parameters $\tau$, a, V and $\boldsymbol{\omega}$ of the infinitesimal Lorentz 
transformation. Since, however, we have not proved the invariance of (28.01) 
we shall adopt a more direct approach, though it involves rather tedious 
calculations which we shall refrain from elaborating. We shall subject W, 
P, K and M to a finite Lorentz transformation and see how they can be expressed 
in terms of the initial quantities. 

We shall denote by W', P', K' and M' the quantities resulting from putting 
$\mathbf{r}'_a$ and $\mathbf{v}'_a$ in place of $\mathbf{r}_a$ and $\mathbf{v}_a$, with the use of (26.36) and (26.37). 

We obtain 

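$$W' = \left(1 + \frac{1}{2}\frac{V^2}{c^2} + \frac{3}{8}\frac{V^4}{c^4}\right)W - \left(1 + \frac{1}{2}\frac{V^2}{c^2}\right)(\mathbf{V}\cdot\mathbf{P}) \tag{28.02}$$

and

$$\mathbf{P}' = \mathbf{P} - \mathbf{V}\frac{W}{c^2} + \frac{\mathbf{V}}{2c^2}\left\{\mathbf{V}\cdot\left(\mathbf{P} - \mathbf{V}\frac{W}{c^2}\right)\right\} \tag{28.03}$$

or, in terms of the total mass,

$$c^2M' = \left(c^2 + \frac{1}{2}V^2 + \frac{3}{8}\frac{V^4}{c^2}\right)M - \left(1 + \frac{1}{2}\frac{V^2}{c^2}\right)(\mathbf{V}\cdot\mathbf{P}) \tag{28.04}$$

and

$$\mathbf{P}' = \mathbf{P} - \mathbf{V}M + \frac{\mathbf{V}}{2c^2}\{\mathbf{V}\cdot(\mathbf{P} - \mathbf{V}M)\} \tag{28.05}$$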


To the same approximation these equations can also be written as 


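$$c^2M' = \frac{c^2M - (\mathbf{V}\cdot\mathbf{P})}{\sqrt{1 - V^2/c^2}} \tag{28.06}$$

and

$$\mathbf{P}' = \mathbf{P} - \mathbf{V}M + \left(\frac{1}{\sqrt{1 - V^2/c^2}} - 1\right)\frac{\mathbf{V}}{V^2}\{\mathbf{V}\cdot(\mathbf{P} - \mathbf{V}M)\} \tag{28.07}$$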

Comparing these equations with the usual ones for Lorentz transformations
(26.27) and (26.28), it is apparent that the components $P_x$, $P_y$ and $P_z$ of the
vector P transform like the coordinates x, y and z, and the total mass like the
time t. This means that the set of quantities

$$P^0 = Mc = \frac{W}{c}; \quad P^1 = P_x; \quad P^2 = P_y; \quad P^3 = P_z \tag{28.08}$$

is a contravariant vector. The covariant components of this four-dimensional
vector are

$$P_0 = Mc = \frac{W}{c}; \quad P_1 = -P_x; \quad P_2 = -P_y; \quad P_3 = -P_z \tag{28.09}$$

If we consider the whole system of charges as one compound body, we must 
ascribe to this body the mass M and the momentum P. We can also ascribe 
to it, or more exactly to its mass centre, the velocity 


$$\frac{\mathbf{P}}{M} \tag{28.10}$$

and the rest-mass

$$\mu = \sqrt{M^2 - \frac{P^2}{c^2}} \tag{28.11}$$


The quantity μ is equal to M' in that frame of reference in which P' = 0. It
is evident from this example that the rest-mass of a body depends on its internal
state, in the present example on the state of motion of the constituent particles.
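
The agreement between the expanded formulas (28.04), (28.05) and the closed forms (28.06), (28.07), as well as the invariance of the rest-mass (28.11), can be checked numerically. The sketch below assumes units with c = 1 and hypothetical values for M, P and V; the function names are illustrative only.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def boost_exact(M, P, V, c=1.0):
    """Closed forms (28.06) and (28.07) for the transformed mass and momentum."""
    V2 = dot(V, V)
    gamma = 1.0 / math.sqrt(1.0 - V2 / c**2)
    M_new = gamma * (M - dot(V, P) / c**2)
    q = [p - v * M for p, v in zip(P, V)]          # the combination P - V M
    P_new = [qi + (gamma - 1.0) * v * dot(V, q) / V2 for qi, v in zip(q, V)]
    return M_new, P_new


def boost_series(M, P, V, c=1.0):
    """Expanded forms (28.04) and (28.05), valid to the stated order in V/c."""
    x = dot(V, V) / c**2
    M_new = (1.0 + x / 2.0 + 3.0 * x**2 / 8.0) * M \
        - (1.0 + x / 2.0) * dot(V, P) / c**2
    q = [p - v * M for p, v in zip(P, V)]
    P_new = [qi + v * dot(V, q) / (2.0 * c**2) for qi, v in zip(q, V)]
    return M_new, P_new


def rest_mass(M, P, c=1.0):
    """Rest-mass (28.11)."""
    return math.sqrt(M**2 - dot(P, P) / c**2)


# Hypothetical slow system and slow relative velocity (c = 1).
M, P, V = 2.0, [0.01, -0.02, 0.005], [0.01, 0.0, 0.0]
M_exact, P_exact = boost_exact(M, P, V)
M_series, P_series = boost_series(M, P, V)
```

For velocities of order 0.01 c the two versions differ only by terms of higher order than those retained, while `rest_mass` evaluated before and after the exact transformation gives the same value up to rounding.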

We now turn to the transformation law for K and M. Making the same 
insertions from (26.36) and (26.37) we get for K' and M' 

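$$\mathbf{K}' = \left(1 + \frac{V^2}{2c^2}\right)\mathbf{K} - \frac{1}{2c^2}\mathbf{V}(\mathbf{V}\cdot\mathbf{K}) - \frac{1}{c^2}[\mathbf{V}\times\mathbf{M}] \tag{28.12}$$

and

$$\mathbf{M}' = \left(1 + \frac{V^2}{2c^2}\right)\mathbf{M} - \frac{1}{2c^2}\mathbf{V}(\mathbf{V}\cdot\mathbf{M}) + \left(1 + \frac{V^2}{2c^2}\right)[\mathbf{V}\times\mathbf{K}] \tag{28.13}$$

To the same approximation we can also put

$$\mathbf{K}' = \frac{\mathbf{V}}{V^2}(\mathbf{V}\cdot\mathbf{K}) + \frac{1}{\sqrt{1 - V^2/c^2}}\left(\mathbf{K} - \frac{\mathbf{V}}{V^2}(\mathbf{V}\cdot\mathbf{K}) - \frac{1}{c^2}[\mathbf{V}\times\mathbf{M}]\right) \tag{28.14}$$

and

$$\mathbf{M}' = \frac{\mathbf{V}}{V^2}(\mathbf{V}\cdot\mathbf{M}) + \frac{1}{\sqrt{1 - V^2/c^2}}\left(\mathbf{M} - \frac{\mathbf{V}}{V^2}(\mathbf{V}\cdot\mathbf{M}) + [\mathbf{V}\times\mathbf{K}]\right) \tag{28.15}$$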

We compare these relations with those describing the transformation of an 
antisymmetric tensor, bearing in mind that in the three-dimensional sense K 
is a polar vector and M an axial one. The required transformation equations 
have been stated in Section 24 for the case of the electromagnetic field tensor 
(equations (24.37) and (24.38)). To make the two sets of equations coincide 
we must take it that the vector M transforms like H and the vector K like 
—E/c. The other possibility M ~ E, K ~ H/c is excluded because K and E 
are polar and M and H axial vectors. We can therefore introduce an anti¬ 
symmetric tensor in terms of its covariant components 


$$M_{23} = M_x; \quad M_{31} = M_y; \quad M_{12} = M_z; \qquad M_{10} = -cK_x; \quad M_{20} = -cK_y; \quad M_{30} = -cK_z \tag{28.16}$$

or equally well in terms of its contravariant components

$$M^{23} = M_x; \quad M^{31} = M_y; \quad M^{12} = M_z; \qquad M^{10} = cK_x; \quad M^{20} = cK_y; \quad M^{30} = cK_z \tag{28.17}$$


A check on the correctness of this is the observation that for a single particle

$$M_x = m(x^2u^3 - x^3u^2),\ \text{etc.}; \qquad cK_x = m(x^1u^0 - x^0u^1),\ \text{etc.} \tag{28.18}$$

where $u^0$, $u^1$, $u^2$ and $u^3$ denote the components of the four-dimensional velocity
and where, to stress their contravariance, space and time coordinates have
been written with superfixes: $x^0 = ct$, $x^1 = x$, $x^2 = y$ and $x^3 = z$.
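
The single-particle check can also be carried out numerically. The sketch below is an illustration under stated assumptions: a free particle in units with c = 1, with hypothetical values for its mass, position, velocity and for the relative velocity V. It forms K = M r − t P and M = [r × P] directly in the new frame and compares them with the transformation formulas (28.14) and (28.15). The angular momentum vector is called `L` in the code to keep it apart from the scalar mass `M`.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def lorentz(s, u, V, c):
    """Lorentz transformation of a pair (t, r) or, in the same form, (M, P)."""
    g = 1.0 / math.sqrt(1.0 - dot(V, V) / c**2)
    par = dot(V, u) / dot(V, V)
    s_new = g * (s - dot(V, u) / c**2)
    u_new = [ui + (g - 1.0) * par * vi - g * vi * s for ui, vi in zip(u, V)]
    return s_new, u_new


def integrals(m, t, r, v, c):
    """Mass, momentum, K = M r - t P and angular momentum of one free particle."""
    g = 1.0 / math.sqrt(1.0 - dot(v, v) / c**2)
    M = m * g
    P = [M * vi for vi in v]
    K = [M * ri - t * pi for ri, pi in zip(r, P)]
    L = cross(r, P)
    return M, P, K, L


def transform_K_L(K, L, V, c):
    """Formulas (28.14) and (28.15)."""
    g = 1.0 / math.sqrt(1.0 - dot(V, V) / c**2)
    V2 = dot(V, V)
    K_par = [vi * dot(V, K) / V2 for vi in V]
    L_par = [vi * dot(V, L) / V2 for vi in V]
    VxL, VxK = cross(V, L), cross(V, K)
    K_new = [kp + g * (k - kp - x / c**2) for k, kp, x in zip(K, K_par, VxL)]
    L_new = [lp + g * (l - lp + x) for l, lp, x in zip(L, L_par, VxK)]
    return K_new, L_new


c = 1.0
m, t, r, v = 1.5, 0.7, [1.0, 2.0, -0.5], [0.3, -0.2, 0.1]   # hypothetical values
V = [0.2, 0.1, -0.3]

M, P, K, L = integrals(m, t, r, v, c)
K_formula, L_formula = transform_K_L(K, L, V, c)

# Direct evaluation in the new frame from the transformed event and momentum.
t2, r2 = lorentz(t, r, V, c)
M2, P2 = lorentz(M, P, V, c)
K_direct = [M2 * ri - t2 * pi for ri, pi in zip(r2, P2)]
L_direct = cross(r2, P2)
```

Both routes give the same vectors up to rounding, which is what the antisymmetric-tensor character of the six quantities requires for a single particle.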

We have thus shown that of the ten integrals of motion four, namely energy 
and momentum, form a four-dimensional vector and the remaining six, the 
integrals of mass centre motion and the angular momentum, an antisymmetric 
tensor. This allows one to assert that if these quantities are constant in any 
one frame of reference they will be constant in all frames. The circumstance 
that we deduced the conservation laws from approximate equations is related 
to the approximate character of the statement of the whole problem. We 
have already remarked that strictly speaking the energy and the momentum 
of a system of charges do not remain constant but may be dissipated as radiation. 

Having established the tensor character of the integrals of motion we can
verify that the expression (28.01) is really invariant. To do this it is sufficient 
to rewrite the expression for I in four-dimensional notation. According to 
Section 23 we have 

$$\tau = -\frac{a_0}{c}; \quad a_x = -a_1; \quad a_y = -a_2; \quad a_z = -a_3 \tag{28.19}$$

where $a_0$, $a_1$, $a_2$ and $a_3$ form a covariant vector.

Further 

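$$V_x = c\,\omega_{10}; \quad V_y = c\,\omega_{20}; \quad V_z = c\,\omega_{30} \tag{28.20}$$

and

$$\omega_x = \omega_{23}; \quad \omega_y = \omega_{31}; \quad \omega_z = \omega_{12} \tag{28.21}$$

where the $\omega_{ik}$ are the covariant components of an antisymmetric tensor.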
Using the four-dimensional notation (28.08) and (28.17) for the integrals of 
motion we get in place of (28.01) 

$$I = -a_0P^0 - a_1P^1 - a_2P^2 - a_3P^3 + \omega_{10}M^{10} + \omega_{20}M^{20} + \omega_{30}M^{30} + \omega_{23}M^{23} + \omega_{31}M^{31} + \omega_{12}M^{12} \tag{28.22}$$

or, more briefly,

$$I = -\sum_{i=0}^{3}a_iP^i + \frac{1}{2}\sum_{i,k=0}^{3}\omega_{ik}M^{ik} \tag{28.23}$$

proving our assertion that I is invariant. 


29. A Remark on the Conventional Formulation of the Conservation 
Laws 


In this section we make some remarks of a critical nature on the conventional
formulation of the conservation laws. For definiteness we shall deal with the
conservation laws for energy and momentum as they are the ones most
frequently discussed.

One writes expressions for the energy, or the total mass, and for the
momentum in the form

$$W = Mc^2 = \sum_a \frac{m_ac^2}{\sqrt{1 - v_a^2/c^2}} \tag{29.01}$$

and

$$\mathbf{P} = \sum_a \frac{m_a\mathbf{v}_a}{\sqrt{1 - v_a^2/c^2}} \tag{29.02}$$

and makes two statements about them: first that these quantities form a
four-dimensional vector and, second, that they remain constant.

In fact both statements are only correct in the case that the particles do
not interact. Then, however, the velocity of each particle separately remains
constant and each quartet of quantities

$$W^{(a)} = \frac{m_ac^2}{\sqrt{1 - v_a^2/c^2}}, \qquad \mathbf{P}^{(a)} = \frac{m_a\mathbf{v}_a}{\sqrt{1 - v_a^2/c^2}} \tag{29.03}$$

is a constant four-dimensional vector. Thus, in the absence of interaction the
constancy of the sums (29.01) and (29.02) follows trivially from the constancy
of their separate terms (29.03); the summation tells one nothing new.

On the other hand, when interactions are present the expressions (29.01)
and (29.02) do not remain constant and do not form a four-dimensional vector.
The first part of this statement is easy to understand, for even in non-relativistic
theory it is not the kinetic energy but the total energy of a system of particles
that remains constant.

The Theory of Relativity introduces corrections that take care of
interactions not only into the expression for the energy but also into the one for
the momentum of the system (see (27.21) and (27.24)). Only with the inclusion
of these corrections will the two expressions be constant. As for vectorial
properties, the following state of affairs might at first sight seem paradoxical.
We saw in Section 25 (equations (25.23) and (25.24)) that the set of quantities
(29.03) is, for a single particle, a four-dimensional vector even for accelerated
motion; further, it would seem that a sum of vectors must also be a vector.
In spite of this we assert that the sums (29.01) and (29.02) do not represent
the components of a four-dimensional vector. The paradox is removed by the
consideration that the quantities $W^{(a)}$ and $\mathbf{P}^{(a)}$ are functions of a time common
to all particles. Their sums W and P are in fact sums of the values of $W^{(a)}$
and $\mathbf{P}^{(a)}$ simultaneous in the given frame. In a different frame of reference these
values will no longer be simultaneous ones. Therefore, when passing to a new
frame one has not only to form linear combinations of the $W^{(a)}$ and $\mathbf{P}^{(a)}$
according to the vector rule but must also recalculate these quantities
appropriately to the new simultaneity, as was elaborated in Section 26. Such a
recalculation changes nothing only in the case of constant $W^{(a)}$, $\mathbf{P}^{(a)}$; the general
case involves the change of these quantities during the time that corresponds
to the change from old to new simultaneity, as was shown approximately in
Section 26. It is therefore clear that the sums $W = \sum_a W^{(a)}$ and $\mathbf{P} = \sum_a \mathbf{P}^{(a)}$
will have vectorial properties only in the case of non-interacting particles and
not in general.
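
The role of simultaneity can be made concrete with a deliberately crude toy model, which is an assumption of this illustration and not the interaction treated in Section 27: two particles on a line, at different places, whose momenta change abruptly at the same instant of the original frame. In units with c = 1 and with hypothetical numerical values, the sketch compares the sum of the individual four-momenta taken simultaneously in a moving frame with the Lorentz transform of the sum taken simultaneously in the original frame.

```python
import math


def energy(m, p, c=1.0):
    return math.sqrt((m * c**2)**2 + (p * c)**2)


def four_momentum(m, p, c=1.0):
    return (energy(m, p, c), p)


def boost(E, p, V, c=1.0):
    """Lorentz transformation of (E, p) along the line of motion."""
    g = 1.0 / math.sqrt(1.0 - V * V / c**2)
    return (g * (E - V * p), g * (p - V * E / c**2))


def add(u, w):
    return (u[0] + w[0], u[1] + w[1])


V = 0.6
x_a, x_b = 0.0, 1.0          # places where the momenta change (old frame, t = 0)

a_before, a_after = four_momentum(1.0, 0.3), four_momentum(1.0, 0.5)
b_before, b_after = four_momentum(1.0, -0.3), four_momentum(1.0, -0.5)

# Sums taken at one instant of the old frame, before and after t = 0.
sum_before = add(a_before, b_before)
sum_after = add(a_after, b_after)

# Instants of the two changes as judged in the new frame.
g = 1.0 / math.sqrt(1.0 - V * V)
t_change_a = g * (0.0 - V * x_a)
t_change_b = g * (0.0 - V * x_b)      # earlier than t_change_a

# An instant of the new frame lying between the two changes.
t_new = 0.5 * (t_change_a + t_change_b)
state_a = a_after if t_new >= t_change_a else a_before
state_b = b_after if t_new >= t_change_b else b_before
sum_new_frame = add(boost(*state_a, V), boost(*state_b, V))
```

At the chosen instant of the new frame particle b has already changed its momentum while particle a has not, so `sum_new_frame` coincides neither with `boost(*sum_before, V)` nor with `boost(*sum_after, V)`. With constant individual four-momenta the discrepancy disappears, in agreement with the statement made above.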

In the presence of interactions the quantities that are vectors and also
constant are, to the approximation considered, the integrals of motion derived
in Section 27. Those expressions differ from (29.01) and (29.02) on the one hand
by their inclusion of interaction terms, on the other hand by the fact that the
quantities $m_a/\sqrt{1 - v_a^2/c^2}$ are replaced by the leading terms of the expansions
of these quantities in powers of $v_a^2/c^2$, the expansions being taken to the same
order as in the interaction terms.

It may, however, happen that with all particles a large distance apart, the 
interaction terms contribute negligibly but that at the same time the speeds 
of the particles are very great. In such a limiting case of weakly interacting 
particles the expressions (29.01) and (29.02) can be used but it must be borne 
in mind that they are applicable only before the beginning and after the end of 
the interaction, and not in the intervening time. 

In the limiting case stated one can use conservation laws for energy and
momentum in the usual form

$$\sum_a \frac{m_ac^2}{\sqrt{1 - v_a^2/c^2}} = \sum_a \frac{m_ac^2}{\sqrt{1 - v_a^{*2}/c^2}} \tag{29.04}$$

and

$$\sum_a \frac{m_a\mathbf{v}_a}{\sqrt{1 - v_a^2/c^2}} = \sum_a \frac{m_a\mathbf{v}_a^*}{\sqrt{1 - v_a^{*2}/c^2}} \tag{29.05}$$

where $\mathbf{v}_a$ and $\mathbf{v}_a^*$ respectively are the values of the velocity of the a-th particle
before and after the interaction. It is clear from what has been said that the
left-hand sides of these expressions do not remain constant during the whole
time of interaction, but that after the interaction they regain the values they
had in the beginning.

The remarks made in this section are equally true in relation to the
conventional formulation of the conservation law for angular momentum, in which
also interaction terms are not considered.


30. The Vector of Energy Current (Umov’s Vector). 

Let us consider the familiar non-relativistic equations of motion in the
mechanics of continuous media. They have the form

$$\rho\frac{dv_i}{dt} = \rho F_i + \sum_{k=1}^{3}\frac{\partial p_{ik}}{\partial x_k} \qquad (i = 1, 2, 3) \tag{30.01}$$

and

$$\frac{\partial\rho}{\partial t} + \sum_{i=1}^{3}\frac{\partial(\rho v_i)}{\partial x_i} = 0 \tag{30.02}$$

Here $v_1$, $v_2$ and $v_3$ are the components of the velocities of the particles
constituting the medium, ρ is the density, $p_{ik}$ the stress tensor† and $F_i$ the
components of the external force acting on a unit of mass. The accelerations
$dv_i/dt$ appearing on the left-hand side of (30.01) are the so-called substantial
derivatives, given by

$$\frac{dv_i}{dt} = \frac{\partial v_i}{\partial t} + \sum_{k=1}^{3}v_k\frac{\partial v_i}{\partial x_k} \tag{30.03}$$


The equation (30.02) is usually called the equation of continuity. It is well 
known that it expresses the conservation of mass. 

The system of equations (30.01) and (30.02) is not in itself complete; it
does not allow one to determine the motion of the medium from given initial
and boundary conditions. Two steps are needed to obtain a complete system
of equations: first, one has to express the stress tensor in terms of other
quantities such as deformations, velocities, pressure, temperature and
electromagnetic and other fields and, second, if the number of unknown functions is
greater than four (i.e. greater than the number of equations in (30.01) and
(30.02)) one must add to these equations some others such as the equation of
state, the equation of heat flow, field equations, etc.

In the following we shall confine ourselves to the case of a conservative 
system in the absence of external forces. 

We consider first a compressible elastic medium. Then we can introduce
Π, the potential energy per unit mass of the medium, and express the stress
tensor in terms of it. Let $a_1$, $a_2$ and $a_3$ be some Lagrange variables, say the
initial coordinates of a particle. At time t the coordinates of the particle will
then be

$$x_i = x_i(a_1, a_2, a_3, t) \qquad (i = 1, 2, 3) \tag{30.04}$$

and the deformation of the medium is characterized by the set of quantities

$$A_{mn} = \sum_{i=1}^{3}\frac{\partial x_i}{\partial a_m}\frac{\partial x_i}{\partial a_n} \tag{30.05}$$

† In this section we use the terms “ vector ” and “ tensor ” in their three-dimensional
meaning.


The potential energy Π is a function of the deformation and if one puts

$$d\Pi = \frac{1}{2\rho}\sum_{m,n=1}^{3}P_{mn}\,dA_{mn} \qquad (P_{mn} = P_{nm}) \tag{30.06}$$

the components $p_{ik}$ of the stress tensor can be expressed in terms of the
coefficients $P_{mn}$ as follows

$$p_{ik} = \sum_{m,n=1}^{3}P_{mn}\frac{\partial x_i}{\partial a_m}\frac{\partial x_k}{\partial a_n} \qquad (p_{ik} = p_{ki}) \tag{30.07}$$

From these equations it is readily deduced that 

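$$\rho\frac{d\Pi}{dt} = \rho\left(\frac{\partial\Pi}{\partial t} + \sum_{i=1}^{3}v_i\frac{\partial\Pi}{\partial x_i}\right) = \sum_{i,k=1}^{3}p_{ik}\frac{\partial v_i}{\partial x_k} \tag{30.08}$$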

In the case of a fluid

$$p_{ik} = -p\,\delta_{ik} \tag{30.09}$$

where p is the pressure, so that the tensor $p_{ik}$ reduces to a scalar. In this case
equation (30.08) becomes

$$\rho\frac{d\Pi}{dt} = -p\,\operatorname{div}\mathbf{v} = \frac{p}{\rho}\frac{d\rho}{dt} \tag{30.10}$$

by virtue of (30.02), and this leads to the usual expression

$$\Pi = \int\frac{p\,d\rho}{\rho^2} = \int\frac{dp}{\rho} - \frac{p}{\rho} \tag{30.11}$$

for the potential energy of a unit mass of fluid. For infinitesimal deformations, 
when the usual theory of elasticity applies, relation (30.08) can equally well be 
verified directly without recourse to Lagrange variables. 
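
The two forms of (30.11) can be compared numerically for a definite equation of state. The sketch below assumes a polytropic law p = K ρ^γ with hypothetical constants, which is not specified in the text, and uses Simpson's rule for the integrals between two densities.

```python
def simpson(f, a, b, n=1000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0


K_POLY, GAMMA = 2.0, 1.4     # hypothetical polytropic constants


def pressure(rho):
    return K_POLY * rho**GAMMA


def dp_drho(rho):
    return K_POLY * GAMMA * rho**(GAMMA - 1.0)


rho0, rho1 = 1.0, 3.0

# First form of (30.11): integral of p d(rho) / rho^2.
first_form = simpson(lambda rho: pressure(rho) / rho**2, rho0, rho1)

# Second form: integral of dp / rho minus the change of p / rho.
second_form = (simpson(lambda rho: dp_drho(rho) / rho, rho0, rho1)
               - (pressure(rho1) / rho1 - pressure(rho0) / rho0))

# Closed form for the polytropic law, for comparison.
closed_form = K_POLY * (rho1**(GAMMA - 1.0) - rho0**(GAMMA - 1.0)) / (GAMMA - 1.0)
```

For this equation of state both forms reduce to K ρ^(γ−1)/(γ−1) taken between the two densities, so the three numbers agree to the accuracy of the quadrature.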

It is interesting to consider the relation (30.08) in conjunction with the 
thermodynamic identity 


$$\rho\frac{du}{dt} = \sum_{i,k=1}^{3}p_{ik}\frac{\partial v_i}{\partial x_k} + \rho T\frac{d\sigma}{dt} \tag{30.12}$$

in which u is the internal energy, σ the entropy and T the temperature. For
isothermal processes one can put

$$\Pi = u - T\sigma = F \tag{30.13}$$

where F is the free energy, and for adiabatic processes

$$\Pi = u \tag{30.14}$$

We now turn to the equations of motion. Using the equation of continuity
we can write the three equations (30.01) in the form

$$\frac{\partial(\rho v_i)}{\partial t} + \sum_{k=1}^{3}\frac{\partial(\rho v_iv_k - p_{ik})}{\partial x_k} = 0 \tag{30.15}$$

Just as the equation of continuity

$$\frac{\partial\rho}{\partial t} + \sum_{k=1}^{3}\frac{\partial(\rho v_k)}{\partial x_k} = 0 \tag{30.02}$$

expresses the law of mass conservation, so one can take (30.15) to express
the law of conservation of momentum. In the equation of continuity the scalar
quantity ρ is the mass density and the vector with the components $\rho v_i$ (i = 1,
2, 3) is the mass current density. Similarly in (30.15) the vector $\rho v_i$ is the
momentum density and the tensor

$$S_{ik} = \rho v_iv_k - p_{ik} \tag{30.16}$$

the momentum current density. So the vector $\rho v_i$ enters once as the mass
current density and again as momentum density. Similarly, remembering
that $S_{ik} = S_{ki}$, this quantity enters once as the i-th component of the current
of $\rho v_k$ and again as the k-th component of the current of $\rho v_i$. We have just
seen that the equation of continuity and the equations of motion can be
interpreted as laws of conservation of mass and of momentum. It is natural to try
to express in an analogous manner the law of conservation of energy. This
was first done by Umov who introduced the important notion of energy current
as early as 1874 [4, 5].

We introduce a scalar

$$S = \tfrac{1}{2}\rho v^2 + \rho\Pi \tag{30.17}$$

and a vector with the components

$$S_i = v_i\left(\tfrac{1}{2}\rho v^2 + \rho\Pi\right) - \sum_{k=1}^{3}p_{ik}v_k \tag{30.18}$$

In these equations Π denotes as before the potential energy per unit mass.
It is obvious that the two terms in S are respectively the densities of kinetic
and of potential energy, so that S is the volume density of total energy. The
vector $S_i$, which we shall call Umov’s vector,† can be interpreted as the vector
of energy current. For, using the equations of motion, the equation of
continuity and the relation (30.08), it is easy to verify that

$$\frac{\partial S}{\partial t} + \sum_{i=1}^{3}\frac{\partial S_i}{\partial x_i} = 0 \tag{30.19}$$


This can obviously be interpreted as the conservation law for energy. 
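
The verification of (30.19) can be outlined as follows. This is a sketch for the conservative case without external forces, using only the equation of continuity, the equations of motion, the symmetry of the stress tensor and the relation (30.08) in the form with the stress tensor contracted with the velocity gradient; the intermediate quantity f is introduced here for convenience.

```latex
% Step 1: for any quantity f referred to unit mass, the equation of continuity gives
\rho\frac{df}{dt}
  = \frac{\partial(\rho f)}{\partial t}
  + \sum_{k=1}^{3}\frac{\partial(\rho f v_k)}{\partial x_k}.

% Step 2: multiply the equations of motion (with F_i = 0) by v_i, sum over i,
% and add the relation for the potential energy:
\rho\frac{d}{dt}\Bigl(\tfrac{1}{2}v^2 + \Pi\Bigr)
  = \sum_{i,k=1}^{3} v_i\frac{\partial p_{ik}}{\partial x_k}
  + \sum_{i,k=1}^{3} p_{ik}\frac{\partial v_i}{\partial x_k}
  = \sum_{i,k=1}^{3}\frac{\partial(p_{ik}v_i)}{\partial x_k}.

% Step 3: put f = v^2/2 + \Pi, so that \rho f = S, and combine steps 1 and 2:
\frac{\partial S}{\partial t}
  + \sum_{k=1}^{3}\frac{\partial}{\partial x_k}
    \Bigl(v_k S - \sum_{i=1}^{3}p_{ik}v_i\Bigr) = 0 .
```

Since the stress tensor is symmetric, the expression in brackets in the last line is the k-th component of Umov's vector, and the last line is the conservation law for energy.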

One should note that if the equations of motion (30.15) are given, then of
the two equations: (30.02), the continuity equation, and (30.19), the equation
describing energy flow, only one is independent. Either can be obtained from
the other by using the equations of motion. Instead of either of them one could
also take any linear combination of them. This would bring in a quantity
$\rho + \lambda S$ and the corresponding current $\rho v_i + \lambda S_i$ (with a constant λ having the
dimensions of (velocity)$^{-2}$). In the following section we shall see that the
relativistic form of the equations of motion for a continuous medium
corresponds approximately to choosing such a linear combination with $\lambda = 1/c^2$.

† The quantities S and $S_i$ actually discussed by Umov differed from those introduced here
by the absence of potential energy terms. Therefore, the right-hand side of the relation
given by Umov, which corresponds to (30.19), was not zero but the negative of (30.08), i.e.
the work done by the elastic forces in unit volume and unit time.

In conclusion, we point out the following circumstance. In the foregoing
discussion the flux of a certain quantity S was determined (by means of the
appropriate conservation law) from its divergence which was assumed given.
This does not determine the flux uniquely. According to its physical meaning,
however, the flux vector is a completely determined quantity. Therefore we
must expect that if the requirements of the theory of relativity are met, it
should be formally possible to make the definition of the flux unique, provided
certain additional conditions are imposed on it. One of these conditions is
the requirement that the flux vector of the function of state S should itself be
a function of state. This requirement will be formulated more precisely and
discussed in detail in the two following sections. In the examples here discussed
the requirement is satisfied by the energy flux (Umov’s vector), the mass flux
and the momentum flux.


31. The Mass Tensor 

In the previous section we saw that for a conservative system the usual 
non-relativistic equations of motion of a continuous medium can be stated as 
four equations in each of which a sum of derivatives with respect to the spatial 
coordinates and to the time is equated to zero. This form of equation leads 
us to surmise that in Relativity they possess a generalization of the form 


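$$\frac{\partial T^{00}}{\partial x_0} + \sum_{k=1}^{3}\frac{\partial T^{0k}}{\partial x_k} = 0; \qquad \frac{\partial T^{i0}}{\partial x_0} + \sum_{k=1}^{3}\frac{\partial T^{ik}}{\partial x_k} = 0 \tag{31.01}$$

where $T^{ik}$ (i, k = 0, 1, 2, 3) is some tensor.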

The first of these equations should express conservation of mass or energy
and the other three, for i = 1, 2 and 3, the conservation of momentum. If
$T^{00}$ is the density of total mass, including the rest-mass and the mass of the
kinetic energy, then $cT^{0i}$ is the mass current density. Also, if $cT^{i0}$ is the density
of the i-th component of momentum, then $c^2T^{ik}$ (k = 1, 2, 3) will be its current
density. The mass M contained in some volume and the energy W
corresponding to it will be expressible in terms of $T^{00}$ as the volume integral

$$M = \frac{W}{c^2} = \int T^{00}\,dV \tag{31.02}$$

Similarly the momentum contained in the same volume will be

$$P^i = c\int T^{i0}\,dV \tag{31.03}$$

The equations (31.01) express the fact that any increase of the energy or the 
momentum contained in a volume is due to an influx of energy or momentum 
from outside the volume, i.e. through the closed surface enclosing the volume in 
question. The integrals (31.02) and (31.03) taken over all space are the total 
mass and the total momentum of the system. They remain constant in time.

In addition to the laws of conservation of mass, energy and momentum, 
conservation laws must also hold for the angular momentum and for the mass 
centre motion of the system. They can be written in a form analogous to 
(31.01). It follows directly from that set of equations that 

$$\frac{\partial}{\partial x_0}(x_iT^{k0} - x_kT^{i0}) + \sum_{m=1}^{3}\frac{\partial}{\partial x_m}(x_iT^{km} - x_kT^{im}) = T^{ki} - T^{ik} \qquad (i, k = 0, 1, 2, 3) \tag{31.04}$$

and these relations have the form of the required additional conservation laws
if their right-hand sides vanish. Hence we get the condition

$$T^{ik} = T^{ki} \tag{31.05}$$

i.e. the tensor $T^{ik}$ must be symmetric. This means in particular that the mass
current must equal the momentum not only in the equations of mechanics,
but quite generally. We shall now state the relations (31.04) for a symmetric
tensor separately for k = 1, 2, 3 and for k = 0:

$$\frac{\partial}{\partial x_0}(x_iT^{k0} - x_kT^{i0}) + \sum_{m=1}^{3}\frac{\partial}{\partial x_m}(x_iT^{km} - x_kT^{im}) = 0 \tag{31.06}$$

and

$$\frac{\partial}{\partial x_0}(x_iT^{00} - x_0T^{i0}) + \sum_{m=1}^{3}\frac{\partial}{\partial x_m}(x_iT^{0m} - x_0T^{im}) = 0 \tag{31.07}$$

Here i, k = 1, 2, 3. The equations (31.06) can be considered to be the laws of
angular momentum conservation and the equations (31.07) as the laws of
motion of the mass-centre. In both cases these are the laws in differential
form; integration of the equations over some volume gives the corresponding
integral relations.

If the integrals

$$M^{ik} = c\int(x_iT^{k0} - x_kT^{i0})\,dV \tag{31.08}$$

and

$$K^i = \frac{1}{c}M^{i0} = \int(x_iT^{00} - x_0T^{i0})\,dV \tag{31.09}$$

are taken over all space they will remain constant; the quantities $M^{23}$, $M^{31}$
and $M^{12}$ are the components of angular momentum of the system and the
quantities $M^{10}$, $M^{20}$ and $M^{30}$ divided by c can be interpreted as the products
of the mass of the system and the initial coordinates of its mass centre.

The tensor we are discussing will be called the mass tensor.† Its invariant
has the dimensions of a mass density. When multiplied by $c^2$ the tensor $T^{ik}$
is also called the energy tensor; its invariant then has the dimensions of energy
density.

We shall impose on the mass tensor another condition which can be stated
in the form of a physical principle, as follows: The mass tensor must be a function
of the state of the system. We must define what we mean by state of a system.

Let us assume that the equations of motion and field equations are written
as a system of n first order equations for the unknown functions $\varphi_1, \ldots, \varphi_n$
and that these equations can be solved for the time derivatives of the functions.
If initial values are prescribed for the functions $\varphi_1, \ldots, \varphi_n$ this, together
with boundary or similar conditions, determines their values at any subsequent
time. This is the principle of causality. We shall say that the functions
$\varphi_1, \ldots, \varphi_n$ characterize the state of the system. Any function of $\varphi_1, \ldots, \varphi_n$
which does not contain their time derivatives and also does not contain the
coordinates explicitly will be called a function of the state. (We may note
that in the Theory of Relativity derivatives with respect to spatial coordinates
and to time enter symmetrically and therefore if some scalar, vector or tensor
does not contain any time derivatives it will also not contain any space
derivatives.)

† We prefer this name to the often used “ matter-tensor ”. The notion of “ matter ” is of a
very general character and should not be identified with the notion of mass.

Let us give some examples. In the mechanics of mass points a state is
characterized by the coordinates and velocities of the particles, therefore any
function of coordinates and velocities is a function of the state. In
hydrodynamics a state is characterized by the three components of velocity, the
density and the pressure, with the two last quantities related by some equation.
In the theory of the electromagnetic field a state is characterized by the
antisymmetric field tensor.

Thus our physical principle‡ requires that the components of the mass
tensor should be functions only of the quantities that characterize the system.
Derivatives of these quantities must not enter nor should the coordinates
appear explicitly. (We assume of course that cartesian coordinates are used.)

We now inquire to what extent the mass tensor is determined by the
conditions we have stated. We take first the equation expressing the fact that the
divergence of the mass tensor vanishes

$$\sum_{k=0}^{3}\frac{\partial T^{ik}}{\partial x_k} = 0 \tag{31.10}$$

together with the symmetry condition $T^{ik} = T^{ki}$. In addition we make the
condition that the ten integrals of these equations, i.e. W, $P^i$ and $M^{ik}$, should have
fixed values.

Let $A^{im,nk}$ be a tensor of fourth rank having the following symmetry
properties: antisymmetry with respect to the first pair of indices

$$A^{im,nk} = -A^{mi,nk} \tag{31.11}$$

antisymmetry with respect to the second pair

$$A^{im,nk} = -A^{im,kn} \tag{31.12}$$

and the cyclic symmetry

$$A^{im,nk} + A^{in,km} + A^{ik,mn} = 0 \tag{31.13}$$

From these properties it follows that

$$A^{im,nk} = A^{nk,im} \tag{31.14}$$

‡ It seems that this principle has never been explicitly formulated. However, it is
satisfied by all forms of the mass tensor in common use.

In terms of second derivatives of $A^{im,nk}$ we can construct the tensor

$$B^{ik} = \sum_{m,n=0}^{3}\frac{\partial^2A^{im,nk}}{\partial x_m\,\partial x_n} \tag{31.15}$$

which we shall call Krutkov’s tensor.† It is easy to see that it is symmetric
and that the antisymmetry of $A^{im,nk}$ in n and k results in the identity

$$\sum_{k=0}^{3}\frac{\partial B^{ik}}{\partial x_k} = 0 \tag{31.16}$$


In addition, Krutkov’s tensor has the following property. If the tensor $T^{ik}$
is replaced by $B^{ik}$ in the volume integrals of the form of M, $P^i$, $M^{ik}$ and $K^i$
(equations (31.02), (31.03), (31.08) and (31.09)) these all reduce to surface
integrals. The integrals formed with $B^{ik}$ will thus all vanish if the tensor
$A^{im,nk}$ and its first derivatives become zero on the surface enclosing the volume,
or, when the volume considered is all space, if they tend to zero sufficiently
fast at infinity.
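
The two properties of Krutkov's tensor that are described as easy to see can be written out in a few lines. The following is a sketch that uses only the symmetry properties (31.11), (31.12) and (31.14) together with the interchangeability of partial derivatives; all sums run from 0 to 3.

```latex
% Vanishing divergence: rename the summation indices n and k, then use (31.12).
\sum_{k}\frac{\partial B^{ik}}{\partial x_k}
  = \sum_{m,n,k}\frac{\partial^3 A^{im,nk}}{\partial x_m\,\partial x_n\,\partial x_k}
  = \sum_{m,n,k}\frac{\partial^3 A^{im,kn}}{\partial x_m\,\partial x_n\,\partial x_k}
  = -\sum_{m,n,k}\frac{\partial^3 A^{im,nk}}{\partial x_m\,\partial x_n\,\partial x_k}
  \quad\Longrightarrow\quad
  \sum_{k}\frac{\partial B^{ik}}{\partial x_k} = 0 .

% Symmetry: use (31.14), then (31.11) and (31.12), then rename m and n.
B^{ki}
  = \sum_{m,n}\frac{\partial^2 A^{km,ni}}{\partial x_m\,\partial x_n}
  = \sum_{m,n}\frac{\partial^2 A^{ni,km}}{\partial x_m\,\partial x_n}
  = \sum_{m,n}\frac{\partial^2 A^{in,mk}}{\partial x_m\,\partial x_n}
  = B^{ik} .
```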

Let $T^{ik}$ be a given mass tensor. Adding to it a Krutkov tensor we get a new
tensor

$$\overset{*}{T}{}^{ik} = T^{ik} + B^{ik} \tag{31.17}$$

which has the following properties. Firstly, it will be symmetric; secondly,
by (31.10) and (31.16) it will satisfy the equation

$$\sum_{k=0}^{3}\frac{\partial\overset{*}{T}{}^{ik}}{\partial x_k} = 0 \tag{31.18}$$

Thirdly, the integrals M, $P^i$, $M^{ik}$ and $K^i$ formed from $\overset{*}{T}{}^{ik}$ will be equal to
similar integrals formed with the given tensor $T^{ik}$.

Thus, apart from the condition that the mass tensor should be a function
of the state, the tensor $\overset{*}{T}{}^{ik}$ obtained by (31.17) from $T^{ik}$ will satisfy all
requirements. In other words, the mass tensor would not be determined uniquely
if no further condition were imposed.

However, the condition that the mass tensor should be a function of the
state completely changes the situation and ensures that the definition of the mass
tensor is unique in principle. The following circumstance can be considered an
indication of this. Let us assume that the tensor $T^{ik}$ is a function of the state,
so that it satisfies all the conditions imposed, including the requirement just
discussed. Even if the fourth rank tensor $A^{im,nk}$ is also a function of state, its
second derivatives will not possess the same property and addition of the
quantities $B^{ik}$ to $T^{ik}$ is not permissible. It is true that these considerations
cannot be considered a proof, but a full proof of our statement can be
constructed by other means. We shall discuss this in the following section.

The uniqueness of the mass tensor becomes particularly important in the 
theory of gravitation because the equations of this theory contain the mass 
tensor itself, not only its divergence. 

† In three dimensions a fourth rank tensor $A^{im,nk}$ with the properties given reduces to a
symmetric tensor of second rank with the components $\gamma_{11} = A^{23,23}$, $\gamma_{22} = A^{31,31}$, $\gamma_{33} = A^{12,12}$,
$\gamma_{23} = A^{31,12}$, $\gamma_{31} = A^{12,23}$, $\gamma_{12} = A^{23,31}$. The tensor $\gamma_{pq}$ was introduced and widely
used by G. Krutkov in his book [10].

When expressing equations of motion as the vanishing of four-dimensional
divergences we imply that the physical system under consideration is
conservative. A general remark on non-conservative systems should be made.

As the law of energy conservation is a universal one, a non-conservative
system should be regarded as one in which some forms of energy, such as heat,
have not been explicitly taken into account. If there is some form of energy
partaking in the general balance but not included in the tensor $T^{ik}$ the
divergence of this tensor will not vanish; the equations (31.01) will then have on
their right-hand sides the flux into unit volume of this form of energy, and of
the corresponding momentum. In the following we shall deal only with
conservative systems.

31*. A System of Equations for the Components of the Mass Tensor
as Functions of the Field †

Let us assume that the state of a physical system is described by the set of
functions $\varphi_1, \varphi_2, \ldots, \varphi_n$. This set of functions may be called a field and the
equations of motion relating these functions may be called field equations.
We have already given examples of physical fields (the fields of velocity, density
and pressure, the electromagnetic field, etc.).

The statement made in the previous section concerning the mass tensor can
be made more precise in the following manner. The symmetric tensor $T^{ik}$, the
components of which are functions of state (i.e. functions of the field), and whose
divergence vanishes as a consequence of the field equations, is determined by
these equations (which are purely local) apart from two constants. The tensor
$T^{ik}$ will be defined apart from a constant multiplying factor, provided local
conditions are supplemented by a condition at infinity. This latter usually
amounts to the requirement that the mass tensor should become zero where the
field is zero (i.e. where all or some of the functions $\varphi_s$ become zero). The
constant factor evidently depends on the choice of energy or mass units, and its
value is determined if these units are fixed.

Thus finally the mass tensor is uniquely determined for every physical 
system. 

The proof of this statement can be given in each individual case by considering
two systems of differential equations satisfied by the components of $T^{ik}$
considered as functions of $\varphi_1, \varphi_2, \ldots, \varphi_n$. The first of these systems expresses the
fact that $T^{ik}$ is a tensor and the second the fact that the divergence of this
tensor vanishes.

To derive the first system of equations, let us examine an infinitesimal 
Lorentz transformation. 

Equation (23.06) shows that in such a transformation the components of a 
vector receive the increment 


$$\delta A^i = \sum_{k=0}^{3}e_k\,\omega^{ik}A^k \tag{31*.01}$$

† This section and Section 49* have been added to the second Russian edition. To
preserve the numbering of the original edition, equations appearing in these two sections are
marked with asterisks. In the first English edition part of the present Section 31* was
included at the end of Section 31.

The corresponding relation for the increment of tensor components is

$$\delta T^{ik} = \sum_{m=0}^{3}e_m\,\omega^{im}T^{mk} + \sum_{m=0}^{3}e_m\,\omega^{km}T^{im} \tag{31*.02}$$

Since the functions of state $\varphi_s$ are also subject to a definite transformation law
(they are always expressed in terms of some scalars, vectors and tensors), they
will also obey relations of the form

$$\delta\varphi_s = \frac{1}{2}\sum_{l,m=0}^{3}e_le_m\,\omega^{lm}\,\Psi_s^{lm} \tag{31*.03}$$

or

$$\delta\varphi_s = \frac{1}{2}\sum_{l,m=0}^{3}\omega_{lm}\Psi_s^{lm} \tag{31*.04}$$

where

$$\Psi_s^{lm} = -\Psi_s^{ml} \tag{31*.05}$$

are some known functions of $\varphi_1, \varphi_2, \ldots, \varphi_n$.


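We consider certain general properties of the coefficients $\Psi_s^{lm}$; it can be
shown that they satisfy the relation

$$\sum_{r=1}^{n}\left(\Psi_r^{pq}\frac{\partial\Psi_s^{lm}}{\partial\varphi_r} - \Psi_r^{lm}\frac{\partial\Psi_s^{pq}}{\partial\varphi_r}\right) = e_p\delta_{mp}\Psi_s^{lq} + e_q\delta_{mq}\Psi_s^{pl} + e_p\delta_{lp}\Psi_s^{qm} + e_q\delta_{lq}\Psi_s^{mp} \tag{31*.06}$$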
To prove this we note first of all that this relation is functionally invariant in
the following sense. If in place of the functions $\varphi_1, \varphi_2, \ldots, \varphi_n$ we introduce
new functions of state $\overset{*}{\varphi}_1, \overset{*}{\varphi}_2, \ldots, \overset{*}{\varphi}_n$ (each set being expressible in terms of
the other) and if we also introduce corresponding coefficients $\overset{*}{\Psi}{}_s^{lm}$ related to the
$\Psi_s^{lm}$ by

$$\overset{*}{\Psi}{}_s^{lm} = \sum_{r=1}^{n}\frac{\partial\overset{*}{\varphi}_s}{\partial\varphi_r}\Psi_r^{lm} \tag{31*.07}$$

then equation (31*.06) will be satisfied for the starred quantities. On the other
hand relation (31*.06) is easily verified directly if the $\varphi_r$ form a scalar, a vector
or a tensor (or a set of scalars, vectors or tensors). Since ultimately all functions
of state must be expressed in terms of scalars, vectors and tensors it follows
from the functional invariance of (31*.06) that this relation is generally valid.
We shall not go into the detailed exposition of the manipulations involved in
this proof.

The foregoing equations can be written in a more elegant form by introducing
the operators $X^{lm}$ defined by the equations

$$X^{lm}(f) = \sum_{r=1}^{n} \psi_r^{lm}\,\frac{\partial f}{\partial\varphi_r} \tag{31*.08}$$
Then we have, in particular

$$X^{lm}(\varphi_s) = \psi_s^{lm} \tag{31*.09}$$

Using (31*.06) it is easy to prove the operator equation

$$X^{pq}X^{lm} - X^{lm}X^{pq} = e_p\delta_{mp}X^{lq} + e_q\delta_{mq}X^{pl} + e_p\delta_{lp}X^{qm} + e_q\delta_{lq}X^{mp} \tag{31*.10}$$

If this equation is applied to a function $\varphi_s$, relation (31*.06) follows
conversely from it.
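
As a numerical illustration of (31*.10), one may take the simplest non-trivial field, a single four-vector $A^i$. Comparing (31*.01) with (31*.04) suggests the coefficients $\psi_i^{lm} = e_i(\delta_{il}A^m - \delta_{im}A^l)$, so that each operator $X^{lm}$ acts on the components as a 4×4 matrix. The sketch below is not part of the original text; the function names are hypothetical and it checks only this one representation.

```python
# Signature signs in the book's convention: e_0 = 1, e_1 = e_2 = e_3 = -1.
E_SIGN = [1, -1, -1, -1]


def generator(l, m):
    """Matrix M^{lm} with X^{lm}(A^i) = sum_j M^{lm}[i][j] * A^j for a four-vector."""
    return [[E_SIGN[i] * (int(i == l and j == m) - int(i == m and j == l))
             for j in range(4)] for i in range(4)]


def matmul(a, b):
    """Product of two 4x4 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def combine(terms):
    """Sum of coefficient * matrix over a list of (coefficient, matrix) pairs."""
    return [[sum(c * mat[i][j] for c, mat in terms) for j in range(4)]
            for i in range(4)]


def commutator(p, q, l, m):
    """Matrix of X^{pq} X^{lm} - X^{lm} X^{pq} acting on the components A^i.

    X^{lm} is applied first, so its matrix stands on the left:
    X^{pq}(X^{lm}(A^i)) = (M^{lm} M^{pq} A)^i.
    """
    a, b = generator(p, q), generator(l, m)
    return combine([(1, matmul(b, a)), (-1, matmul(a, b))])


def right_hand_side(p, q, l, m):
    """Right-hand side of (31*.10) in the same matrix representation."""
    e = E_SIGN
    return combine([
        (e[p] * int(m == p), generator(l, q)),
        (e[q] * int(m == q), generator(p, l)),
        (e[p] * int(l == p), generator(q, m)),
        (e[q] * int(l == q), generator(m, p)),
    ])


def relation_holds():
    """True if (31*.10) holds for the vector representation, all index values."""
    idx = range(4)
    return all(commutator(p, q, l, m) == right_hand_side(p, q, l, m)
               for p in idx for q in idx for l in idx for m in idx)


print(relation_holds())
```

Integer arithmetic is used throughout, so the comparison is exact. The order of the matrix factors is reversed relative to the order of the operators because the $X^{lm}$ are first-order differential operators acting on functions of the $A^i$.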

The change of any field function (i.e. any function of the $\varphi_1,
\varphi_2, \ldots, \varphi_n$) in an infinitesimal Lorentz transformation can be
expressed in terms of the operators $X^{lm}$. Indeed, if $f = f(\varphi_1,
\ldots, \varphi_n)$ it follows from (31*.04) that

$$\delta f = \frac{1}{2}\sum_{l,m=0}^{3} \omega_{lm}\,X^{lm}(f) \tag{31*.11}$$

Putting $f = \varphi_s$ and using (31*.09) we return to equation (31*.04). The
formula (31*.11) is convenient because it is independent of the choice of
functions $\varphi_1, \ldots, \varphi_n$.

We apply equation (31*.11) to one of the components of $T^{ik}$. We have

$$\delta T^{ik} = \frac{1}{2}\sum_{l,m=0}^{3} \omega_{lm}\,X^{lm}(T^{ik}) \tag{31*.12}$$

On the other hand, since $T^{ik}$ is a tensor, equation (31*.02) holds; it can be
written in the form

$$\delta T^{ik} = \sum_{l,m=0}^{3} \omega_{lm}\left(e_i\delta_{il}T^{mk} + e_k\delta_{kl}T^{im}\right) \tag{31*.13}$$

Both expressions for $\delta T^{ik}$ must be equal identically, i.e. for any
choice of antisymmetric quantities $\omega_{lm}$. Equating the antisymmetric
part of the coefficients of the $\omega_{lm}$ in the two expressions we obtain

$$X^{lm}(T^{ik}) = e_i\delta_{il}T^{mk} - e_i\delta_{im}T^{lk} + e_k\delta_{kl}T^{im} - e_k\delta_{km}T^{il} \tag{31*.14}$$

or in more detail

$$\sum_{s=1}^{n} \psi_s^{lm}\,\frac{\partial T^{ik}}{\partial\varphi_s} = e_i\delta_{il}T^{mk} - e_i\delta_{im}T^{lk} + e_k\delta_{kl}T^{im} - e_k\delta_{km}T^{il} \tag{31*.15}$$

This is the first of the two aforementioned systems of differential equations for
the $T^{ik}$ as functions of the field.

The second set of equations is obtained from the condition that the divergence
of the mass tensor vanishes. It follows from this requirement that

$$\sum_{k=0}^{3}\sum_{s=1}^{n} \frac{\partial T^{ik}}{\partial\varphi_s}\,\frac{\partial\varphi_s}{\partial x_k} = 0 \tag{31*.16}$$

and this equation must hold for all values of the $\varphi_s$ and of the
$\partial\varphi_s/\partial x_k$ which are connected by the equations of motion
(and possibly by other relations, if not all these quantities are independent).
When the $\varphi_s$ and the $\partial\varphi_s/\partial x_k$ are expressed in
terms of independent quantities equation (31.27) must reduce to an identity in
these latter variables. Since the $T^{ik}$ contain only the functions
$\varphi_s$ themselves, but not their derivatives with respect to the $x_m$,
this identity leads to a set of relations for the $\partial T^{ik}/\partial\varphi_s$
which involves only the $\varphi_r$, not the $\partial\varphi_r/\partial x_m$.
These relations are just the second system of equations for the mass tensor. We
note that both systems are satisfied by the values

$$T^{ik} = \lambda\,e_i\delta_{ik} \tag{31*.17}$$

where $\lambda$ is a constant.

The complete system of equations for $T^{ik}$ consists of these two sub-systems:
equations (31*.15), which express the fact that the $T^{ik}$ form a tensor, and
further equations which state that the divergence of this tensor vanishes; the
latter are obtained from (31*.16) by eliminating the derivatives
$\partial\varphi_s/\partial x_k$ with the aid of the field equations. The
complete system can be discussed in a simple form in particular cases that
correspond to given physical systems. When this is done, the uniqueness of the
mass tensor is confirmed every time. Examples of uniqueness proofs are given in
Appendices B and C. Further examples can be found in the literature [11].


To conclude this section we shall show that if the mass density $T^{00}$ is known
as a function of the field $\varphi_1, \ldots, \varphi_n$, equations (31*.14) or
(31*.15) also permit the unique determination of all other components of the
mass tensor. Indeed, if we assume that $T^{00}$ is known and if in (31*.14) we
put $i = k = l = 0$; $m = 1, 2, 3$ we get

$$T^{0m} = T^{m0} = \frac{1}{2}\,X^{0m}(T^{00}) \tag{31*.18}$$

where the operator $X^{0m}$ has the value given by (31*.08). Thus, the mass flux
density has been expressed in terms of the mass density. Further, putting
$i = l = 0$; $k, m = 1, 2, 3$ we get

$$T^{mk} = X^{0m}(T^{0k}) - \delta_{mk}T^{00} \tag{31*.19}$$

or, using (31*.18),

$$T^{mk} = \left(\tfrac{1}{2}X^{0m}X^{0k} - \delta_{mk}\right)T^{00} \tag{31*.20}$$

This expression is in fact symmetric in $m$ and $k$. Indeed, we have the operator
relation

$$X^{0m}X^{0k} - X^{0k}X^{0m} = X^{mk} \qquad (m, k = 1, 2, 3) \tag{31*.21}$$

and from (31*.14) it follows that the result of applying this operator to
$T^{00}$ is zero (this last fact means that $T^{00}$ is a three-dimensional
scalar).

Thus in order to obtain all components of the mass tensor it is sufficient to 
know the component T 00 . 


32. Examples of the Mass Tensor 

In this section and the following we shall discuss the explicit form of the mass
tensor in some concrete cases, without dwelling on the proof of its uniqueness.
Examples of such proofs will be given in Appendices B and C.

We begin with the simplest case of “dustlike” matter, by which we mean
an assembly of non-interacting particles which nevertheless have a continuous
variation of velocity so that a velocity field can be defined. In this
case, we introduce the special notation $\Theta^{ik}$ for the mass tensor. We
denote by $\rho^*$ the invariant mass density, i.e. the density in that frame of
reference relative to which the particles of the particular volume element in
question are momentarily at rest. This frame might also be described as
travelling with the particles of this volume. Let $u^i$ be the components of the
four-dimensional velocity of the particles. We put

$$\Theta^{ik} = \frac{1}{c^2}\,\rho^* u^i u^k \tag{32.01}$$
By definition $\Theta^{ik}$ is a four-dimensional contravariant tensor. Its
zero-zero component is

$$\Theta^{00} = \frac{\rho^*}{1 - v^2/c^2} \tag{32.02}$$

This component must be equal to the density of total mass, including the
mass of the kinetic energy. We shall show that this is so: if $\rho^*$ is the
density of rest-mass in the frame moving with the particles, then the density of
rest-mass in the “laboratory frame” relative to which a particle moves with
velocity $v$, is

$$\rho = \frac{\rho^*}{\sqrt{1 - v^2/c^2}} \tag{32.03}$$

Further, if $\rho$ is the rest-mass density, the density of the total mass,
including kinetic energy, is

$$\frac{\rho}{\sqrt{1 - v^2/c^2}} = \frac{\rho^*}{1 - v^2/c^2} = \Theta^{00} \tag{32.04}$$

This expression for density corresponds to the usual expression

$$M = \frac{m}{\sqrt{1 - v^2/c^2}} \tag{32.05}$$

for the mass of a single particle. The other components of the mass tensor
are, in three-dimensional notation

$$\Theta^{0i} = \frac{1}{c}\,\frac{\rho^* v_i}{1 - v^2/c^2} = \frac{1}{c}\,\frac{\rho\,v_i}{\sqrt{1 - v^2/c^2}} \tag{32.06}$$

and

$$\Theta^{ik} = \frac{1}{c^2}\,\frac{\rho^* v_i v_k}{1 - v^2/c^2} = \frac{1}{c^2}\,\frac{\rho\,v_i v_k}{\sqrt{1 - v^2/c^2}} \tag{32.07}$$

Let us form the divergence of the tensor $\Theta^{ik}$. We have

$$\sum_{k=0}^{3}\frac{\partial\Theta^{ik}}{\partial x_k} = \frac{u^i}{c^2}\sum_{k=0}^{3}\frac{\partial(\rho^* u^k)}{\partial x_k} + \frac{\rho^*}{c^2}\sum_{k=0}^{3} u^k\frac{\partial u^i}{\partial x_k} \tag{32.08}$$

We introduce special symbols for the sums in (32.08), putting

$$Q^* = \sum_{k=0}^{3}\frac{\partial(\rho^* u^k)}{\partial x_k} \tag{32.09}$$

$$w^i = \sum_{k=0}^{3} u^k\frac{\partial u^i}{\partial x_k} \tag{32.10}$$

These can also be written as

$$Q^* = \frac{\partial\rho}{\partial t} + \operatorname{div}(\rho\mathbf{v}) \tag{32.11}$$

and

$$w^i = \frac{1}{\sqrt{1 - v^2/c^2}}\left(\frac{\partial u^i}{\partial t} + \sum_{k=1}^{3} v_k\frac{\partial u^i}{\partial x_k}\right) = \frac{1}{\sqrt{1 - v^2/c^2}}\,\frac{du^i}{dt} \tag{32.12}$$


where $d/dt$ is the substantial derivative. This shows that the invariant $Q^*$
is the rate of increase of rest mass in unit fluid volume† and the vector $w^i$
is the four-dimensional acceleration, the space components of which become the
ordinary acceleration in non-relativistic approximation. Thus

$$\sum_{k=0}^{3}\frac{\partial\Theta^{ik}}{\partial x_k} = \frac{Q^*}{c^2}\,u^i + \frac{\rho^*}{c^2}\,w^i \tag{32.13}$$


By the equation of motion this expression must be zero. But we have the
identities

$$\sum_{i=0}^{3} u_i u^i = c^2; \qquad \sum_{i=0}^{3} u_i w^i = 0 \tag{32.14}$$

so that the equation

$$Q^* u^i + \rho^* w^i = 0 \tag{32.15}$$

is equivalent to the two separate ones

$$Q^* = 0 \quad\text{and}\quad w^i = 0 \tag{32.16}$$

The first of these is the equation of continuity which expresses the constancy 
of the rest-mass of the particles. In the case we are considering the rest-mass 
does not change because the particles do not interact and their internal energy 
remains constant. The second equation of (32.16) expresses the constancy of 
the four-dimensional, and hence also of the three-dimensional, velocity. That 
non-interacting particles move with constant velocity is, of course, physically 
obvious. As the equations of motion are of first order in $\rho^*$ and $u^i$ and as $\Theta^{ik}$
is expressed as a function of these quantities this mass tensor is clearly a function 
of the state. Therefore the form of the mass tensor for dustlike matter may be 
considered established. 
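
The relations (32.01) to (32.06) and the first identity of (32.14) are easy to check numerically. The following sketch is an illustration only: the function names are hypothetical, and the numerical values (units with $c = 1$, a speed of one half) are assumptions chosen for convenience.

```python
import math


def four_velocity(v, c=1.0):
    """Contravariant u^i = gamma * (c, v_1, v_2, v_3) for a three-velocity v."""
    gamma = 1.0 / math.sqrt(1.0 - sum(x * x for x in v) / c**2)
    return [gamma * c] + [gamma * x for x in v]


def dust_mass_tensor(rho_star, v, c=1.0):
    """Theta^{ik} = rho* u^i u^k / c^2, equation (32.01)."""
    u = four_velocity(v, c)
    return [[rho_star * u[i] * u[k] / c**2 for k in range(4)] for i in range(4)]


# Illustrative values (assumed): units with c = 1 and |v| = 0.5.
rho_star, v, c = 2.0, (0.3, 0.4, 0.0), 1.0
theta = dust_mass_tensor(rho_star, v, c)
beta2 = sum(x * x for x in v) / c**2
rho_lab = rho_star / math.sqrt(1.0 - beta2)            # (32.03)
u = four_velocity(v, c)
norm = u[0]**2 - u[1]**2 - u[2]**2 - u[3]**2            # left side of (32.14)

print(theta[0][0], rho_star / (1.0 - beta2))            # both sides of (32.02)
print(theta[0][1], rho_lab * v[0] / (c * math.sqrt(1.0 - beta2)))  # (32.06)
print(norm, c**2)
```

Each printed pair should agree to rounding accuracy.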

Let us now seek to generalize the equations of perfect fluid hydrodynamics.
In non-relativistic theory this case is characterized by the stress tensor
reducing to a scalar. This readily permits a relativistic generalization. Let
us assume that the energy tensor, i.e. the mass tensor times $c^2$, has the form

$$c^2 T^{ik} = \left(\mu^* + \frac{p}{c^2}\right)u^i u^k - p\,e_k\delta_{ik} \tag{32.17}$$

where $\mu^*$ and $p$ are four-dimensional scalars connected by the relation

$$\mu^* = f(p) \tag{32.18}$$

In the frame relative to which the velocity at the point and at the moment
in question is zero, the component $T^{00}$ of the mass tensor is equal to
$\mu^*$. Therefore $\mu^*$ is the rest-mass density of the fluid. As the fluid is
assumed elastic and can possess compressional potential energy, this rest-mass
includes the mass corresponding to that energy. Since the compressional energy
may change, the rest-mass of a volume of fluid will not stay constant. With this
fact in mind we introduced a special symbol $\mu^*$ for this rest-mass density,
leaving the symbol $\rho^*$ to refer to that part of the rest-mass density which
does not change in the course of the motion.

† By “fluid volume” we mean the volume of an element of fluid always consisting
of the same particles.
We now state the equations of motion. Putting for brevity

$$Q = \sum_{k=0}^{3}\frac{\partial}{\partial x_k}\left[\left(\mu^* + \frac{p}{c^2}\right)u^k\right] \tag{32.19}$$

and using the notation $w^i$ for acceleration as before, we have

$$c^2\sum_{k=0}^{3}\frac{\partial T^{ik}}{\partial x_k} = Q\,u^i + \left(\mu^* + \frac{p}{c^2}\right)w^i - e_i\frac{\partial p}{\partial x_i} \tag{32.20}$$


In the absence of external forces this expression will be equal to zero. 

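Using equation (32.14) we can get from (32.20) another expression for $Q$,
namely

$$Q = \frac{1}{c^2}\sum_{k=0}^{3} u^k\frac{\partial p}{\partial x_k} = \frac{1}{c^2\sqrt{1 - v^2/c^2}}\,\frac{dp}{dt} = \frac{1}{c^2}\,\frac{dp}{d\tau} \tag{32.21}$$

Here $dp/dt$ is the substantial derivative of $p$ and $d\tau$ the differential of
the particle proper time. As for $p$ itself, a comparison of these equations of
motion with the non-relativistic ones shows readily that it is the pressure.
Equating the two expressions for $Q$ we obtain

$$\sum_{k=0}^{3} u^k\frac{\partial\mu^*}{\partial x_k} + \left(\mu^* + \frac{p}{c^2}\right)\sum_{k=0}^{3}\frac{\partial u^k}{\partial x_k} = 0 \tag{32.22}$$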


As we assume $\mu^*$ to be a function of $p$ we can introduce a new quantity
$\rho^*$ by the differential relation

$$\frac{d\rho^*}{\rho^*} = \frac{d\mu^*}{\mu^* + p/c^2} \tag{32.23}$$

We can choose the constant of integration in such a way, say, as to make
$\rho^* = \mu^*$ when $p = 0$. Using (32.23) we can write (32.22) as

$$\sum_{k=0}^{3}\frac{\partial}{\partial x_k}\left(\rho^* u^k\right) = 0 \tag{32.24}$$

Hence we see that $\rho^*$ can be interpreted as the invariant density of that
part of the rest-mass which does not change in the motion. This quantity, like
$\mu^*$, is a function of $p$.

Let us put

$$\mu^* = \rho^*\left(1 + \frac{\Pi}{c^2}\right) \tag{32.25}$$

Then we get from the differential relation between $\rho^*$ and $\mu^*$ that

$$d\Pi = \frac{p\,d\rho^*}{\rho^{*2}} \tag{32.26}$$

whence

$$\Pi = \int_0^p \frac{dp}{\rho^*} - \frac{p}{\rho^*} \tag{32.27}$$
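
As a check on (32.23), (32.26) and (32.27), one may work them through for a particular equation of state. The sketch below assumes a polytropic law $p = K\rho^{*\Gamma}$, which is not taken from the text; the constants `K`, `GAMMA` and `C` are arbitrary illustrative values. For this law (32.27) integrates to $\Pi = K\rho^{*\,\Gamma-1}/(\Gamma - 1)$, and the differential relations are then verified by finite differences.

```python
import math

# Assumed polytropic law p = K * rho*^GAMMA; K, GAMMA and C are illustrative.
K, GAMMA, C = 1.0, 5.0 / 3.0, 10.0


def pressure(rho_star):
    """Assumed equation of state p(rho*)."""
    return K * rho_star**GAMMA


def potential_energy(rho_star):
    """Pi from (32.27) for the polytrope: integral_0^p dp/rho* - p/rho*."""
    return K * rho_star**(GAMMA - 1.0) / (GAMMA - 1.0)


def total_density(rho_star):
    """mu* from (32.25)."""
    return rho_star * (1.0 + potential_energy(rho_star) / C**2)


def derivative(func, x, h=1e-6):
    """Central finite-difference approximation to func'(x)."""
    return (func(x + h) - func(x - h)) / (2.0 * h)


rho_star = 2.0
p = pressure(rho_star)

# (32.26): dPi/drho* = p / rho*^2
lhs_26 = derivative(potential_energy, rho_star)
rhs_26 = p / rho_star**2

# (32.23): dmu*/drho* = (mu* + p/c^2) / rho*
lhs_23 = derivative(total_density, rho_star)
rhs_23 = (total_density(rho_star) + p / C**2) / rho_star

print(lhs_26, rhs_26)
print(lhs_23, rhs_23)
```

The two numbers on each printed line should agree to the accuracy of the finite difference.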


The quantity $\Pi$ can be interpreted in analogy with (30.11) as the potential
energy of a unit mass of fluid, with mass meaning that part of the rest-mass
which does not vary during the motion. Expressing $\mu^*$ in terms of $\rho^*$
and $\Pi$ we can write the energy tensor as

$$c^2 T^{ik} = \left[\rho^* + \frac{1}{c^2}\left(\rho^*\Pi + p\right)\right]u^i u^k - p\,e_k\delta_{ik} \tag{32.28}$$

while the equations of motion appear as

$$\left[\rho^* + \frac{1}{c^2}\left(\rho^*\Pi + p\right)\right]w^i = e_i\frac{\partial p}{\partial x_i} - \frac{1}{c^2}\,\frac{dp}{d\tau}\,u^i \tag{32.29}$$


The equation for the zero component of acceleration can be replaced by the 
equation of continuity in the form (32.24). 

One may note that the invariant of the mass tensor is

$$\sum_{i=0}^{3} e_i T^{ii} = \mu^* - \frac{3p}{c^2} \tag{32.30}$$


Comparing the equations of motion (32.29) with equation (32.28) for T ik one 
sees that the mass tensor T ik is a function of the state, as it should be. 
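
Two properties of the perfect-fluid tensor used above can be verified numerically: that $T^{00}$ reduces to $\mu^*$ in the rest frame, and that the invariant $\sum_i e_i T^{ii}$ equals $\mu^* - 3p/c^2$ whatever the velocity (this value follows from (32.17) together with the first identity of (32.14)). The sketch below is illustrative only; the function names and the numerical values are assumptions.

```python
import math

E_SIGN = [1.0, -1.0, -1.0, -1.0]   # e_0 = 1, e_1 = e_2 = e_3 = -1


def fluid_mass_tensor(mu_star, p, v, c=1.0):
    """T^{ik} from (32.17): c^2 T^{ik} = (mu* + p/c^2) u^i u^k - p e_k delta_ik."""
    gamma = 1.0 / math.sqrt(1.0 - sum(x * x for x in v) / c**2)
    u = [gamma * c] + [gamma * x for x in v]
    return [[((mu_star + p / c**2) * u[i] * u[k]
              - (p * E_SIGN[k] if i == k else 0.0)) / c**2
             for k in range(4)] for i in range(4)]


def invariant(tensor):
    """Sum over i of e_i T^{ii}."""
    return sum(E_SIGN[i] * tensor[i][i] for i in range(4))


# Illustrative values (assumed), in units with c = 1.
mu_star, p, c = 1.5, 0.2, 1.0
moving = fluid_mass_tensor(mu_star, p, (0.1, 0.2, 0.3), c)
at_rest = fluid_mass_tensor(mu_star, p, (0.0, 0.0, 0.0), c)

print(invariant(moving), mu_star - 3.0 * p / c**2)
print(at_rest[0][0], mu_star)
```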

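We now write down the components of the mass tensor in non-relativistic
approximation, but retaining in $T^{00}$ and $T^{0i}$ terms of an order of
$1/c^2$ compared with the main terms. In the main terms we put

$$\rho^* = \rho\,\sqrt{1 - \frac{v^2}{c^2}} \tag{32.31}$$

in agreement with (32.03). Here $\rho$ is the usual density that satisfies the
equation of continuity (30.02). We then have

$$T^{00} = \rho + \frac{1}{c^2}\left(\tfrac{1}{2}\rho v^2 + \rho\Pi\right)$$

$$T^{0i} = \frac{1}{c}\,\rho v_i + \frac{1}{c^3}\,v_i\left(\tfrac{1}{2}\rho v^2 + \rho\Pi + p\right) \tag{32.32}$$

$$T^{ik} = \frac{1}{c^2}\left(\rho v_i v_k + p\,\delta_{ik}\right)$$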


Comparing the correction terms in $T^{00}$ and $T^{0i}$ with Umov’s scalar and
vector (30.17) and (30.18) we see that the components $T^{00}$ and $T^{0i}$ of
the mass tensor can be written as

$$T^{00} = \rho + \frac{1}{c^2}\,S; \qquad T^{0i} = \frac{1}{c}\,\rho v_i + \frac{1}{c^3}\,S_i \tag{32.33}$$

Thus Umov’s scalar and vector give the second order relativistic corrections
to the usual expressions for mass density and momentum density. As for the
spatial components of $T^{ik}$, they are proportional to the three-dimensional
tensor of momentum current density which was considered in Section 30.
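
The quality of the non-relativistic approximation can be seen by comparing the exact components from (32.28) with the approximate ones from (32.32) for a slow flow. The sketch below is illustrative: the function names are hypothetical and the numerical values (units with $c = 1$, a speed of $10^{-3}$, and correspondingly small $\Pi$ and $p$) are assumptions.

```python
import math


def exact_components(rho, v, Pi, p, c):
    """T^{00} and T^{0i} from (32.28) with rho* = rho * sqrt(1 - v^2/c^2)."""
    beta2 = sum(x * x for x in v) / c**2
    gamma = 1.0 / math.sqrt(1.0 - beta2)
    rho_star = rho / gamma
    bracket = rho_star + (rho_star * Pi + p) / c**2
    t00 = bracket * gamma**2 - p / c**2
    t0i = [bracket * gamma**2 * x / c for x in v]
    return t00, t0i


def approximate_components(rho, v, Pi, p, c):
    """The first two lines of (32.32)."""
    v2 = sum(x * x for x in v)
    t00 = rho + (0.5 * rho * v2 + rho * Pi) / c**2
    t0i = [rho * x / c + x * (0.5 * rho * v2 + rho * Pi + p) / c**3 for x in v]
    return t00, t0i


# Illustrative slow-flow values (assumed), in units with c = 1.
rho, v, Pi, p, c = 1.0, (6e-4, 8e-4, 0.0), 1e-6, 1e-6, 1.0
t00_exact, t0i_exact = exact_components(rho, v, Pi, p, c)
t00_approx, t0i_approx = approximate_components(rho, v, Pi, p, c)

rel_err_00 = abs(t00_exact - t00_approx) / t00_exact
rel_err_01 = abs(t0i_exact[0] - t0i_approx[0]) / abs(t0i_exact[0])
print(rel_err_00, rel_err_01)
```

With these values the correction terms are of relative order $10^{-6}$, while the remaining discrepancy is of the next order in $1/c^2$.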

Using these results we can also write down approximate expressions for the
mass tensor for the case of an elastic body. We have

$$T^{00} = \rho + \frac{1}{c^2}\left(\tfrac{1}{2}\rho v^2 + \rho\Pi\right)$$

$$T^{0i} = \frac{1}{c}\,\rho v_i + \frac{1}{c^3}\left\{v_i\left(\tfrac{1}{2}\rho v^2 + \rho\Pi\right) - \sum_{k=1}^{3} p_{ik}v_k\right\} \tag{32.34}$$

$$T^{ik} = \frac{1}{c^2}\left(\rho v_i v_k - p_{ik}\right)$$

If here we put $p_{ik} = -p\,\delta_{ik}$ as in (30.09), we get back to the case
of a perfect fluid.

We shall not discuss the relativistic formulation of the theory of elasticity. 
We shall only make one remark concerning the possibility of using the concept 
of an absolutely rigid body in Relativity. In non-relativistic mechanics this 
concept is introduced as an abstraction, according to which the shape and 
size of such a body does not change, whatever the forces acting on it. In 
particular, a blow imparted at some instant to one end of an absolutely rigid 
body immediately sets the other end of the body in motion. In fact, in a 
physical solid body the blow produces a wave in the body that spreads with the 
speed of sound. Therefore the abstraction implies the assumption that the 
speed of sound may be treated as infinitely great.† However, it is evident
that if the speed of sound is assumed infinitely great, the speed of light, which 
is actually some hundred thousand times greater, must also be taken as infinite. 
It is thus clear that the abstraction involved in defining an absolutely rigid 
body is permissible only in a non-relativistic theory. As the Theory of Relativity 
is based on the fact that any kind of action propagates with finite speed, this 
abstraction must inevitably lead to contradictions. Therefore the notion of 
an absolutely rigid body may not be used in Relativity Theory. 

However, this does not preclude the use of the notion of a rigid measuring 
rod in discussions of Relativity. For this notion merely presupposes the existence 
of rigid bodies whose shape and size remain unchanged under certain particular 
external conditions such as the absence of accelerations or impulses, constancy 
of temperature, etc. Such rigid rods can be realized with sufficient accuracy 
by solid bodies existing in nature and they can serve as standards of length. 
As we showed in Section 2 the constancy of their length can be checked, say 
by comparison with the wave length of some spectral line. 

33. The Energy Tensor of the Electromagnetic Field 

In the preceding sections we discussed the mass tensor, and the energy 
tensor which is proportional to it, for a substance consisting of particles
interacting by means of elastic forces. We now consider the energy tensor for a
substance of particles interacting only by means of the electromagnetic field. 
As we are dealing with macroscopic theory we can imagine the substance to 
be a continuous medium with a continuous charge distribution. 

We start from Maxwell’s equations in the form given by Lorentz. We put 
for the components of the electric and of the magnetic field 

$$E_1 = F_{10}; \quad E_2 = F_{20}; \quad E_3 = F_{30}$$

$$H_1 = F_{23}; \quad H_2 = F_{31}; \quad H_3 = F_{12} \tag{33.01}$$

† One reaches the same conclusion if one considers the absolutely rigid body as
the limiting case of an elastic body in which the elastic moduli are infinite.


As we saw in Section 24, the four-dimensional statement of Maxwell’s equations
is

$$F_{ikl} \equiv \frac{\partial F_{ik}}{\partial x_l} + \frac{\partial F_{kl}}{\partial x_i} + \frac{\partial F_{li}}{\partial x_k} = 0 \tag{33.02}$$

and

$$\sum_{k=0}^{3}\frac{\partial F^{ik}}{\partial x_k} = s^i \tag{33.03}$$

where the

$$F^{ik} = e_i e_k F_{ik} \tag{33.04}$$

are the contravariant components of the antisymmetric field tensor and where
the four-dimensional vector $s^i$ has the components

$$s^0 = 4\pi\rho; \qquad s^i = \frac{4\pi}{c}\,j_i = \frac{4\pi}{c}\,\rho v_i \qquad (i = 1, 2, 3) \tag{33.05}$$

Here $\rho$ denotes the charge density, $v_i$ the three-dimensional velocity and
$j_i = \rho v_i$ the electric current density. Introducing the invariant charge
density $\rho^*$ and the four-dimensional velocity $u^i$, we may replace (33.05)
by

$$s^i = \frac{4\pi}{c}\,\rho^* u^i \qquad (i = 0, 1, 2, 3) \tag{33.06}$$


We saw in Section 25 that the equations of motion of a particle of charge $e$
and rest-mass $m$ are of the form

$$m\,w_i = -\frac{e}{c}\sum_{k=0}^{3} F_{ik}u^k \tag{33.07}$$

Here we have on the right the Lorentz force acting on the charge $e$. To
pass from a single particle to a substance with continuous charge and mass
distribution we must introduce the invariant charge density $\rho^*$ in place of
the charge and the invariant mass density $\mu^*$ in place of the mass. We then
get

$$\mu^* w_i = -\frac{\rho^*}{c}\sum_{k=0}^{3} F_{ik}u^k \tag{33.08}$$

or

$$\mu^* w_i = -\frac{1}{4\pi}\sum_{k=0}^{3} F_{ik}s^k \tag{33.09}$$


As we have introduced two new functions, $\rho^*$ and $\mu^*$, which are not
necessarily proportional, we must adjoin to the equations of motion two further
equations for these functions. The equation for $\rho^*$ is already implied by
Maxwell’s equations and is an expression of the law of charge conservation

$$\sum_{k=0}^{3}\frac{\partial(\rho^* u^k)}{\partial x_k} = 0 \tag{33.10}$$

The equation for $\mu^*$ must be stated separately. We assume that the rest-mass
of the particles does not change in the motion (this assumption implies that no
Joule heat is developed). Then we have

$$Q^* \equiv \sum_{k=0}^{3}\frac{\partial(\mu^* u^k)}{\partial x_k} = 0 \tag{33.11}$$


Of the equations of motion (33.09) only three are independent, because we
have the identity

$$\mu^*\sum_{i=0}^{3} u^i w_i = -\frac{\rho^*}{c}\sum_{i,k=0}^{3} F_{ik}u^i u^k = 0 \tag{33.12}$$

Thus the equations of motion (33.09) together with the equation of rest-mass
conservation represent four independent equations. Our problem is to write
them in the form of the divergence of some tensor. To do this we first use
equation (32.13) according to which

$$\mu^* w^i + Q^* u^i = c^2\sum_{k=0}^{3}\frac{\partial\Theta^{ik}}{\partial x_k} \tag{33.13}$$

where

$$\Theta^{ik} = \frac{1}{c^2}\,\mu^* u^i u^k \tag{33.14}$$


(Note that now the invariant rest-mass is denoted by $\mu^*$ and not by $\rho^*$
as in (32.01).) On the other hand the equations (33.09) and (33.11) can be
combined into the four independent ones

$$\mu^* w_i + Q^* u_i = -\frac{1}{4\pi}\sum_{k=0}^{3} F_{ik}s^k \tag{33.15}$$

which are equivalent to (33.09) and (33.11), as a consequence of (33.12). The
left-hand sides of the last set of equations, or rather their contravariant
forms, have already been expressed as the divergence of a tensor. It remains
therefore to do the same with the right-hand sides.

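For this purpose let us consider the tensor

$$U_{ik} = -\frac{1}{4\pi}\sum_{m=0}^{3} e_m F_{im}F_{km} + \frac{1}{16\pi}\,e_i\delta_{ik}\sum_{m,n=0}^{3} F_{mn}F^{mn} \tag{33.16}$$

and denote by $f_i$ the negative of its divergence. We have then

$$f_i = -\sum_{k=0}^{3} e_k\frac{\partial U_{ik}}{\partial x_k} \tag{33.17}$$

Performing the differentiations, some simple calculation gives

$$f_i = -\frac{1}{4\pi}\sum_{k,l=0}^{3} F_{ik}\frac{\partial F^{kl}}{\partial x_l} - \frac{1}{8\pi}\sum_{k,l=0}^{3} F^{kl}F_{ikl} \tag{33.18}$$

where the $F_{ikl}$ are the quantities defined in (33.02). By virtue of Maxwell's
equations (33.02) and (33.03) we find

$$f_i = -\frac{1}{4\pi}\sum_{k=0}^{3} F_{ik}s^k \tag{33.19}$$

which is the Lorentz force as it appeared on the right-hand side of (33.15).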
Thus both members of (33.15) have been represented in the form of divergences.
Passing to contravariant components we can write the equations

$$\mu^* w^i + Q^* u^i - f^i = 0 \tag{33.20}$$

as

$$\sum_{k=0}^{3}\frac{\partial T^{ik}}{\partial x_k} = 0 \tag{33.21}$$

where

$$c^2 T^{ik} = c^2\Theta^{ik} + U^{ik} \tag{33.22}$$

is the energy tensor of the whole system, comprising the material medium and
the field. In the last equation the first term on the right is the energy tensor
of the medium and the second that of the field.

The tensor $T^{ik}$ involves the invariant density $\mu^*$, the velocity
components $u^i$ and the field components $F_{ik}$. Since these quantities
characterize the state of the system, this mass tensor is evidently a function
of the state.

We now examine the field energy tensor in a little more detail. 

In three-dimensional notation the components of the energy tensor of the
electromagnetic field have the following form

$$U^{00} = U_{00} = \frac{1}{8\pi}\left(E^2 + H^2\right) \tag{33.23}$$

$$U^{0i} = -U_{0i} = \frac{1}{4\pi}\,[\mathbf{E}\times\mathbf{H}]_i \tag{33.24}$$

and

$$U^{ik} = U_{ik} = -\frac{1}{4\pi}\left(E_i E_k + H_i H_k\right) + \frac{1}{8\pi}\,\delta_{ik}\left(E^2 + H^2\right) \tag{33.25}$$

We note that the invariant of the energy tensor vanishes:

$$\sum_{i=0}^{3} e_i U_{ii} = 0 \tag{33.26}$$
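
The passage from the four-dimensional definition (33.16) to the three-dimensional expressions (33.23) to (33.25) can be checked numerically for arbitrary field values. The sketch below is an illustration only: the function names are hypothetical and the field values are arbitrary. It builds $F_{ik}$ from $\mathbf{E}$ and $\mathbf{H}$ using the standard assignment $E_i = F_{i0}$, $H_1 = F_{23}$, $H_2 = F_{31}$, $H_3 = F_{12}$, evaluates $U_{ik}$ both ways, and also forms the invariant.

```python
import math

E_SIGN = [1.0, -1.0, -1.0, -1.0]   # e_0 = 1, e_1 = e_2 = e_3 = -1


def field_tensor(E, H):
    """Covariant F_ik with E_i = F_i0, H_1 = F_23, H_2 = F_31, H_3 = F_12."""
    F = [[0.0] * 4 for _ in range(4)]
    for i in range(3):
        F[i + 1][0], F[0][i + 1] = E[i], -E[i]
    for (a, b), h in zip(((2, 3), (3, 1), (1, 2)), H):
        F[a][b], F[b][a] = h, -h
    return F


def energy_tensor(F):
    """Covariant U_ik from the four-dimensional definition (33.16)."""
    inv = sum(E_SIGN[m] * E_SIGN[n] * F[m][n] ** 2
              for m in range(4) for n in range(4))          # sum of F_mn F^mn
    U = [[0.0] * 4 for _ in range(4)]
    for i in range(4):
        for k in range(4):
            U[i][k] = -sum(E_SIGN[m] * F[i][m] * F[k][m]
                           for m in range(4)) / (4 * math.pi)
            if i == k:
                U[i][k] += E_SIGN[i] * inv / (16 * math.pi)
    return U


def three_dimensional_form(E, H):
    """Covariant U_ik written out as in (33.23)-(33.25)."""
    e2, h2 = sum(x * x for x in E), sum(x * x for x in H)
    cross = [E[1] * H[2] - E[2] * H[1],
             E[2] * H[0] - E[0] * H[2],
             E[0] * H[1] - E[1] * H[0]]
    U = [[0.0] * 4 for _ in range(4)]
    U[0][0] = (e2 + h2) / (8 * math.pi)
    for i in range(3):
        U[0][i + 1] = U[i + 1][0] = -cross[i] / (4 * math.pi)
        for k in range(3):
            U[i + 1][k + 1] = -(E[i] * E[k] + H[i] * H[k]) / (4 * math.pi)
            if i == k:
                U[i + 1][k + 1] += (e2 + h2) / (8 * math.pi)
    return U


# Arbitrary illustrative field values.
E, H = (1.0, 2.0, 3.0), (-1.0, 0.5, 2.0)
U4 = energy_tensor(field_tensor(E, H))
U3 = three_dimensional_form(E, H)
max_diff = max(abs(U4[i][k] - U3[i][k]) for i in range(4) for k in range(4))
trace = sum(E_SIGN[i] * U4[i][i] for i in range(4))
print(max_diff, trace)
```

Both printed numbers should vanish to rounding accuracy.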




As always the component $U^{00}$ represents the energy density, and the
components $U^{0i}$ multiplied by $c$ the energy current vector, i.e. Umov’s
vector. It can be written as

$$\mathbf{S} = \frac{c}{4\pi}\,[\mathbf{E}\times\mathbf{H}] \tag{33.27}$$

and is usually called the Umov-Poynting vector, or simply Poynting’s vector, 
because Poynting was the first to give the explicit expression for the energy 
current density of the electromagnetic field resulting from Maxwell’s equations. 

Quite generally, the energy current divided by $c^2$ is equal to the
corresponding mass current and the mass current is in turn equal to the momentum.
Therefore the Umov-Poynting vector divided by $c^2$ is equal to the
electromagnetic momentum density. The fact that the electromagnetic field carries
momentum gives rise to the forces of radiation pressure first discovered
experimentally in 1900 by P. N. Lebedev [13]. Equations (33.25) show that if a
body is situated in an electromagnetic field a surface element $d\sigma$ normal
to the $x_1$-axis is acted on by a force with the components

$$F_k\,d\sigma = U_{1k}\,d\sigma \qquad (k = 1, 2, 3) \tag{33.28}$$
We consider a totally reflecting body, part of whose boundary surface is
flat. Let the equation of this flat part be $x_1 = 0$, with the body extending
into the region $x_1 > 0$. We assume that a plane wave of the form

$$\begin{aligned} E_1 &= -\sin\theta\,f & H_1 &= \sin\theta\,g \\ E_2 &= \cos\theta\,f & H_2 &= -\cos\theta\,g \\ E_3 &= g & H_3 &= f \end{aligned} \tag{33.29}$$

impinges on the body, where

$$f = f\!\left(t - \frac{1}{c}\,(x_1\cos\theta + x_2\sin\theta)\right); \qquad g = g\!\left(t - \frac{1}{c}\,(x_1\cos\theta + x_2\sin\theta)\right) \tag{33.30}$$

We add to the field (33.29) the reflected wave which is obtained by replacing
$\theta$ by $\pi - \theta$ and $g$ by $-g$ in the preceding equations and so get
the complete field, satisfying all boundary conditions. The values of the field
components on the surface of the body will be

$$\begin{aligned} E_1 &= -2\sin\theta\,f & H_1 &= 0 \\ E_2 &= 0 & H_2 &= -2\cos\theta\,g \\ E_3 &= 0 & H_3 &= 2f \end{aligned} \tag{33.31}$$

where the functions $f$ and $g$ are taken at $x_1 = 0$. Inserting these values in
(33.25) we obtain for the force acting on unit surface area of the body

$$F_1 = U_{11} = \frac{1}{2\pi}\left(f^2 + g^2\right)\cos^2\theta; \qquad F_2 = U_{12} = 0; \qquad F_3 = U_{13} = 0 \tag{33.32}$$

Hence, whatever the direction of the incident wave, the force acts in the 
direction of the inward normal to the surface and thus is a normal pressure 
(see [12]). 
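
The result (33.32) can be reproduced numerically by adding the incident and reflected waves at the surface and inserting the total field in (33.25). The sketch below is an illustration under stated assumptions: the function names are hypothetical, the values of $\theta$, $f$ and $g$ are arbitrary, and the third components of the incident wave are taken as $E_3 = g$, $H_3 = f$, which is what the surface values (33.31) imply.

```python
import math


def plane_wave(theta, f, g):
    """Field (33.29) for given values of f and g at one point and instant."""
    E = (-math.sin(theta) * f, math.cos(theta) * f, g)
    H = (math.sin(theta) * g, -math.cos(theta) * g, f)
    return E, H


def surface_field(theta, f, g):
    """Incident plus reflected wave at x_1 = 0 (theta -> pi - theta, g -> -g)."""
    E_in, H_in = plane_wave(theta, f, g)
    E_re, H_re = plane_wave(math.pi - theta, f, -g)
    E = tuple(a + b for a, b in zip(E_in, E_re))
    H = tuple(a + b for a, b in zip(H_in, H_re))
    return E, H


def stress(E, H, i, k):
    """Space component U_ik from (33.25), with indices running over 1..3."""
    value = -(E[i - 1] * E[k - 1] + H[i - 1] * H[k - 1]) / (4 * math.pi)
    if i == k:
        value += (sum(x * x for x in E) + sum(x * x for x in H)) / (8 * math.pi)
    return value


# Arbitrary illustrative values of the angle and of f, g at the surface.
theta, f, g = 0.7, 1.3, -0.4
E, H = surface_field(theta, f, g)
normal_force = stress(E, H, 1, 1)
expected = (f * f + g * g) * math.cos(theta) ** 2 / (2 * math.pi)
tangential = (stress(E, H, 1, 2), stress(E, H, 1, 3))
print(normal_force, expected)
print(tangential)
```

The normal component should agree with (33.32) and the tangential components should vanish to rounding accuracy, for any choice of the angle.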

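Let us now consider another case. Let the electromagnetic field consist of
a superposition of waves, uniformly distributed over all directions. Then the
statistical mean of the products of field components will be

$$\overline{E_i E_k} = \frac{1}{3}\,\overline{E^2}\,\delta_{ik}; \qquad \overline{H_i H_k} = \frac{1}{3}\,\overline{H^2}\,\delta_{ik} \tag{33.33}$$

The space components of the energy tensor will then be

$$\overline{U_{ik}} = \frac{1}{24\pi}\left(\overline{E^2} + \overline{H^2}\right)\delta_{ik} = \frac{1}{3}\,\overline{U_{00}}\,\delta_{ik} \tag{33.34}$$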


In this case the electromagnetic field produces an isotropic pressure equal in 
magnitude to one-third of the electromagnetic energy density. This result is 
of great importance in the theory of black body radiation. 


34. Mass and Energy 

According to the terminology in use in mechanics the mass of a body is a 
measure of its inertia (inertial mass). On the other hand the word “ mass ” is 
also used to mean the ability of a body to produce a gravitational field and to 
experience forces in such a field (gravitational mass). Inertia and the property 
of producing a gravitational field are entirely distinct manifestations of the
nature of matter. It is, however, no accident that the measures of these two
distinct manifestations carry the same name. The reason is that both properties
are always present together and are always proportional to each other so that
with a suitable choice of units one and the same number can be used to measure
them both. The equality of inertial and gravitational mass is an experimental
fact which has been verified with enormous accuracy by Eötvös, but the
definitions of the two concepts are quite different, corresponding as they do to
quite different manifestations of the properties of matter. There exists a
physical theory, namely Einstein’s Theory of Gravitation, in which the
fundamental law of the equality of inertial and gravitational mass is accounted
for automatically in the sense that one and the same constant appearing in the
solution of Einstein’s equations figures as inertial and as gravitational mass.

How should one answer the question: are inertial and gravitational mass
the same thing or not? Their manifestations are different but their numerical
characteristics are proportional. Such a state of affairs is customarily
described by the word “equivalence”.

An analogous question arises in connection with the concepts of mass and 
energy. (For definiteness we shall discuss the inertial mass.) We have just 
recalled the definition of mass. Energy is usually defined as the ability to do 
work. For its definition the essentials are, first, the law of conservation of energy 
and, second, the ability of different forms of energy to transform into each other. 
Both together are known as the law of conservation and transmutation of 
energy. Because this universal law is valid, a measurement of any kind of 
energy can be reduced to a measurement of one particular kind, e.g. mechanical 
energy, and one and the same unit, e.g. a mechanical one, can be used to 
measure all kinds of energy. 

Thus the manifest properties of matter corresponding to mass and to energy 
are undoubtedly different. However, the Theory of Relativity asserts that 
mass and energy are indissolubly connected and are proportional. Any change 
of energy of a system is accompanied by a change of inertial mass. This is 
true not only for the kinetic energy of a body in a change in which rest-mass 
remains unaltered, but also for changes of all kinds of internal energy in which 
the rest-mass also changes. 

Phenomena are known in physics in which the whole of the energy corresponding
to the rest-mass of a body can transmute into radiative energy, which,
of course, possesses the same mass. Conversely rest-mass energy may be created
at the expense of radiative energy. An example of this is the conversion of an
electron-positron pair into a gamma quantum and the reverse phenomenon
of the creation of such a pair by a quantum.

In the preceding sections we have studied in detail the relation between
mass and energy. We saw that to any energy $W$ one should ascribe a mass
$M = W/c^2$ and to every mass one should ascribe an energy $W = Mc^2$. The
two quantities are always proportional and if they are both expressed in the
same units, say those of energy, they may be measured by the same number.

Further, we saw that the energy tensor differs only by the factor c 2 from 
the mass tensor and that the law of conservation of energy is at the same time 
the law of conservation of mass. 
Thus to the question posed above whether mass and energy are the same
thing or not we can give the same answer as to the question of the relation of
inertial and gravitational mass. The properties of matter manifest as mass
and as energy are different but their numerical characteristics are proportional.
Thus in this case we also can talk of equivalence, the equivalence of mass and
energy. But it is better to call this fundamental law simply the law of
proportionality of mass and energy.

We have just stated that the law of energy conservation is at the same 
time the law of mass conservation. A question arises immediately. Experiment 
shows that for an enormous majority of known physical processes the mass of 
a body, determined by weighing, and its energy, determined by work done, are 
conserved separately. One observes two conservation laws. How does this 
relate to the fact that in the Theory of Relativity only one law is formulated ? 

This question can be answered as follows. There is but one rigorous
conservation law, which applies to the entire mass of a body $M$ and to the
corresponding entire energy $W = Mc^2$. But an overwhelmingly large part of the
energy (and of the corresponding rest-mass) does not usually take part in
transformations and is conserved separately. It follows then that the remaining
part of the energy, which is active in transformations, is also conserved
separately.

The division of energy into a “passive” part which does not undergo any
transformation in a process and an active part which is capable of changing
into different forms can be followed through in the examples of the
hydrodynamical equations discussed in Section 32. There, by use of the frame
travelling with the fluid particle in question we separated out the kinetic
energy and then considered two rest-mass densities, the total density $\mu^*$
and the density $\rho^*$ of the passive part. The two quantities were connected
by (32.25) which gives

$$\mu^* = \rho^* + \frac{1}{c^2}\,\rho^*\Pi \tag{34.01}$$

where the second term on the right is the density of compressional potential
energy divided by $c^2$.

The division is still more clearly apparent in the approximate equations
(32.32) and (32.34) which give the total mass density $T^{00}$ and the total mass
current density $cT^{0i}$. By (32.33) these equations can be stated as

$$T^{00} = \rho + \frac{1}{c^2}\,S; \qquad cT^{0i} = \rho v_i + \frac{1}{c^2}\,S_i \tag{34.02}$$

Here $\rho$ and $\rho v_i$ are the density and current density of that part of the
mass which does not take part in any transformation while Umov’s scalar and
vector represent the density and current density of the active part of the
energy, or, when divided by $c^2$, of the active part of the mass.

We spoke above of the fact that the total mass of a body as determined by 
weighing, including the variable part, is practically conserved although the 
body may emit or absorb energy. This is explained simply by insufficient 
accuracy of weighing and by the fact that in ordinary bodies a predominant 
part of the mass is passive. A change in the active part, on the other hand, 
can be detected with much greater accuracy by measuring the corresponding
part of the energy, i.e. by calorimetric methods rather than by weighing. 
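
The orders of magnitude involved can be made concrete with the relation $M = W/c^2$. The sketch below is an illustration only; the example of heating one kilogram of water by 100 K, and the rounded specific heat of about 4186 J/(kg K), are assumptions introduced here and do not come from the text.

```python
C = 299_792_458.0  # speed of light in m/s (exact in SI)


def mass_equivalent(energy_joules):
    """M = W / c^2, in kilograms."""
    return energy_joules / C**2


def energy_equivalent(mass_kg):
    """W = M c^2, in joules."""
    return mass_kg * C**2


# Assumed example: 1 kg of water heated by 100 K, with a specific heat of
# roughly 4186 J/(kg K) (a rounded textbook value, used only for illustration).
heat = 1.0 * 4186.0 * 100.0
delta_m = mass_equivalent(heat)
print(f"heat supplied: {heat:.3e} J, mass change: {delta_m:.2e} kg")
```

A calorimeter registers the heat supplied without difficulty, whereas the accompanying change of mass is only a few parts in $10^{12}$ of a kilogram, far below what weighing can resolve.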

It is natural to inquire into the deeper reason for the fact that in normal 
conditions the predominant part of the energy is bound so durably as to be in a 
completely passive state. Why does even a negligible part of it never leave 
this state and destroy the separate balance of the active part ? The Theory of 
Relativity by itself is unable to answer this question. One should look for the 
answer in the domain of quantal laws, which have as one of their main features 
the existence of stable states with discrete energy levels. With elementary 
particles the energy corresponding to their rest-mass can only transform into 
the active form of radiation as a whole, or not at all. A small loss of mass is 
impossible. This has been verified experimentally in the electron-positron 
case but is also believed to be true for all other elementary particles. As the 
predominant part of the mass of atoms has the form of the mass of elementary 
particles the impossibility of a small change of mass must also hold for atoms. 

Thus the reason for the particular stability with which the passive part 
of the energy is bound is of quantal character. 

It is worth noting that the division of energy, or mass, into an active and 
a passive part is of a relative nature. In ordinary chemical reactions not only 
intranuclear energy but also the energy of the inner electronic shells behaves 
passively. At very high temperatures, when a complete or nearly complete 
ionization of atoms becomes possible, the energy of the inner electron shells 
becomes active. Finally, in processes involving rearrangements within atomic 
nuclei the intranuclear energy becomes active as well. However, even then 
the energy corresponding to the rest-masses of the heavy elementary particles 
that form the nucleus remains in a passive state. 

The particularly durable binding of the predominant part of all energy, or 
mass, is the reason why one can speak of the laws of conservation of mass and 
of energy as two separate laws, although in Relativity the two laws coalesce 
into one. 

The law of mass conservation in chemical reactions was discovered and 
experimentally proved by Lomonosov and then confirmed by Lavoisier. As 
for the law of energy conservation, its precise formulation was only given in 
the nineteenth century by R. Mayer. However, Huygens in the seventeenth 
century and Johann and, especially, Daniel Bernoulli in the eighteenth century 
already used it in mechanics and Lomonosov was aware of its general character, 
as can be seen from his famous letter to Euler written in 1748.

GENERAL TENSOR ANALYSIS


35. Permissible Transformations for Space and Time Coordinates 

As the basis of our mathematical formulation of Relativity Theory we
chose the wave front equation

$$(\nabla\omega)^2 \equiv \frac{1}{c^2}\left(\frac{\partial\omega}{\partial t}\right)^2 - \left[\left(\frac{\partial\omega}{\partial x}\right)^2 + \left(\frac{\partial\omega}{\partial y}\right)^2 + \left(\frac{\partial\omega}{\partial z}\right)^2\right] = 0 \tag{35.01}$$

and the related expression for the square of the interval

$$ds^2 = c^2\,dt^2 - (dx^2 + dy^2 + dz^2) \tag{35.02}$$

($(\nabla\omega)^2$ is to be understood as an abbreviation for the differential expression
on the left-hand side of the wave front equation.) If we introduce our usual
variables


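$$x_0 = ct ; \quad x_1 = x ; \quad x_2 = y ; \quad x_3 = z \tag{35.03}$$

and also the numbers

$$e_0 = 1 ; \quad e_1 = e_2 = e_3 = -1 \tag{35.04}$$

the expressions $(\nabla\omega)^2$ and $ds^2$ can be written as

$$(\nabla\omega)^2 = \sum_{k=0}^{3} e_k\left(\frac{\partial\omega}{\partial x_k}\right)^2 \tag{35.05}$$

and

$$ds^2 = \sum_{k=0}^{3} e_k\,(dx_k)^2 \tag{35.06}$$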


We know that both these expressions are invariant under Lorentz
transformations. If new coordinates

$$x'_0 = ct' ; \quad x'_1 = x' ; \quad x'_2 = y' ; \quad x'_3 = z' \tag{35.07}$$

are introduced which are connected with the previous ones by a Lorentz
transformation, we get

$$(\nabla\omega)^2 = \sum_{k=0}^{3} e_k\left(\frac{\partial\omega}{\partial x'_k}\right)^2 \tag{35.08}$$

and

$$ds^2 = \sum_{k=0}^{3} e_k\,(dx'_k)^2 \tag{35.09}$$

Variables such as (35.03) or (35.07) in which $(\nabla\omega)^2$ and $ds^2$ have the forms
(35.05) and (35.06) or (35.08) and (35.09) will be called Galilean coordinates,
this term now being understood to include the time.

We now assume that while $x'_0$, $x'_1$, $x'_2$ and $x'_3$ are given by (35.07) as before,
and so are Galilean coordinates, the quantities $x_0$, $x_1$, $x_2$ and $x_3$ are no longer
equal to (35.03) but instead are some auxiliary variables connected to $x'_0$, $x'_1$,
$x'_2$ and $x'_3$ by relations of the form

$$x'_\alpha = f_\alpha(x_0, x_1, x_2, x_3) \qquad (\alpha = 0, 1, 2, 3) \tag{35.10}$$

where the $f_\alpha$ are arbitrary functions subject only to some general conditions.
We shall assume that the equations (35.10) can be solved for $x_0$, $x_1$, $x_2$ and $x_3$
so that their Jacobian must be non-zero

$$D = \frac{D(x'_0, x'_1, x'_2, x'_3)}{D(x_0, x_1, x_2, x_3)} \neq 0 \tag{35.11}$$

Further we suppose that the functions $f_\alpha$ have continuous derivatives of the
first three orders. There will be other conditions on the $f_\alpha$ which arise from
physical considerations; these will be examined later.

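If this change of variables is made $(\nabla\omega)^2$ becomes a homogeneous quadratic
form in the first derivatives with respect to the variables $x_0$, $x_1$, $x_2$ and $x_3$.
We write this form as

$$(\nabla\omega)^2 = \sum_{\alpha,\beta=0}^{3} g^{\alpha\beta}\,\frac{\partial\omega}{\partial x_\alpha}\frac{\partial\omega}{\partial x_\beta} \tag{35.12}$$

where

$$g^{\alpha\beta} = \sum_{k=0}^{3} e_k\,\frac{\partial x_\alpha}{\partial x'_k}\frac{\partial x_\beta}{\partial x'_k} \tag{35.13}$$

Similarly, if the change of variables is made in $ds^2$ the result is

$$ds^2 = \sum_{\alpha,\beta=0}^{3} g_{\alpha\beta}\,dx_\alpha\,dx_\beta \tag{35.14}$$

with

$$g_{\alpha\beta} = \sum_{k=0}^{3} e_k\,\frac{\partial x'_k}{\partial x_\alpha}\frac{\partial x'_k}{\partial x_\beta} \tag{35.15}$$

It is readily deduced from (35.13) and (35.15) that

$$\sum_{\alpha=0}^{3} g_{\mu\alpha}\,g^{\alpha\nu} = \delta^\nu_\mu = \begin{cases} 1 & \text{if } \nu = \mu \\ 0 & \text{if } \nu \neq \mu \end{cases} \tag{35.16}$$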

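Hence if $g$ is defined as the determinant

$$g = \operatorname{Det} g_{\alpha\beta} \tag{35.17}$$

the quantities $g^{\alpha\beta}$ will be the first minors of this determinant divided by $g$ itself.
Using the rule of determinant multiplication we get

$$g = \operatorname{Det}\left(e_k\,\frac{\partial x'_k}{\partial x_\alpha}\right)\cdot\operatorname{Det}\left(\frac{\partial x'_k}{\partial x_\beta}\right) \tag{35.18}$$

Here the second factor is equal to the Jacobian $D$ of (35.11) and since

$$e_0 e_1 e_2 e_3 = -1 \tag{35.19}$$

the first factor is $-D$. Consequently

$$g = -D^2 \tag{35.20}$$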
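The relation $g = -D^2$ can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes an arbitrary illustrative Jacobian matrix `JAC` (hypothetical numbers, with `JAC[k][a]` standing for $\partial x'_k/\partial x_\alpha$ at one point), builds the coefficients of (35.15) from it and compares the two determinants.

```python
# Sketch: check g = -D^2 for coefficients g_ab built from a Jacobian as in (35.15).
E = [1.0, -1.0, -1.0, -1.0]  # the numbers e_k of (35.04)


def det(m):
    """Determinant by Laplace expansion along the first row (adequate for 4x4)."""
    n = len(m)
    if n == 1:
        return m[0][0]
    total = 0.0
    for col in range(n):
        minor = [row[:col] + row[col + 1:] for row in m[1:]]
        total += (-1) ** col * m[0][col] * det(minor)
    return total


def metric_from_jacobian(jac):
    """g_ab = sum_k e_k * jac[k][a] * jac[k][b], where jac[k][a] = dx'_k / dx_a."""
    return [[sum(E[k] * jac[k][a] * jac[k][b] for k in range(4))
             for b in range(4)] for a in range(4)]


# Hypothetical Jacobian values at a single point (illustrative only).
JAC = [
    [1.0, 0.2, 0.0, 0.1],
    [0.3, 1.0, 0.4, 0.0],
    [0.0, 0.1, 1.0, 0.2],
    [0.1, 0.0, 0.3, 1.0],
]

D = det(JAC)                          # the Jacobian D of (35.11)
g = det(metric_from_jacobian(JAC))    # the determinant g of (35.17)
print(D, g, -D * D)                   # the last two numbers should agree
```

The agreement holds for any non-singular matrix in place of `JAC`, since only the multiplication rule for determinants and $e_0 e_1 e_2 e_3 = -1$ are used.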

It is useful to restrict the choice of variables $x_0$, $x_1$, $x_2$ and $x_3$ by conditions
which ensure that $x_0$, like $x'_0$, is of the nature of a time whereas $x_1$, $x_2$ and $x_3$
are of the nature of spatial coordinates. These conditions must be formulated
precisely. As before, we mean by the term “event” a spatial point considered
at a particular moment in time; it may be called a “point-instant”. We
demand that two events having the same values of the spatial coordinate
parameters $x_1$, $x_2$ and $x_3$ but different values $x^*$ and $x^{**}$ for their time
parameters shall be in time sequence in the sense of Section 12. We know that for
events in time sequence the squared interval is positive. This must be true in
particular for an infinitesimal interval, so if we take the difference $x^{**} - x^*$
to be infinitesimal and put

$$x^* = x_0 ; \qquad x^{**} = x_0 + dx_0 \tag{35.21}$$

we must have

$$ds^2 = g_{00}\,dx_0^2 > 0 \tag{35.22}$$

whence

$$g_{00} > 0 \tag{35.23}$$

Assume further that we have two events with the same time parameter $x_0$
but different values of the spatial parameters, namely $(x_1, x_2, x_3)$ and
$(x_1 + dx_1, x_2 + dx_2, x_3 + dx_3)$. We require that two such events shall be
quasi-simultaneous in the sense of Section 12. For quasi-simultaneous events $ds^2$
is negative, therefore we must have

$$ds^2 = \sum_{i,k=1}^{3} g_{ik}\,dx_i\,dx_k < 0 \tag{35.24}$$

whatever the values of $dx_1$, $dx_2$ and $dx_3$, provided not all three are zero. It
follows that the quadratic form (35.24) must be negative-definite. It is a well
known algebraic fact that the necessary and sufficient conditions for this are
the inequalities

$$\begin{vmatrix} g_{11} & g_{12} & g_{13} \\ g_{21} & g_{22} & g_{23} \\ g_{31} & g_{32} & g_{33} \end{vmatrix} < 0 \tag{35.25}$$

$$\begin{vmatrix} g_{11} & g_{12} \\ g_{21} & g_{22} \end{vmatrix} > 0 ; \quad \begin{vmatrix} g_{22} & g_{23} \\ g_{32} & g_{33} \end{vmatrix} > 0 ; \quad \begin{vmatrix} g_{11} & g_{13} \\ g_{31} & g_{33} \end{vmatrix} > 0 \tag{35.26}$$

$$g_{11} < 0 ; \quad g_{22} < 0 ; \quad g_{33} < 0 \tag{35.27}$$

which, incidentally, are not all independent. An independent set of
conditions is, for instance, (35.25), the first inequality of (35.26) and the first of
(35.27).

It is not difficult to show that if all these conditions are imposed on the
coefficients $g_{\alpha\beta}$ then, regardless of whether they are given by (35.15) or not,
it is possible to represent $ds^2$ in the neighbourhood of any point as the sum of
four squared terms, one with a positive and the remaining three with negative
signs. The set of signs of the terms is called the signature of the quadratic
form. In our case the signature can be written as $(e_0, e_1, e_2, e_3)$ or as $(+\,-\,-\,-)$.

It follows from the inequalities (35.25) to (35.27) that the determinant $g$
is always negative and also that similar inequalities involving the $g^{\alpha\beta}$ with
upper indices hold, consequently we have

$$g^{00} > 0 \tag{35.28}$$

and

$$\sum_{i,k=1}^{3} g^{ik}\,\omega_i\,\omega_k < 0 \tag{35.29}$$

where $\omega_1$, $\omega_2$ and $\omega_3$ are any three numbers, not all zero. We shall not give the
proofs of these purely algebraic statements.

Thus, in order that the parameter $x_0$ should have the character of time and
the other three, $x_1$, $x_2$ and $x_3$, the character of spatial coordinates, it is necessary
and sufficient that $g_{00}$ should be positive and that the quadratic form with
the coefficients $g_{ik}$ $(i, k = 1, 2, 3)$ should be negative-definite. There is no need
to impose any restriction on the quantities $g_{10}$, $g_{20}$ and $g_{30}$.

Let us now consider the geometrical meaning of the equations $x_0 = \text{const.}$
and $x_k = \text{const.}$ We shall derive a condition under which the equation

$$\omega(x, y, z, t) = 0 \tag{35.30}$$

can be interpreted as the equation of a surface in motion. It follows from this
equation that the differentials of space and time coordinates are related by

$$\omega_x\,dx + \omega_y\,dy + \omega_z\,dz + \omega_t\,dt = 0 \tag{35.31}$$

where $\omega_x$, $\omega_y$, $\omega_z$ and $\omega_t$ denote the derivatives of $\omega$ with respect to $x$, $y$, $z$
and $t$. We take a displacement $(dx, dy, dz)$ in the direction of the normal to the
surface and put


$$dx = \frac{\omega_x}{|\operatorname{grad}\omega|}\,dn ; \quad dy = \frac{\omega_y}{|\operatorname{grad}\omega|}\,dn ; \quad dz = \frac{\omega_z}{|\operatorname{grad}\omega|}\,dn \tag{35.32}$$

so that $|dn|$ is the absolute value of the displacement. Inserting into (35.31)
we get

$$|\operatorname{grad}\omega|\,dn + \omega_t\,dt = 0 \tag{35.33}$$

and therefore the square of the displacement velocity

$$v^2 = \left(\frac{dn}{dt}\right)^2 \tag{35.34}$$

will be given by

$$v^2 = \frac{\omega_t^2}{(\operatorname{grad}\omega)^2} \tag{35.35}$$

Thus (35.30) can be interpreted as the equation of a surface, each point of
which moves normally with a speed given by (35.35). However, such an
interpretation is only possible as long as this speed does not exceed that of light.
According to (35.35) and (35.01) this means that we must have

$$(\nabla\omega)^2 \le 0 \tag{35.36}$$

The equality sign is valid for motion with the speed of light.

On the other hand, if

$$(\nabla\omega)^2 > 0 \tag{35.37}$$

equation (35.30) can be solved for the time and written in the form

$$t = \frac{1}{c}\,f(x, y, z) \tag{35.38}$$

with

$$(\operatorname{grad} f)^2 < 1 \tag{35.39}$$

Equation (35.38) assigns to every point in space a definite instant of time in
such a way that all the four-dimensional “point-instants” are
quasi-simultaneous. Such an equation may be called a “time-equation”. We recall
that time equations occurred in Section 3 in connection with the question
of the characteristics of Maxwell’s equations.

As we remarked in Section 3, an equation $\omega = 0$ can be considered as the
equation of a hypersurface in the four-dimensional space-time manifold. Such
hypersurfaces can then be divided into two classes.

If $(\nabla\omega)^2 < 0$ we can say that one of the dimensions of the hypersurface is
time-like (the inaccurate phrase “the surface is time-like” is sometimes used).
By (35.35) this describes an ordinary two-dimensional surface† moving with a
velocity less than that of light.

If $(\nabla\omega)^2 > 0$, on the other hand, we say that the hypersurface is space-like.
It then represents the whole of infinite space, the various points of which are
all taken at different instants of time, the time $t$ at which the point $(x, y, z)$
is taken being determined by the time equation, i.e. the equation of the
hypersurface; the instants of time assigned to any two points in space must be so
close that the corresponding four-dimensional interval is always space-like.

We use the fact that $(\nabla\omega)^2$ is an invariant and in turn put $\omega = x_0$, $\omega = x_1$,
$\omega = x_2$ and $\omega = x_3$. This gives

$$(\nabla x_0)^2 = g^{00} > 0 \tag{35.40}$$

and

$$(\nabla x_1)^2 = g^{11} < 0 ; \quad (\nabla x_2)^2 = g^{22} < 0 ; \quad (\nabla x_3)^2 = g^{33} < 0 \tag{35.41}$$

Hence the equation $x_0 = \text{const.}$ is a time equation and the three equations
$x_k = \text{const.}$ $(k = 1, 2, 3)$ represent surfaces moving in the direction of their
normals with less than light velocity. These latter are thus equations of moving
spatial coordinate surfaces.

It follows also from our conditions on the transformations of space and
time coordinates that constant values of $x_1$, $x_2$ and $x_3$ correspond, in any
inertial frame of reference, to motion of a point with less than light velocity.

In classical Newtonian mechanics one often uses a time dependent
coordinate transformation which is interpreted as passing to a moving frame of
reference. In comparing coordinate transformations in Newtonian mechanics
with the transformations of space and time coordinates in the Theory of
Relativity it is essential to realize the following. Firstly, in the general case of
accelerated motion the very notion of a frame of reference in Newtonian
mechanics is not the same as in Relativity. The Newtonian concept involves the
idea of an absolutely rigid body and the instantaneous propagation of light.
In Relativity, on the other hand, the notion of a rigid body is used, if at all,
not in an absolute sense but only for non-accelerated motions and in the absence
of external forces, and is of an auxiliary nature; the concept of a frame of
reference is not based on it but on the law of wave front propagation. The
prototype of a Newtonian frame of reference is a rigid scaffolding, the prototype
of a Relativistic one is the radar station. Secondly, the class of transformations
permissible in Newtonian mechanics is much wider than in the Theory of
Relativity; Newtonian mechanics does not have to consider the limitations,
discussed above, which arise from the existence of a limiting speed.

† In the four-dimensional manifold a hypersurface has three dimensions but in the
present case only two of those are spatial.

As an example we consider a transformation which can be interpreted in
Newtonian mechanics as going over to a uniformly accelerated frame. Let
$x'$, $y'$, $z'$ and $t'$ be the space and time coordinates in an inertial frame, i.e.
Galilean coordinates. We put

$$x' = x - \tfrac{1}{2}\,a t^2 ; \quad y' = y ; \quad z' = z \tag{35.42}$$

and also

$$t' = t - \frac{a}{c^2}\,t x \tag{35.43}$$

The variables $x$, $y$, $z$ and $t$ can be interpreted as space and time coordinates
in a certain accelerated frame (in the Newtonian sense and in the corresponding
approximation). Inserting (35.42) and (35.43) into the expression for $ds^2$ we get

$$ds^2 = (c^2 - 2ax - a^2 t^2)\,dt^2 - dx^2 - dy^2 - dz^2 + \frac{a^2}{c^2}\,(x\,dt + t\,dx)^2 \tag{35.44}$$


The required inequalities for the coefficients will hold if the conditions

$$1 - \frac{a^2 t^2}{c^2} > 0 ; \qquad \left(1 - \frac{ax}{c^2}\right)^2 - \frac{a^2 t^2}{c^2} > 0 \tag{35.45}$$

are satisfied.

In addition we can require that

$$1 - \frac{ax}{c^2} > 0 \tag{35.46}$$

These inequalities show that the substitutions (35.42), (35.43) are permissible
only in a part of space and only for a limited length of time.
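The step from the substitution to the transformed interval can be written out explicitly. The following worked sketch is an addition, not part of the original text; it assumes that (35.42) and (35.43) read $x' = x - \tfrac12 a t^2$, $y' = y$, $z' = z$ and $t' = t - a t x/c^2$, which is the reading that reproduces (35.44) term by term.

```latex
\begin{align*}
dx' &= dx - a t\,dt, \qquad
dt' = dt - \frac{a}{c^2}\,(x\,dt + t\,dx),\\[4pt]
c^2\,dt'^2 &= c^2\,dt^2 - 2ax\,dt^2 - 2at\,dx\,dt
             + \frac{a^2}{c^2}\,(x\,dt + t\,dx)^2,\\[4pt]
dx'^2 &= dx^2 - 2at\,dx\,dt + a^2 t^2\,dt^2,\\[4pt]
ds^2 &= c^2\,dt'^2 - dx'^2 - dy^2 - dz^2\\
     &= (c^2 - 2ax - a^2 t^2)\,dt^2 - dx^2 - dy^2 - dz^2
        + \frac{a^2}{c^2}\,(x\,dt + t\,dx)^2 .
\end{align*}
```

The mixed terms $\mp 2at\,dx\,dt$ cancel. Collecting the remaining terms, the coefficient of $dt^2$ is $c^2\left[(1 - ax/c^2)^2 - a^2t^2/c^2\right]$ and the coefficient of $dx^2$ is $-(1 - a^2t^2/c^2)$, so the requirements $g_{00} > 0$ and $g_{11} < 0$ restrict both $x$ and $t$.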

Another example is the transformation corresponding to the introduction of
a uniformly rotating frame. We put

$$\begin{aligned} x' &= x\cos\omega t + y\sin\omega t ; & z' &= z \\ y' &= -x\sin\omega t + y\cos\omega t ; & t' &= t \end{aligned} \tag{35.47}$$

and obtain

$$ds^2 = [c^2 - \omega^2(x^2 + y^2)]\,dt^2 - 2\omega\,(y\,dx - x\,dy)\,dt - dx^2 - dy^2 - dz^2 \tag{35.48}$$

The conditions on the coefficients require

$$c^2 - \omega^2(x^2 + y^2) > 0 \tag{35.49}$$

which is satisfied only for distances from the axis of rotation less than that
where the linear velocity of the rotation equals the speed of light.
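The conditions of this section can be tried out on the rotating frame. The sketch below is an addition and rests on stated assumptions: the coefficients are read off from the interval $ds^2 = [c^2 - \omega^2(x^2 + y^2)]\,dt^2 - 2\omega(y\,dx - x\,dy)\,dt - dx^2 - dy^2 - dz^2$ with $x_0 = ct$, and the function names `rotating_metric` and `is_permissible` as well as the angular velocity `OMEGA` are hypothetical choices made for illustration.

```python
# Sketch: test g_00 > 0 and negative-definiteness of g_ik for the rotating frame.
C = 299_792_458.0  # speed of light in m/s


def rotating_metric(x, y, omega):
    """Coefficients g_ab in the variables (x0 = c*t, x, y, z) for the rotating frame."""
    g00 = 1.0 - omega ** 2 * (x * x + y * y) / C ** 2
    g01 = -omega * y / C   # from -2*omega*y*dx*dt with dt = dx0 / c
    g02 = omega * x / C    # from +2*omega*x*dy*dt with dt = dx0 / c
    return [[g00, g01, g02, 0.0],
            [g01, -1.0, 0.0, 0.0],
            [g02, 0.0, -1.0, 0.0],
            [0.0, 0.0, 0.0, -1.0]]


def det3(m):
    """Determinant of a 3x3 matrix."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def is_permissible(g):
    """g_00 > 0 together with an independent set of the spatial inequalities:
    g_11 < 0, the first 2x2 minor > 0 and the 3x3 determinant < 0."""
    s = [row[1:] for row in g[1:]]                      # spatial block g_ik
    minor2 = s[0][0] * s[1][1] - s[0][1] * s[1][0]
    return g[0][0] > 0 and s[0][0] < 0 and minor2 > 0 and det3(s) < 0


OMEGA = 100.0  # rad/s, illustrative; c / OMEGA is roughly 3.0e6 m
print(is_permissible(rotating_metric(1.0e6, 0.0, OMEGA)))  # inside the limiting radius
print(is_permissible(rotating_metric(4.0e6, 0.0, OMEGA)))  # outside the limiting radius
```

The spatial block is the same everywhere in this example, so the outcome is decided by the sign of $g_{00}$ alone, in agreement with (35.49).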

We stress once again that the examples given here have physical sense only 
in a region in which Newtonian mechanics is applicable (see also Section 61). 

It is obvious that the introduction of ordinary curvilinear spatial coordinates 
is always an allowed transformation. As long as the transformations do not 
involve time they have the same geometrical meaning as in non-relativistic 
theory. Therefore we refrain from discussing them. 


36. General Tensor Analysis and Generalized Geometry 

In the previous section we considered the expressions

$$(\nabla\omega)^2 = \sum_{\alpha,\beta=0}^{3} g^{\alpha\beta}\,\frac{\partial\omega}{\partial x_\alpha}\frac{\partial\omega}{\partial x_\beta} \tag{36.01}$$

and

$$ds^2 = \sum_{\alpha,\beta=0}^{3} g_{\alpha\beta}\,dx_\alpha\,dx_\beta \tag{36.02}$$

which were obtained from the usual expressions of Relativity Theory by
introducing variables $x_1$, $x_2$, $x_3$ and $x_0$ in place of the space and time coordinates
$x$, $y$, $z$ and $t$. We established the conditions subject to which the variable $x_0$
can characterize a sequence of events in time and the variables $x_1$, $x_2$ and $x_3$
their location in space.

By itself, the introduction of new variables can naturally not influence the
physical consequences of the theory; it is merely a mathematical device.
However, the development of a formalism which permits the statement of equations
of mathematical physics (such as equations of motion and field equations)
directly in terms of arbitrary variables without the use of cartesian spatial
coordinates and time, is not only useful as a device for convenient
computation but is also important in principle. The existence of such a formalism can
show the way to generalize physical theories.

We shall call equations generally covariant, if they are valid for any arbitrary 
choice of independent variables. The formalism that allows one to write down 
generally covariant tensor equations will be called “ general tensor analysis. ” 
Generally covariant equations are already used in Newtonian mechanics. 
We refer to Lagrange’s equations (of the second kind) which describe the motion 
of a system of mass points in generalized coordinates and also their generalization 
for continuous media. While they state nothing physically new as compared 
to equations in cartesian coordinates, Lagrange’s equations nevertheless play 
an important part both in practical applications and in theoretical
investigations. In the Theory of Relativity general tensor analysis has a similar purpose.

In general tensor analysis the starting point is the pair of equations (36.01)
and (36.02) giving the square of the four-dimensional gradient and the square
of the interval. One says that these expressions characterize the metric of
space-time. The coefficients $g^{\alpha\beta}$ and $g_{\alpha\beta}$ entering the equations are thought of as
functions of the variables $x_0$, $x_1$, $x_2$ and $x_3$.

We have so far assumed that the expressions (36.01) and (36.02) are derived
from (35.01) and (35.02), or from (35.08) and (35.09), by introduction of new
variables, so that the coefficients $g^{\alpha\beta}$ and $g_{\alpha\beta}$ can be represented in the forms
(35.13) and (35.15). In this case the ten coefficients $g_{\alpha\beta}$ can be expressed in
terms of the four functions $f_0$, $f_1$, $f_2$ and $f_3$ as follows:

$$g_{\alpha\beta} = \sum_{k=0}^{3} e_k\,\frac{\partial f_k}{\partial x_\alpha}\frac{\partial f_k}{\partial x_\beta} \tag{36.03}$$

By virtue of (35.16) the $g^{\alpha\beta}$ can be expressed in terms of the same four functions.

However, it is important to note that the equations of general tensor analysis
are hardly any more complicated if it is not assumed that the $g_{\alpha\beta}$ can be
represented in the form (36.03) but that instead they are taken simply as given
functions of the coordinates, i.e. of the variables $x_0$, $x_1$, $x_2$ and $x_3$. This more
general point of view corresponds to the introduction of non-Euclidean geometry
and a non-Euclidean metric in space-time. Such a step takes one beyond the
limits of ordinary (so-called “Special”) Relativity and is connected with the
formulation of a new physical theory, namely Einstein’s Theory of Gravitation.
Later chapters of this book are devoted to this theory, but in this chapter we
adopt a purely formal point of view and develop general tensor analysis on
the assumption that the metric is given and the $g_{\alpha\beta}$ are known functions of the
coordinates. Such a presentation has two advantages. In the first place we
can find the conditions which the $g_{\alpha\beta}$ must satisfy in order to be expressible
in the form (36.03); this gives us a generally covariant formulation of the
usual Theory of Relativity. In the second place we obtain in this way the
mathematical apparatus for formulating Einstein’s Theory of Gravitation.

Before going on to a systematic exposition of general tensor analysis we
establish the connection between the expressions $(\nabla\omega)^2$ and $ds^2$ which exists if

$$\sum_{\alpha=0}^{3} g_{\mu\alpha}\,g^{\alpha\nu} = \delta^\nu_\mu \tag{36.04}$$

regardless of whether the $g_{\alpha\beta}$ are of the form (36.03) or not. We show that if a
function $\omega(x_0, x_1, x_2, x_3)$ satisfies $(\nabla\omega)^2 = 0$ then differentials of the coordinates
related by the condition $\omega = \text{const.}$ satisfy $ds^2 = 0$.

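Putting

$$\omega_\alpha = \frac{\partial\omega}{\partial x_\alpha} \qquad (\alpha = 0, 1, 2, 3) \tag{36.05}$$

we write $(\nabla\omega)^2 = 0$ in the form

$$G = \sum_{\alpha,\beta=0}^{3} g^{\alpha\beta}\,\omega_\alpha\,\omega_\beta = 0 \tag{36.06}$$

This partial differential equation for $\omega$ is of the same type as the
Hamilton-Jacobi equation of classical mechanics and can be solved similarly to the
latter. If we solve it for $\omega_0$ and write

$$\omega_0 = -H(\omega_1, \omega_2, \omega_3) \tag{36.07}$$

the function $H$ will correspond to the Hamiltonian and Hamilton’s equations
will be

$$\frac{dx_k}{dx_0} = \frac{\partial H}{\partial\omega_k} ; \quad \frac{d\omega_k}{dx_0} = -\frac{\partial H}{\partial x_k} \qquad (k = 1, 2, 3) \tag{36.08}$$

But

$$\frac{\partial H}{\partial\omega_k} = \frac{(\partial G/\partial\omega_k)}{(\partial G/\partial\omega_0)} \tag{36.09}$$

and the first three equations of (36.08) show that the differentials $dx_\alpha$ $(\alpha = 0,
1, 2, 3)$ are proportional to the partial derivatives of $G$ with respect to the $\omega_\alpha$.
Denoting the infinitesimal coefficient of proportionality by $\tfrac{1}{2}\,dp$, we have

$$dx_\alpha = \frac{1}{2}\,\frac{\partial G}{\partial\omega_\alpha}\,dp = dp\sum_{\beta=0}^{3} g^{\alpha\beta}\,\omega_\beta \tag{36.10}$$

Solving for $\omega_\alpha$ with the use of (36.04) we get

$$\omega_\alpha\,dp = \sum_{\beta=0}^{3} g_{\alpha\beta}\,dx_\beta \tag{36.11}$$

and the obvious relation

$$\sum_{\alpha=0}^{3} \omega_\alpha\,dx_\alpha = 0 \tag{36.12}$$

then gives

$$ds^2 = \sum_{\alpha,\beta=0}^{3} g_{\alpha\beta}\,dx_\alpha\,dx_\beta = 0 \tag{36.13}$$

as required. Thus if we continue to consider the equation $(\nabla\omega)^2 = 0$ as
describing a wave front we can take it that for points on the wave front the
differentials of space and time coordinates are related by $ds^2 = 0$.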
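The argument can be condensed into a single chain of equalities. This short worked sketch is an addition; it assumes the proportionality between the coordinate differentials and the derivatives of $G$ in the form $dx_\alpha = dp \sum_\beta g^{\alpha\beta}\omega_\beta$, together with the inverse relation (36.04).

```latex
\begin{align*}
ds^2 &= \sum_{\alpha,\beta=0}^{3} g_{\alpha\beta}\,dx_\alpha\,dx_\beta
      = dp \sum_{\alpha=0}^{3} \omega_\alpha\,dx_\alpha
      && \text{since } \sum_{\beta} g_{\alpha\beta}\,dx_\beta = \omega_\alpha\,dp \\
     &= dp^2 \sum_{\alpha,\beta=0}^{3} g^{\alpha\beta}\,\omega_\alpha\,\omega_\beta
      = dp^2\,G = 0 .
\end{align*}
```

Written this way, the vanishing of $\sum_\alpha \omega_\alpha\,dx_\alpha$ is seen to be the same statement as $G = 0$, so the result does not depend on the $g_{\alpha\beta}$ having the special form (36.03).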

In the following we shall consider the $g_{\alpha\beta}$ as given functions of the variables
$x_0$, $x_1$, $x_2$ and $x_3$ and shall merely assume that they have continuous derivatives
of all orders considered and that they satisfy the inequalities stated in Section
35. In addition to the $g_{\alpha\beta}$ we shall consider the $g^{\alpha\beta}$, their connection being
(36.04). The conditions under which the $g_{\alpha\beta}$ can be represented in the form
(36.03) will be established in Section 42 below.

37. The Definitions of a Vector and of a Tensor. Tensor Algebra 

In tensor analysis one frequently has to deal with sums similar to those in
(36.01) and (36.02), in which the summation index appears twice. Following a
suggestion by Einstein we introduce an abbreviated notation for such sums
which consists of omitting the summation sign, summation over indices appearing
twice being understood. We further agree to sum from 0 to 3 if the indices
are Greek characters such as $(\alpha, \beta, \ldots)$ and from 1 to 3 if they are Roman ones,
such as $(i, k, \ldots)$. Using this notation we can, for instance, replace

$$ds^2 = \sum_{\alpha,\beta=0}^{3} g_{\alpha\beta}\,dx_\alpha\,dx_\beta \tag{37.01}$$

simply by

$$ds^2 = g_{\alpha\beta}\,dx_\alpha\,dx_\beta \tag{37.02}$$

or, if we wish to single out the time-like coordinate $x_0$,

$$ds^2 = g_{00}\,dx_0^2 + 2 g_{0i}\,dx_0\,dx_i + g_{ik}\,dx_i\,dx_k \tag{37.03}$$

This short notation proves very convenient and does not lead to
misunderstandings. In the rare cases when summation over a double index is not to be
understood we shall state this explicitly. For instance, in the special case
when (36.02) reduces to

$$ds^2 = dx_0^2 - dx_1^2 - dx_2^2 - dx_3^2 \tag{37.04}$$

we write

$$g_{\alpha\beta} = e_\alpha\,\delta_{\alpha\beta} \qquad \text{(no summation)} \tag{37.05}$$
In Section 20 we gave the definitions of a covariant and of a contravariant
vector for the case (37.05). Equations (20.12) and (20.13) can now be written as

$$A'_\alpha = \frac{\partial x_\beta}{\partial x'_\alpha}\,A_\beta \tag{37.06}$$

$$A'^{\alpha} = \frac{\partial x'_\alpha}{\partial x_\beta}\,A^{\beta} \tag{37.07}$$

and they can also serve as definitions of a vector in the general case. As before,
covariant vectors will be written as letters carrying a suffix while for
contravariant ones a superfix is used. However, for the coordinate differentials we
make an exception to this rule, writing them as $dx_\alpha$ although they form a
contravariant vector.

Thus a covariant vector can be defined as a set of four quantities that
transform like the four partial derivatives of some function with respect to the
coordinates. Similarly, a contravariant vector is a set of four quantities that
transform like the four coordinate differentials.

In the case when $ds^2$ has the form (37.01) and when we consider only Lorentz
transformations, the coefficients in the transformation formulae (37.06) and
(37.07) are constants. In the more general case of arbitrary transformations
the coefficients are variable. In the case of Lorentz transformations a vector
need not necessarily refer to a definite point in space-time but may be “free”.
An example of a free vector is the energy-momentum vector of a material
system. (Similarly the tensor of angular momentum and mass centre motion
is a free tensor.) By contrast, if one considers general coordinate transformations,
then, just as for ordinary curvilinear spatial coordinates, a vector must
necessarily be related to a particular point in space. Examples of bound vectors are
any field vectors, such as the velocity vector for a continuous medium, whose
components are functions of a point, i.e. of the coordinates $x_0$, $x_1$, $x_2$ and $x_3$.
However, a bound vector need not be defined throughout a finite region of
space-time; it may be defined only on some curve as is the case for the tangent vector
of a curve, or on some surface as for the normal to a surface. The same remarks
apply to tensors, and in general tensor analysis we shall always have to do with
bound tensors as well as bound vectors. When these are transformed the
quantities $\partial x'_\alpha/\partial x_\beta$ entering the transformation must always be taken at the
same point as the vector or tensor to be transformed.†

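This definition of a tensor is analogous to the one given in Section 21 for the
case of Lorentz transformations. Equations (21.01), (21.03) and (21.05) which
give the transformation rules for second order covariant, contravariant and
mixed tensors respectively remain valid in general tensor analysis. In our
present notation they read:

$$T'_{\mu\nu} = \frac{\partial x_\alpha}{\partial x'_\mu}\frac{\partial x_\beta}{\partial x'_\nu}\,T_{\alpha\beta} \tag{37.08}$$

$$T'^{\mu\nu} = \frac{\partial x'_\mu}{\partial x_\alpha}\frac{\partial x'_\nu}{\partial x_\beta}\,T^{\alpha\beta} \tag{37.09}$$

and

$$T'^{\mu}_{\nu} = \frac{\partial x'_\mu}{\partial x_\alpha}\frac{\partial x_\beta}{\partial x'_\nu}\,T^{\alpha}_{\beta} \tag{37.10}$$

Thus a covariant, contravariant or mixed tensor of the second order is a set of
quantities transforming, respectively, according to (37.08), (37.09) or (37.10).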

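Covariant or contravariant, but not mixed, tensors which have either of the
properties of symmetry or antisymmetry, retain that property after
transformation. For on renaming the suffixes in (37.08) we obtain

$$T'_{\nu\mu} = \frac{\partial x_\alpha}{\partial x'_\mu}\frac{\partial x_\beta}{\partial x'_\nu}\,T_{\beta\alpha} \tag{37.11}$$

Therefore, putting

$$2S_{\alpha\beta} = T_{\alpha\beta} + T_{\beta\alpha} \tag{37.12}$$

and

$$2A_{\alpha\beta} = T_{\alpha\beta} - T_{\beta\alpha} \tag{37.13}$$

and defining $S'_{\mu\nu}$ and $A'_{\mu\nu}$ correspondingly we get

$$S'_{\mu\nu} = \frac{\partial x_\alpha}{\partial x'_\mu}\frac{\partial x_\beta}{\partial x'_\nu}\,S_{\alpha\beta} \tag{37.14}$$

and

$$A'_{\mu\nu} = \frac{\partial x_\alpha}{\partial x'_\mu}\frac{\partial x_\beta}{\partial x'_\nu}\,A_{\alpha\beta} \tag{37.15}$$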

The quantity $S_{\alpha\beta}$ can be called the symmetric part and $A_{\alpha\beta}$ the antisymmetric
part of the tensor $T_{\alpha\beta}$. Equations (37.14) and (37.15) show that each of these
two parts separately is a tensor. Therefore, if the tensor $T_{\alpha\beta}$ is itself symmetric,
so that $A_{\alpha\beta} = 0$, then also $A'_{\mu\nu} = 0$, i.e. the transformed tensor $T'_{\mu\nu}$ is also
symmetric. Similarly, if $T_{\alpha\beta}$ is antisymmetric so that $S_{\alpha\beta} = 0$ then likewise
$S'_{\mu\nu} = 0$ and the transformed tensor $T'_{\mu\nu}$ remains antisymmetric. The same
considerations apply to a contravariant tensor, but for a mixed tensor the
upper and the lower index enter the transformation equation (37.10) differently
and therefore it has no invariant meaning to divide a mixed tensor into a
symmetric and an antisymmetric part.

† For further discussion of free vectors see Section 49.
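That the symmetric and antisymmetric parts transform separately can be checked on numbers. The sketch below is an addition under stated assumptions: `JAC_INV[al][mu]` stands for $\partial x_\alpha/\partial x'_\mu$ at one point, and both it and the components `T` are arbitrary illustrative integers, not values taken from the text.

```python
def transform_covariant(t, a):
    """T'_mn = a[al][m] * a[be][n] * T_albe, with a[al][m] standing for dx_al/dx'_m."""
    n = len(t)
    return [[sum(a[al][mu] * a[be][nu] * t[al][be]
                 for al in range(n) for be in range(n))
             for nu in range(n)] for mu in range(n)]


def symmetric_part(t):
    """S_ab = (T_ab + T_ba) / 2."""
    n = len(t)
    return [[(t[i][j] + t[j][i]) / 2 for j in range(n)] for i in range(n)]


def antisymmetric_part(t):
    """A_ab = (T_ab - T_ba) / 2."""
    n = len(t)
    return [[(t[i][j] - t[j][i]) / 2 for j in range(n)] for i in range(n)]


# Illustrative components of a covariant tensor and of the transformation matrix.
T = [[1, 2, 0, 4], [5, 3, 7, 1], [2, 0, 6, 8], [9, 1, 3, 2]]
JAC_INV = [[1, 2, 0, 1], [0, 1, 3, 0], [2, 0, 1, 1], [1, 1, 0, 2]]

# Taking the symmetric part commutes with the transformation.
lhs = symmetric_part(transform_covariant(T, JAC_INV))
rhs = transform_covariant(symmetric_part(T), JAC_INV)
print(all(abs(lhs[i][j] - rhs[i][j]) < 1e-12 for i in range(4) for j in range(4)))
```

Repeating the experiment with the mixed rule (37.10), in which the two indices are transformed by different matrices, would in general not give agreement, which is the point made in the text about mixed tensors.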

An extremely important example of a symmetric tensor of second rank
is the set of coefficients $g_{\alpha\beta}$ in $ds^2$. The fact that the $g_{\alpha\beta}$ are symmetric in their
two suffixes follows directly from their definition. That they form a tensor
follows from the invariance of $ds^2$. For on passing to new variables $x'_0$, $x'_1$,
$x'_2$ and $x'_3$ we have

$$g'_{\mu\nu}\,dx'_\mu\,dx'_\nu = g_{\alpha\beta}\,dx_\alpha\,dx_\beta \tag{37.16}$$

whence

$$g'_{\mu\nu} = g_{\alpha\beta}\,\frac{\partial x_\alpha}{\partial x'_\mu}\frac{\partial x_\beta}{\partial x'_\nu} \tag{37.17}$$

which is just the tensor transformation rule. The tensor $g_{\alpha\beta}$ is called the
fundamental or metric tensor.

The set of quantities $g^{\alpha\beta}$ defined by the equation

$$g^{\mu\alpha}\,g_{\alpha\nu} = \delta^\mu_\nu \tag{37.18}$$

also form a tensor, which in this case is contravariant. We proceed to prove
this. In the primed coordinate system the $g'_{\mu\nu}$ are given by (37.17) and the
$g'^{\mu\nu}$ have to be calculated from the equation

$$g'^{\mu\lambda}\,g'_{\lambda\nu} = \delta^\mu_\nu \tag{37.19}$$

On the other hand, if $g^{\alpha\beta}$ is to be a contravariant tensor, we must have

$$g'^{\mu\nu} = g^{\alpha\beta}\,\frac{\partial x'_\mu}{\partial x_\alpha}\frac{\partial x'_\nu}{\partial x_\beta} \tag{37.20}$$

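We must show that both definitions of the $g'^{\mu\nu}$ give the same result. Since the solution
of (37.19) for the $g'^{\mu\nu}$ must be unique it is sufficient to verify that the quantities
given by (37.20) satisfy (37.19). This is simply done by use of the equations

$$\frac{\partial x_\alpha}{\partial x'_\lambda}\frac{\partial x'_\lambda}{\partial x_\beta} = \delta^\alpha_\beta \tag{37.21}$$

and

$$g'_{\lambda\nu}\,\frac{\partial x'_\lambda}{\partial x_\beta} = g_{\beta\sigma}\,\frac{\partial x_\sigma}{\partial x'_\nu} \tag{37.22}$$

Inserting (37.20) into (37.19) and using first (37.22) and then (37.18) we obtain
the sequence of equations

$$g'^{\mu\lambda}\,g'_{\lambda\nu} = g^{\alpha\beta}\,\frac{\partial x'_\mu}{\partial x_\alpha}\frac{\partial x'_\lambda}{\partial x_\beta}\,g'_{\lambda\nu} = g^{\alpha\beta}\,g_{\beta\sigma}\,\frac{\partial x'_\mu}{\partial x_\alpha}\frac{\partial x_\sigma}{\partial x'_\nu} = \delta^\alpha_\sigma\,\frac{\partial x'_\mu}{\partial x_\alpha}\frac{\partial x_\sigma}{\partial x'_\nu} = \frac{\partial x'_\mu}{\partial x_\alpha}\frac{\partial x_\alpha}{\partial x'_\nu} = \delta^\mu_\nu \tag{37.23}$$

which finally leads to (37.19).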

The $g^{\alpha\beta}$ are called the contravariant components of the fundamental tensor,
the $g_{\alpha\beta}$ being its covariant components.

The equation

$$\delta^\alpha_\sigma\,\frac{\partial x'_\mu}{\partial x_\alpha}\frac{\partial x_\sigma}{\partial x'_\nu} = \delta^\mu_\nu \tag{37.24}$$

which forms part of (37.23) shows that the set of quantities

$$\delta^\mu_\nu = \begin{cases} 1 & \text{if } \mu = \nu \\ 0 & \text{if } \mu \neq \nu \end{cases} \tag{37.25}$$

which are the same in all coordinate systems, satisfy the definition of a mixed
tensor of second rank.

Given a covariant vector $A_\nu$ the set of quantities

$$A^\mu = g^{\mu\nu}A_\nu \tag{37.26}$$

will transform as a contravariant vector, as is easily verified. We do not consider
two vectors so related as essentially different and refer to the $A_\nu$ and the $A^\mu$
as the covariant and contravariant components of the same vector. The
operation described by (37.26) is referred to briefly as raising the index of the
vector $A_\nu$ and the inverse operation

$$A_\mu = g_{\mu\nu}A^\nu \tag{37.27}$$

is called lowering the index. Raising and lowering of indices can also be
performed on tensors. For instance, from the covariant tensor $T_{\mu\nu}$ we can obtain
the contravariant tensor

$$T^{\mu\nu} = g^{\mu\alpha}g^{\nu\beta}\,T_{\alpha\beta} \tag{37.28}$$

by raising both indices, and the original tensor can be restored by the inverse
operation

$$T_{\mu\nu} = g_{\mu\alpha}g_{\nu\beta}\,T^{\alpha\beta} \tag{37.29}$$
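Raising and lowering an index amounts to multiplying by the matrix of the $g^{\mu\nu}$ or of the $g_{\mu\nu}$. The following minimal sketch is an addition; it assumes a hypothetical diagonal metric with signature $(+\,-\,-\,-)$, chosen so that the inverse can be written down by inspection, and illustrative covariant components.

```python
# Hypothetical diagonal metric g_mn and its inverse g^mn (reciprocals on the diagonal).
G_LOWER = [[4.0, 0.0, 0.0, 0.0],
           [0.0, -1.0, 0.0, 0.0],
           [0.0, 0.0, -2.0, 0.0],
           [0.0, 0.0, 0.0, -0.5]]
G_UPPER = [[0.25, 0.0, 0.0, 0.0],
           [0.0, -1.0, 0.0, 0.0],
           [0.0, 0.0, -0.5, 0.0],
           [0.0, 0.0, 0.0, -2.0]]


def contract(g, v):
    """Return w[m] = g[m][n] * v[n], summed over the repeated index n."""
    return [sum(g[m][n] * v[n] for n in range(4)) for m in range(4)]


A_lower = [1.0, 2.0, 3.0, 4.0]           # illustrative covariant components A_n
A_upper = contract(G_UPPER, A_lower)     # raising the index: A^m = g^mn A_n
restored = contract(G_LOWER, A_upper)    # lowering it again: A_m = g_mn A^n

print(A_upper)    # [0.25, -2.0, -1.5, -8.0]
print(restored)   # [1.0, 2.0, 3.0, 4.0]
```

For a metric with non-zero off-diagonal coefficients the same `contract` function applies, but `G_UPPER` then has to be obtained by inverting the full matrix.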

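It is evident that the properties of symmetry and antisymmetry are not
affected by this. In forming a mixed tensor we must observe which of the
indices is raised or lowered. For clarity we can mark the position of the raised
or lowered index by a dot. For example, the tensors

$$T^{\mu\,\cdot}_{\cdot\,\nu} = g^{\mu\alpha}\,T_{\alpha\nu} \tag{37.30}$$

and

$$T^{\cdot\,\mu}_{\nu\,\cdot} = g^{\mu\alpha}\,T_{\nu\alpha} \tag{37.31}$$

are identical only if $T_{\mu\nu}$ is symmetric, in which case the dots are unnecessary
and we can write

$$T^{\mu}_{\nu} = g^{\mu\alpha}\,T_{\alpha\nu} = g^{\mu\alpha}\,T_{\nu\alpha} \tag{37.32}$$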

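For the sake of brevity we have hitherto discussed only vectors and second
rank tensors, but tensors of higher rank may be defined correspondingly.

A set of quantities is called a covariant tensor of rank $n$ if it undergoes the
transformation

$$A'_{\beta_1\beta_2\ldots\beta_n} = A_{\alpha_1\alpha_2\ldots\alpha_n}\,\frac{\partial x_{\alpha_1}}{\partial x'_{\beta_1}}\frac{\partial x_{\alpha_2}}{\partial x'_{\beta_2}}\cdots\frac{\partial x_{\alpha_n}}{\partial x'_{\beta_n}} \tag{37.33}$$

Similarly a set of quantities is a contravariant tensor of rank $n$ if it transforms
as follows

$$B'^{\beta_1\beta_2\ldots\beta_n} = B^{\alpha_1\alpha_2\ldots\alpha_n}\,\frac{\partial x'_{\beta_1}}{\partial x_{\alpha_1}}\frac{\partial x'_{\beta_2}}{\partial x_{\alpha_2}}\cdots\frac{\partial x'_{\beta_n}}{\partial x_{\alpha_n}} \tag{37.34}$$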
Finally a mixed tensor of rank $n$ with $k$ covariant and $m$ contravariant indices,
where $k + m = n$, is a set of quantities transforming according to the rule

$$C'^{\nu_1\ldots\nu_m}_{\beta_1\ldots\beta_k} = C^{\mu_1\ldots\mu_m}_{\alpha_1\ldots\alpha_k}\,\frac{\partial x'_{\nu_1}}{\partial x_{\mu_1}}\cdots\frac{\partial x'_{\nu_m}}{\partial x_{\mu_m}}\,\frac{\partial x_{\alpha_1}}{\partial x'_{\beta_1}}\cdots\frac{\partial x_{\alpha_k}}{\partial x'_{\beta_k}} \tag{37.35}$$

From a covariant tensor of rank $n$ we obtain a contravariant tensor of the same
rank by the relation

$$A^{\mu_1\ldots\mu_n} = A_{\alpha_1\ldots\alpha_n}\,g^{\alpha_1\mu_1}\cdots g^{\alpha_n\mu_n} \tag{37.36}$$

and we can speak of these quantities as the covariant and contravariant
components of one and the same tensor. If only some, but not all, indices are
raised we can obtain mixed tensors of the same rank and, in general, we should
indicate the position of the indices that are raised by the use of dots. If we
have two tensors of rank $k$ and $m$ we can construct a tensor of the higher rank
$n = k + m$ by multiplying components. For covariant tensors we may put

$$A_{\alpha_1\ldots\alpha_k}\,B_{\alpha_{k+1}\ldots\alpha_n} = C_{\alpha_1\ldots\alpha_n} \tag{37.37}$$

and similar equations hold for contravariant and mixed tensors.

Given a tensor $A_{\alpha_1\ldots\alpha_n}$ of rank n we can construct another
tensor $B_{\alpha_1\ldots\alpha_{n-2}}$ by the rule

$$ A_{\alpha_1\ldots\alpha_{n-2}\beta\gamma}\, g^{\beta\gamma}
   = B_{\alpha_1\ldots\alpha_{n-2}} \tag{37.38} $$

The rank of the new tensor is two less than the rank of the original one. This
operation is called contraction with respect to the two indices involved. In
(37.38) the contraction is with respect to the last two indices and it is evident
that the result of contraction will depend on which pair of indices is chosen.
Contraction may be performed in two stages. First one of the suffixes is raised
according to the rule

$$ A_{\alpha_1\ldots\alpha_{n-2}\alpha_{n-1}\gamma}\, g^{\beta\gamma}
   = A_{\alpha_1\ldots\alpha_{n-2}\alpha_{n-1}}{}^{\beta} \tag{37.39} $$

and then the other suffix, $\alpha_{n-1}$, is equated with the raised index
$\beta$ and the summation is performed

$$ A_{\alpha_1\ldots\alpha_{n-2}\beta}{}^{\beta}
   = B_{\alpha_1\ldots\alpha_{n-2}} \tag{37.40} $$


Contracting a second rank tensor produces a scalar

$$ g^{\mu\nu}\, T_{\mu\nu} = T \tag{37.41} $$

which can also be given as

$$ T^{\mu}_{\cdot\mu} = T^{\cdot\mu}_{\mu} = T \tag{37.42} $$

That the contraction of either of the two tensors (37.30) and (37.31) leads to
the same result is easily understandable because the scalar T depends only
on the symmetric part of the tensor $T_{\mu\nu}$ and for a symmetric covariant
tensor both the related mixed tensors are the same.
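The statement that T depends only on the symmetric part can be made explicit
by a short calculation. Here $S_{\mu\nu}$ and $K_{\mu\nu}$ are symbols
introduced only for this illustration; they denote the symmetric and
antisymmetric parts of $T_{\mu\nu}$.

$$ \begin{aligned}
T_{\mu\nu} &= S_{\mu\nu} + K_{\mu\nu}, \qquad
S_{\mu\nu} = \tfrac12\,(T_{\mu\nu} + T_{\nu\mu}), \qquad
K_{\mu\nu} = \tfrac12\,(T_{\mu\nu} - T_{\nu\mu}), \\
g^{\mu\nu} K_{\mu\nu} &= -\,g^{\mu\nu} K_{\nu\mu}
  = -\,g^{\nu\mu} K_{\nu\mu} = -\,g^{\mu\nu} K_{\mu\nu}
  \quad\Longrightarrow\quad g^{\mu\nu} K_{\mu\nu} = 0, \\
T &= g^{\mu\nu}\, T_{\mu\nu} = g^{\mu\nu}\, S_{\mu\nu}.
\end{aligned} $$

The second line uses the symmetry of $g^{\mu\nu}$ and a relabelling of the
summation indices.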

A vector cannot be contracted because it has only one index, but from two
vectors one can construct a second rank tensor which may then be contracted.
The result is the scalar

$$ g^{\mu\nu} A_\mu B_\nu = A_\mu B^\mu = A^\nu B_\nu \tag{37.43} $$

which can be called the scalar product of the two vectors $A_\mu$ and $B_\nu$.
If the two vectors coincide, the scalar is

$$ g^{\mu\nu} A_\mu A_\nu = A_\mu A^\mu \tag{37.44} $$

According to the sign of the scalar product of a vector with itself, all vectors
may be classified as being time-like, if $A_\mu A^\mu > 0$, space-like if
$A_\mu A^\mu < 0$, or null vectors if $A_\mu A^\mu = 0$. This classification is
the same as that discussed in Section 20.

If two time-like vectors $A^\mu$ and $B^\mu$ both have unit invariants the
absolute value of their scalar product will be not less than unity. In other
words the conditions

$$ g_{\mu\nu} A^\mu A^\nu = 1; \qquad g_{\mu\nu} B^\mu B^\nu = 1 \tag{37.45} $$

lead to the inequality

$$ |g_{\mu\nu} A^\mu B^\nu| \ge 1 \tag{37.46} $$

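This will now be proved. Condition (37.45) for $A^\mu$ can be written as

$$ \left( \sqrt{g_{00}}\, A^0 + \frac{1}{\sqrt{g_{00}}}\, g_{0i} A^i \right)^2
   - \left( \frac{g_{0i}\, g_{0k}}{g_{00}} - g_{ik} \right) A^i A^k = 1
   \tag{37.47} $$

or more briefly as

$$ \frac{1}{g_{00}}\,(A_0)^2 - a_{ik} A^i A^k = 1 \tag{37.48} $$

with

$$ a_{ik} = \frac{g_{0i}\, g_{0k}}{g_{00}} - g_{ik} \tag{37.49} $$

Similarly

$$ \frac{1}{g_{00}}\,(B_0)^2 - a_{ik} B^i B^k = 1 \tag{37.50} $$

On the other hand the scalar product $A_\mu B^\mu$ is equal to

$$ A_\mu B^\mu = \frac{1}{g_{00}}\, A_0 B_0 - a_{ik} A^i B^k \tag{37.51} $$

By virtue of the inequalities for the $g_{\mu\nu}$ proved in Section 35 the
quantities $a_{ik}$ are the coefficients of a positive definite quadratic form.
Therefore

$$ |a_{ik} A^i B^k| \le \sqrt{a_{ik} A^i A^k} \cdot \sqrt{a_{ik} B^i B^k}
   \tag{37.52} $$

and if we put

$$ A = \sqrt{a_{ik} A^i A^k}; \qquad B = \sqrt{a_{ik} B^i B^k} \tag{37.53} $$

we obtain

$$ |A_0| = \sqrt{g_{00}}\,\sqrt{1 + A^2}; \qquad
   |B_0| = \sqrt{g_{00}}\,\sqrt{1 + B^2}; \qquad
   |a_{ik} A^i B^k| \le AB \tag{37.54} $$

Therefore

$$ |A_\mu B^\mu| \ge \sqrt{1 + A^2}\,\sqrt{1 + B^2} - AB \ge 1 \tag{37.55} $$

as required.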

For general time-like vectors it follows from (37.46) that

$$ |g_{\mu\nu} A^\mu B^\nu| \ge \sqrt{g_{\alpha\beta} A^\alpha A^\beta}
   \cdot \sqrt{g_{\alpha\beta} B^\alpha B^\beta} \tag{37.56} $$

and it should be noted that this relation is analogous to (37.52) except for a
reversal of sign. This is due to the indefiniteness of the metric.
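Both the classification of vectors and the reversed inequality can be checked
numerically. The sketch below is an illustration only: it assumes the Galilean
metric with c = 1, and the helper names (`scalar_product`, `classify`,
`unit_timelike`) are hypothetical. It samples pairs of unit time-like vectors
and records the smallest absolute value of their scalar product.

```python
import math
import random

# Galilean metric with c = 1 (an assumption of this sketch).
GALILEAN = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]


def scalar_product(a, b, metric=GALILEAN):
    """g_{mu nu} A^mu B^nu for contravariant components a and b."""
    return sum(metric[m][n] * a[m] * b[n] for m in range(4) for n in range(4))


def classify(a, metric=GALILEAN):
    """Classify a vector by the sign of its scalar product with itself."""
    s = scalar_product(a, a, metric)
    if s > 0:
        return "time-like"
    if s < 0:
        return "space-like"
    return "null"


def unit_timelike(u):
    """Unit time-like vector with spatial part u and positive A^0."""
    return [math.sqrt(1.0 + sum(x * x for x in u))] + list(u)


rng = random.Random(0)  # fixed seed so that the run is reproducible
smallest = min(
    abs(scalar_product(unit_timelike([rng.uniform(-5, 5) for _ in range(3)]),
                       unit_timelike([rng.uniform(-5, 5) for _ in range(3)])))
    for _ in range(1000))
print(classify([1, 0, 0, 0]), classify([0, 1, 0, 0]), classify([1, 1, 0, 0]))
print(smallest)
```

Up to rounding error the smallest sampled value should not fall below unity,
in agreement with (37.46).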

For a time-like vector the covariant and the contravariant components of
index zero always have the same sign. Assuming that $A_0 > 0$ and $B_0 > 0$,
and thus also $A^0 > 0$ and $B^0 > 0$, we can write (37.56) in the form

$$ A_\mu B^\mu \ge \sqrt{A_\mu A^\mu}\,\sqrt{B_\nu B^\nu} \tag{37.57} $$

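To conclude this section we discuss the concept of a pseudo-tensor in general
tensor analysis. In Section 22 we introduced the set of quantities
$\varepsilon_{\alpha\beta\gamma\delta}$ which are antisymmetric in all their
suffixes with $\varepsilon_{0123} = 1$. These quantities satisfy (22.02) which,
in somewhat different notation, can be written as

$$ \varepsilon_{\alpha\beta\gamma\delta}\,
   \frac{\partial x'_\alpha}{\partial x_\mu}
   \frac{\partial x'_\beta}{\partial x_\nu}
   \frac{\partial x'_\gamma}{\partial x_\rho}
   \frac{\partial x'_\delta}{\partial x_\sigma}
   = D\, \varepsilon_{\mu\nu\rho\sigma} \tag{37.58} $$

Here D is the Jacobian† (35.11). On the other hand we can easily show by use
of the rule of determinant multiplication that

$$ \mathrm{Det}\, g_{\alpha\beta} = D^2\, \mathrm{Det}\, g'_{\lambda\mu}
   \tag{37.59} $$

This is a generalization of (35.18). Rewriting this equation in the form

$$ g = D^2 g' \tag{37.60} $$

and taking the square root, after first changing the sign, we get

$$ \sqrt{-g} = |D|\,\sqrt{-g'} \tag{37.61} $$

This is the transformation law for the determinant g. We now multiply both
sides of (37.58) by $\sqrt{-g'}$ and put

$$ E_{\mu\nu\rho\sigma} = \sqrt{-g}\; \varepsilon_{\mu\nu\rho\sigma}
   \tag{37.62} $$

and

$$ E'_{\alpha\beta\gamma\delta} = \sqrt{-g'}\;
   \varepsilon_{\alpha\beta\gamma\delta} \tag{37.63} $$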

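Then we can write (37.58) in the form

$$ E'_{\alpha\beta\gamma\delta}\,
   \frac{\partial x'_\alpha}{\partial x_\mu}
   \frac{\partial x'_\beta}{\partial x_\nu}
   \frac{\partial x'_\gamma}{\partial x_\rho}
   \frac{\partial x'_\delta}{\partial x_\sigma}
   = \operatorname{sgn} D \cdot E_{\mu\nu\rho\sigma} \tag{37.64} $$

where sgn D = ±1 is the sign of the Jacobian D. This equation shows that for
transformations with positive Jacobian the quantities $E_{\mu\nu\rho\sigma}$
transform as a covariant fourth-rank tensor while for transformations with
negative Jacobian their transformation rule differs only by a sign from the
rule for such a tensor. A set of quantities with such a transformation law will
be called a pseudo-tensor. The corresponding contravariant pseudo-tensor is
obtained by the general formula

$$ E^{\mu\nu\rho\sigma} = \sqrt{-g}\;
   g^{\mu\alpha} g^{\nu\beta} g^{\rho\gamma} g^{\sigma\delta}\,
   \varepsilon_{\alpha\beta\gamma\delta} \tag{37.65} $$

and is

$$ E^{\mu\nu\rho\sigma} = -\frac{1}{\sqrt{-g}}\;
   \varepsilon_{\mu\nu\rho\sigma} \tag{37.66} $$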

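The antisymmetric fourth-rank pseudo-tensor allows us to derive from any
antisymmetric tensor $A_{\gamma\delta}$ of second rank a dual pseudo-tensor

$$ \overset{*}{A}{}^{\alpha\beta} = \tfrac12\,
   E^{\alpha\beta\gamma\delta} A_{\gamma\delta} \tag{37.67} $$

and from any antisymmetric tensor $A_{\beta\gamma\delta}$ of third rank a dual
pseudo-vector

$$ \overset{*}{A}{}^{\alpha} = \tfrac16\,
   E^{\alpha\beta\gamma\delta} A_{\beta\gamma\delta} \tag{37.68} $$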

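If we construct an antisymmetric tensor $A_{\beta\gamma\delta}$ from the three
vectors $a_\alpha$, $b_\alpha$, $c_\alpha$ according to the rule

$$ A_{\beta\gamma\delta} = a_\beta b_\gamma c_\delta + a_\gamma b_\delta c_\beta
   + a_\delta b_\beta c_\gamma - a_\gamma b_\beta c_\delta
   - a_\delta b_\gamma c_\beta - a_\beta b_\delta c_\gamma \tag{37.69} $$

the dual pseudo-vector has the components

$$ \begin{aligned}
\overset{*}{A}{}^{0} &= -\frac{1}{\sqrt{-g}}
  \begin{vmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{vmatrix};
& \overset{*}{A}{}^{1} &= +\frac{1}{\sqrt{-g}}
  \begin{vmatrix} a_0 & a_2 & a_3 \\ b_0 & b_2 & b_3 \\ c_0 & c_2 & c_3 \end{vmatrix}; \\
\overset{*}{A}{}^{2} &= -\frac{1}{\sqrt{-g}}
  \begin{vmatrix} a_0 & a_1 & a_3 \\ b_0 & b_1 & b_3 \\ c_0 & c_1 & c_3 \end{vmatrix};
& \overset{*}{A}{}^{3} &= +\frac{1}{\sqrt{-g}}
  \begin{vmatrix} a_0 & a_1 & a_2 \\ b_0 & b_1 & b_2 \\ c_0 & c_1 & c_2 \end{vmatrix}
\end{aligned} \tag{37.70} $$

† In (22.02) D denoted the Jacobian of the inverse transformation.

These are proportional to the minors complementary to the elements $\xi_0$,
$\xi_1$, $\xi_2$ and $\xi_3$ of the determinant

$$ \begin{vmatrix}
\xi_0 & \xi_1 & \xi_2 & \xi_3 \\
a_0 & a_1 & a_2 & a_3 \\
b_0 & b_1 & b_2 & b_3 \\
c_0 & c_1 & c_2 & c_3
\end{vmatrix} \tag{37.71} $$

Obviously

$$ \overset{*}{A}{}^{\alpha} a_\alpha = 0; \qquad
   \overset{*}{A}{}^{\alpha} b_\alpha = 0; \qquad
   \overset{*}{A}{}^{\alpha} c_\alpha = 0 \tag{37.72} $$

so that the pseudo-vector $\overset{*}{A}{}^{\alpha}$ is perpendicular to all
three vectors $a_\alpha$, $b_\alpha$ and $c_\alpha$. It may be called the vector
product of these three vectors.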
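The orthogonality relations (37.72) can be verified with exact integer
arithmetic. The sketch below is an illustration under stated assumptions: it
takes the Galilean metric with c = 1, so that g = −1, and it assumes the sign
and normalization conventions of (37.66) and (37.68) as written above. The
function names `perm_sign` and `triple_vector_product` are hypothetical.

```python
from itertools import permutations


def perm_sign(p):
    """Sign of a permutation of 0..n-1, found by sorting it with swaps."""
    p = list(p)
    sign = 1
    for i in range(len(p)):
        while p[i] != i:
            j = p[i]
            p[i], p[j] = p[j], p[i]  # puts the value j into position j
            sign = -sign
    return sign


def triple_vector_product(a, b, c):
    """Components A*^alpha = -eps_{alpha beta gamma delta} a_beta b_gamma c_delta.

    This is (1/6) E^{alpha beta gamma delta} A_{beta gamma delta} for g = -1,
    with a, b, c given by their covariant components.
    """
    result = [0, 0, 0, 0]
    for p in permutations(range(4)):
        al, be, ga, de = p
        result[al] -= perm_sign(p) * a[be] * b[ga] * c[de]
    return result


def contract(upper, lower):
    """Sum of A^alpha b_alpha over alpha; no metric is needed here."""
    return sum(x * y for x, y in zip(upper, lower))


a = [1, 2, 3, 4]
b = [0, 1, -1, 2]
c = [2, 0, 1, 1]
dual = triple_vector_product(a, b, c)
print(dual, contract(dual, a), contract(dual, b), contract(dual, c))
```

Because the contravariant components of the dual are contracted directly with
covariant components, no index needs to be raised or lowered in the check.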

38. The Equation of a Geodesic 

We consider two point-instants corresponding to two events in time sequence
and we denote their coordinates by $x_\alpha^{(1)}$ and $x_\alpha^{(2)}$
respectively. Let a material point move along some curve in such a way that
when $x_0 = x_0^{(1)}$ its spatial coordinates are $x_k = x_k^{(1)}$ and when
$x_0 = x_0^{(2)}$ they are $x_k = x_k^{(2)}$.

As the events $x^{(1)}$ and $x^{(2)}$ are assumed to be in time sequence such
motion is possible with a speed less than that of light. The time $x_0$ and the
spatial coordinates $x_k$ corresponding to it can be expressed parametrically in
terms of an auxiliary variable p, by putting

$$ x_\alpha = \varphi^\alpha(p) \tag{38.01} $$

with

$$ x_\alpha^{(1)} = \varphi^\alpha(p_1); \qquad
   x_\alpha^{(2)} = \varphi^\alpha(p_2) \tag{38.02} $$

Since the speed of the motion is less than that of light the inequality

$$ ds^2 = g_{\alpha\beta}\, \dot\varphi^\alpha \dot\varphi^\beta\, dp^2 > 0
   \tag{38.03} $$

must hold for any infinitesimal interval along the path. Here a dot denotes
differentiation with respect to p. The finite interval between the two events
in time sequence, which is proportional to the interval of proper time $\tau$,
will be denoted by $s = c\tau$ and we have

$$ s = c\tau = \int_{p_1}^{p_2}
   \sqrt{g_{\alpha\beta}\, \dot\varphi^\alpha \dot\varphi^\beta}\; dp
   \tag{38.04} $$

We now consider two quasi-simultaneous events. The two points in space
at which the events take place can be joined by some curve and to each point
on this curve we can assign a definite instant of time, i.e. we can write down the
“time equation” for each point, taking care that any two intermediate
space-instants are quasi-simultaneous. The analytic expressions for the curve and
the time equation may again be stated in the form of equations (38.01) and
(38.02), but we can no longer interpret these equations as describing the motion
of a point along a curve; they now give a static description of the curve as a
whole. For any pair of intermediate points, infinitesimally separated, we have

$$ ds^2 = g_{\alpha\beta}\, \dot\varphi^\alpha \dot\varphi^\beta\, dp^2 < 0
   \tag{38.05} $$

and the space-like interval

$$ l = \int_{p_1}^{p_2}
   \sqrt{-g_{\alpha\beta}\, \dot\varphi^\alpha \dot\varphi^\beta}\; dp
   \tag{38.06} $$

characterizes the length of the curve.

The question arises of the extremal values of both the time-like interval
(38.04) between two events in time sequence and the space-like interval (38.06) 
between two quasi-simultaneous events. Both these variational problems lead 
to equations of the same form, whether the interval is time-like or space-like. 
The variational equations are called the equations of the geodesic by analogy 
with the theory of surfaces. However, it is important to note that whereas 
in the theory of surfaces, where the square of an infinitesimal distance is a 
positive definite quadratic form of the coordinate differentials, the geodesic 
is, generally speaking†, a shortest line; in the four-dimensional space-time
manifold the situation is different: the extremal value of the interval is a 
maximum for a time-like interval and neither a maximum nor a minimum for 
a space-like interval. This can easily be verified in the special case of the
Galilean metric where $ds^2$ has the form (37.04). For events in time sequence
we can then choose a reference frame so that the spatial coordinates of the
initial and final points are the same and we can choose the time t as the
parameter. We then have

$$ s = \int_{t^{(1)}}^{t^{(2)}} \sqrt{c^2 - v^2}\; dt \tag{38.07} $$

where

$$ v^2 = \left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2
   + \left(\frac{dz}{dt}\right)^2 \tag{38.08} $$

The solution of the variational problem in this case is given by constant values
of x, y and z, so that $v^2 = 0$. For any other trajectory $v^2$ will somewhere
be greater than zero, so that $\sqrt{c^2 - v^2} < c$ and therefore

$$ s < s_{\max} = c\,(t^{(2)} - t^{(1)}) \tag{38.09} $$

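For the space-like interval we can choose a frame of reference such that

$$ t^{(2)} = t^{(1)}; \qquad y^{(2)} = y^{(1)}; \qquad z^{(2)} = z^{(1)}
   \tag{38.10} $$

while $x^{(2)} > x^{(1)}$. Taking the coordinate x as the parameter we obtain

$$ l = \int_{x^{(1)}}^{x^{(2)}}
   \sqrt{1 + \left(\frac{dy}{dx}\right)^2 + \left(\frac{dz}{dx}\right)^2
   - c^2 \left(\frac{dt}{dx}\right)^2}\; dx \tag{38.11} $$

The solution of the variational problem is now given by constant values of
y, z and t for which then

$$ l_{\text{extr}} = x^{(2)} - x^{(1)} \tag{38.12} $$

However, for other curves y(x), z(x) or other time equations t(x) we may find
either $l > l_{\text{extr}}$ or $l < l_{\text{extr}}$ depending on whether the
square root in (38.11) is in the mean greater or less than unity.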
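The different character of the two extremal problems can be seen in a small
numerical experiment. The sketch below is an illustration only: it assumes the
Galilean metric with c = 1, restricts the motion to the x axis (time-like case)
or to variations of y and t alone (space-like case), and approximates the
integrals by a simple sum. The function names and the particular trial curves
are hypothetical choices.

```python
import math


def timelike_interval(x_of_t, t1, t2, c=1.0, steps=20000):
    """Approximate s = integral of sqrt(c^2 - v^2) dt for motion along x."""
    dt = (t2 - t1) / steps
    total = 0.0
    for i in range(steps):
        ta = t1 + i * dt
        v = (x_of_t(ta + dt) - x_of_t(ta)) / dt  # mean velocity on the step
        total += math.sqrt(c * c - v * v) * dt
    return total


def spacelike_length(y_of_x, t_of_x, x1, x2, c=1.0, steps=20000):
    """Approximate l = integral of sqrt(1 + y'^2 - c^2 t'^2) dx (z constant)."""
    dx = (x2 - x1) / steps
    total = 0.0
    for i in range(steps):
        xa = x1 + i * dx
        dy = (y_of_x(xa + dx) - y_of_x(xa)) / dx
        dtm = (t_of_x(xa + dx) - t_of_x(xa)) / dx
        total += math.sqrt(1.0 + dy * dy - c * c * dtm * dtm) * dx
    return total


def wiggle(u):
    """Small oscillation that vanishes at both end points u = 0 and u = 1."""
    return 0.1 * math.sin(2.0 * math.pi * u)


def zero(u):
    return 0.0


s_rest = timelike_interval(zero, 0.0, 1.0)
s_moving = timelike_interval(wiggle, 0.0, 1.0)
l_straight = spacelike_length(zero, zero, 0.0, 1.0)
l_bent_in_space = spacelike_length(wiggle, zero, 0.0, 1.0)
l_bent_in_time = spacelike_length(zero, wiggle, 0.0, 1.0)
print(s_rest, s_moving)
print(l_straight, l_bent_in_space, l_bent_in_time)
```

Under these assumptions the time-like interval of the trial path is smaller
than that of the path at rest, whereas the space-like length of a trial curve
may lie on either side of the extremal value.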

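We now derive the differential equations of the geodesic. The Lagrangian
of the variational problem is

$$ L = \sqrt{g_{\alpha\beta}\, \dot\varphi^\alpha \dot\varphi^\beta}
   \tag{38.13} $$

or, writing $x_\alpha$ instead of $\varphi^\alpha$

$$ L = \sqrt{g_{\alpha\beta}\, \dot x_\alpha \dot x_\beta} \tag{38.14} $$

† i.e. for sufficiently near terminal points.

The extremal condition for the integral

$$ s = \int_{p_1}^{p_2} L\, dp \tag{38.15} $$

leads to the Euler-Lagrange equations

$$ \frac{d}{dp} \frac{\partial L}{\partial \dot x_\alpha}
   - \frac{\partial L}{\partial x_\alpha} = 0 \tag{38.16} $$

We now put

$$ F = \tfrac12\, g_{\alpha\beta}\, \dot x_\alpha \dot x_\beta \tag{38.17} $$

so that

$$ L = \sqrt{2F} \tag{38.18} $$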

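By the same reasoning as in Section 17 we can choose a parameter p so that

$$ \frac{dF}{dp} = 0; \qquad F = \text{const} \tag{38.19} $$

and with this choice (38.16) is equivalent to

$$ \frac{d}{dp} \frac{\partial F}{\partial \dot x_\alpha}
   - \frac{\partial F}{\partial x_\alpha} = 0 \tag{38.20} $$

These last equations possess the integral

$$ \dot x_\alpha \frac{\partial F}{\partial \dot x_\alpha} - F = F
   = \text{const} \tag{38.21} $$

so that the condition (38.19) is a consequence of (38.21). Inserting the explicit
expression for F we obtain from (38.20)

$$ \frac{d}{dp} \left( g_{\alpha\beta}\, \dot x_\beta \right)
   - \frac12 \frac{\partial g_{\beta\gamma}}{\partial x_\alpha}\,
   \dot x_\beta \dot x_\gamma = 0 \tag{38.22} $$

or, performing the differentiations,

$$ g_{\alpha\beta}\, \ddot x_\beta
   + \left( \frac{\partial g_{\alpha\beta}}{\partial x_\gamma}
   - \frac12 \frac{\partial g_{\beta\gamma}}{\partial x_\alpha} \right)
   \dot x_\beta \dot x_\gamma = 0 \tag{38.23} $$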

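The coefficient of $\dot x_\beta \dot x_\gamma$ can be symmetrized with respect
to $\beta$ and $\gamma$; if we put

$$ [\beta\gamma, \alpha] = \frac12 \left(
   \frac{\partial g_{\alpha\beta}}{\partial x_\gamma}
   + \frac{\partial g_{\alpha\gamma}}{\partial x_\beta}
   - \frac{\partial g_{\beta\gamma}}{\partial x_\alpha} \right) \tag{38.24} $$

the differential equation of the geodesic becomes

$$ g_{\alpha\beta}\, \ddot x_\beta
   + [\beta\gamma, \alpha]\, \dot x_\beta \dot x_\gamma = 0 \tag{38.25} $$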

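The expression (38.24) is called a Christoffel symbol of the first kind. In
order to solve equations (38.25) for the second derivatives we multiply them by
$g^{\alpha\nu}$ and sum over $\alpha$. Then with the new symbol

$$ \{\beta\gamma, \nu\} = g^{\nu\alpha}\, [\beta\gamma, \alpha] \tag{38.26} $$

we obtain

$$ \ddot x_\nu + \{\beta\gamma, \nu\}\, \dot x_\beta \dot x_\gamma = 0
   \tag{38.27} $$

The expression (38.26) is called a Christoffel symbol of the second kind and is
often represented by an alternative symbol

$$ \{\beta\gamma, \nu\} = \Gamma^{\nu}_{\beta\gamma} \tag{38.28} $$

For uniformity we can also introduce a corresponding form of notation for
Christoffel symbols of the first kind:

$$ [\alpha\beta, \gamma] = \Gamma_{\gamma,\alpha\beta} \tag{38.29} $$

but this is less common practice.

We have thus

$$ \Gamma_{\gamma,\alpha\beta} = \frac12 \left(
   \frac{\partial g_{\alpha\gamma}}{\partial x_\beta}
   + \frac{\partial g_{\beta\gamma}}{\partial x_\alpha}
   - \frac{\partial g_{\alpha\beta}}{\partial x_\gamma} \right) \tag{38.30} $$

and

$$ \Gamma^{\nu}_{\alpha\beta} = \frac12\, g^{\nu\mu} \left(
   \frac{\partial g_{\alpha\mu}}{\partial x_\beta}
   + \frac{\partial g_{\beta\mu}}{\partial x_\alpha}
   - \frac{\partial g_{\alpha\beta}}{\partial x_\mu} \right) \tag{38.31} $$

In this notation the equation of a geodesic takes on the form

$$ \frac{d^2 x_\nu}{dp^2} + \Gamma^{\nu}_{\alpha\beta}\,
   \frac{dx_\alpha}{dp} \frac{dx_\beta}{dp} = 0 \tag{38.32} $$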

If the Christoffel symbols correspond to a metric tensor that can be written
in the form (35.15) equations (38.32) are equivalent to the relations

$$ \frac{d^2 x'_k}{dp^2} = 0 \qquad (k = 0, 1, 2, 3) \tag{38.33} $$

for the Galilean coordinates $x'_k$. This follows from the covariance of the
equations and the fact that in Galilean coordinates the Christoffel symbols
vanish. In this case, therefore, the equation of a geodesic leads to linear
dependence of the Galilean coordinates on the parameter p.
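The definition (38.31) lends itself to direct numerical evaluation. The
following sketch is an illustration under stated assumptions: derivatives of
the metric are approximated by central differences, the metric is passed in as
a hypothetical function `metric(x)` returning a list of lists, and the example
metric is the Galilean one written in coordinates (t, r, φ) with c = 1, i.e.
ds² = dt² − dr² − r² dφ². The helper names `invert` and `christoffel` are not
taken from the text.

```python
def invert(m):
    """Gauss-Jordan inverse of a small non-singular square matrix."""
    n = len(m)
    aug = [list(map(float, row)) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(m)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [vr - f * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def christoffel(metric, x, h=1e-5):
    """gamma[nu][a][b] approximates Gamma^nu_{ab} of (38.31) at the point x."""
    n = len(x)

    def derivative(k):
        # Central difference of every g_ij with respect to x_k.
        xp, xm = list(x), list(x)
        xp[k] += h
        xm[k] -= h
        gp, gm = metric(xp), metric(xm)
        return [[(gp[i][j] - gm[i][j]) / (2.0 * h) for j in range(n)]
                for i in range(n)]

    d = [derivative(k) for k in range(n)]  # d[k][i][j] = dg_ij / dx_k
    g_upper = invert(metric(x))
    return [[[0.5 * sum(g_upper[nu][mu]
                        * (d[b][a][mu] + d[a][b][mu] - d[mu][a][b])
                        for mu in range(n))
              for b in range(n)] for a in range(n)] for nu in range(n)]


def polar_metric(x):
    """Galilean metric in coordinates (t, r, phi), with c = 1."""
    r = x[1]
    return [[1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, -r * r]]


gamma = christoffel(polar_metric, [0.0, 2.0, 0.3])
print(gamma[1][2][2], gamma[2][1][2])
```

For this metric the only non-vanishing symbols are Γ^r_{φφ} = −r and
Γ^φ_{rφ} = Γ^φ_{φr} = 1/r, which the finite-difference values should reproduce
to within the discretization error.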

It is not difficult to verify that the development leading to (38.32) remains
valid whatever the sign of F. If F > 0 the “geodesic” joins two events in
time sequence and equations (38.32) can be interpreted as the equation of
motion of a free mass point moving with a speed less than that of light. The
increment dp of p is proportional to the increment $d\tau$ of the proper time
$\tau$ and (38.32) may be replaced by

$$ \frac{d^2 x_\nu}{d\tau^2} + \Gamma^{\nu}_{\alpha\beta}\,
   \frac{dx_\alpha}{d\tau} \frac{dx_\beta}{d\tau} = 0 \tag{38.34} $$

The length of the geodesic gives the interval of proper time between the
“departure” and the “arrival” of the mass point. If on the other hand
F < 0, the geodesic joins two quasi-simultaneous events and we can put dp
equal to the increment dl of the spatial interval. Equations (38.32) then
appear as

$$ \frac{d^2 x_\nu}{dl^2} + \Gamma^{\nu}_{\alpha\beta}\,
   \frac{dx_\alpha}{dl} \frac{dx_\beta}{dl} = 0 \tag{38.35} $$


The case F = 0 corresponds to a point moving along a ray with the speed 
of light. In this case the Lagrangian (38.18) is zero and the above derivation 
of the geodesic equation is no longer valid. However, the equations (38.32) 
themselves retain their meaning and as they possess the integral (38.21) they 
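are compatible with the condition F = 0. To justify the equations in this
case we can start from the Hamiltonian equations that were discussed in
Section 36. According to (36.08) we have

$$ \frac{dx_k}{dx_0} = \frac{\partial H}{\partial \omega_k}; \qquad
   \frac{d\omega_k}{dx_0} = -\frac{\partial H}{\partial x_k}
   \qquad (k = 1, 2, 3) \tag{38.36} $$

where the Hamiltonian $H = -\omega_0$ is obtained by solving for $\omega_0$ the
equation

$$ G \equiv \tfrac12\, g^{\mu\nu}\, \omega_\mu \omega_\nu = 0 \tag{38.37} $$

Therefore we have

$$ dH = -d\omega_0 = \frac{1}{\partial G / \partial \omega_0} \left(
   \frac{\partial G}{\partial \omega_k}\, d\omega_k
   + \frac{\partial G}{\partial x_\alpha}\, dx_\alpha \right) \tag{38.38} $$

Using the fact that

$$ \frac{d\omega_0}{dx_0} = -\frac{dH}{dx_0}
   = -\frac{\partial H}{\partial x_0} \tag{38.39} $$

and expressing the derivatives of H in terms of the derivatives of G we can
write equations (38.36) in a symmetric fashion:

$$ \frac{dx_\alpha}{dp} = \frac{\partial G}{\partial \omega_\alpha}; \qquad
   \frac{d\omega_\alpha}{dp} = -\frac{\partial G}{\partial x_\alpha}
   \qquad (\alpha = 0, 1, 2, 3) \tag{38.40} $$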

Here dp is considered to be the differential of the independent variable p. 
The first four equations of (38.40) have already been given in Section 36. 
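Writing the right-hand sides explicitly we get

$$ \frac{dx_\alpha}{dp} = g^{\alpha\beta}\, \omega_\beta; \qquad
   \frac{d\omega_\alpha}{dp} = -\frac12\,
   \frac{\partial g^{\mu\nu}}{\partial x_\alpha}\, \omega_\mu \omega_\nu
   \tag{38.41} $$

It is readily seen that these equations are equivalent to (38.32), for we have

$$ \omega_\nu = g_{\nu\lambda}\, \frac{dx_\lambda}{dp} \tag{38.42} $$

and therefore

$$ \frac{\partial g^{\mu\nu}}{\partial x_\alpha}\, \omega_\nu
   = \frac{\partial g^{\mu\nu}}{\partial x_\alpha}\, g_{\nu\lambda}\,
     \frac{dx_\lambda}{dp}
   = -g^{\mu\nu}\, \frac{\partial g_{\nu\lambda}}{\partial x_\alpha}\,
     \frac{dx_\lambda}{dp} \tag{38.43} $$

since

$$ \frac{\partial g^{\mu\nu}}{\partial x_\alpha}\, g_{\nu\lambda}
   + g^{\mu\nu}\, \frac{\partial g_{\nu\lambda}}{\partial x_\alpha}
   = \frac{\partial}{\partial x_\alpha}
     \left( g^{\mu\nu} g_{\nu\lambda} \right) = 0 \tag{38.44} $$

Inserting (38.43) into (38.41) we get

$$ \frac{d\omega_\alpha}{dp} = \frac12\, g^{\mu\nu}\, \omega_\mu\,
   \frac{\partial g_{\nu\lambda}}{\partial x_\alpha}\,
   \frac{dx_\lambda}{dp} \tag{38.45} $$

or, in consequence of the first set of equations in (38.41),

$$ \frac{d\omega_\alpha}{dp} = \frac12\,
   \frac{\partial g_{\nu\lambda}}{\partial x_\alpha}\,
   \frac{dx_\nu}{dp} \frac{dx_\lambda}{dp} \tag{38.46} $$

Eliminating the $\omega_\alpha$ from these equations and (38.42) we finally
obtain

$$ \frac{d}{dp} \left( g_{\alpha\beta}\, \frac{dx_\beta}{dp} \right)
   = \frac12\, \frac{\partial g_{\beta\gamma}}{\partial x_\alpha}\,
   \frac{dx_\beta}{dp} \frac{dx_\gamma}{dp} \tag{38.47} $$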
These equations are the same as the equations (38.22) from which the equations 
of the geodesic in the form (38.32) were derived. The passage from (38.41) 
to (38.47) is the usual one from Hamilton’s to Lagrange’s equations. 

We have thus proved that the geodesic of zero length is likewise determined 
by the equations (38.22) but with the condition F = 0 adjoined. 

It should be noted that because F is constant, a geodesic retains its character for
its entire length; it may always describe the motion of a point with a speed less than
that of light or it may be a null-line or, finally, it may be everywhere space-like.
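The constancy of F along a geodesic can be observed by integrating (38.32)
numerically. The sketch below is an illustration under stated assumptions: it
uses the Galilean metric in coordinates (t, r, φ) with c = 1, for which the
only non-vanishing Christoffel symbols are Γ^r_{φφ} = −r and Γ^φ_{rφ} = 1/r,
and a classical fourth-order Runge-Kutta step. The function names and the
initial data are hypothetical.

```python
import math


def geodesic_rhs(state):
    """Right-hand side of (38.32) for ds^2 = dt^2 - dr^2 - r^2 dphi^2."""
    t, r, phi, dt, dr, dphi = state
    return (dt, dr, dphi,
            0.0,                    # no Christoffel symbol with upper index t
            r * dphi * dphi,        # -Gamma^r_{phi phi} * dphi^2
            -2.0 * dr * dphi / r)   # -2 * Gamma^phi_{r phi} * dr * dphi


def rk4_step(f, state, h):
    """One classical Runge-Kutta step of size h."""
    k1 = f(state)
    k2 = f(tuple(s + 0.5 * h * k for s, k in zip(state, k1)))
    k3 = f(tuple(s + 0.5 * h * k for s, k in zip(state, k2)))
    k4 = f(tuple(s + h * k for s, k in zip(state, k3)))
    return tuple(s + h * (a + 2.0 * b + 2.0 * c + d) / 6.0
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def first_integral(state):
    """F = (1/2) g_{alpha beta} x'_alpha x'_beta, cf. (38.17)."""
    t, r, phi, dt, dr, dphi = state
    return 0.5 * (dt * dt - dr * dr - r * r * dphi * dphi)


def integrate(state, p_end, steps):
    h = p_end / steps
    for _ in range(steps):
        state = rk4_step(geodesic_rhs, state, h)
    return state


# Start at r = 1, phi = 0, moving in the phi direction: a time-like geodesic.
start = (0.0, 1.0, 0.0, 1.0, 0.0, 0.5)
end = integrate(start, 2.0, 2000)
x_end = end[1] * math.cos(end[2])
y_end = end[1] * math.sin(end[2])
print(first_integral(start), first_integral(end), x_end, y_end)
```

In Galilean coordinates x = r cos φ, y = r sin φ this geodesic is the straight
line x = 1, y = p/2, in agreement with (38.33), and the value of F at the end
point should agree with its initial value to within the integration error.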

For a null-geodesic the relation (38.37) with
$\omega_\alpha = \partial\omega / \partial x_\alpha$ may be considered to be the
Hamilton-Jacobi equation for the action function $\omega$. (See Section 36.)
The Hamilton-Jacobi equation for the general case can also be
readily obtained. For definiteness we consider the case of a point moving with
a speed less than that of light.

Choosing the time $t = x_0$ as the parameter and denoting differentiation
with respect to it by a dot we can write the Lagrangian of the problem† in
the form

$$ L = +\sqrt{g_{00} + 2 g_{0i}\, \dot x_i + g_{ik}\, \dot x_i \dot x_k}
   \tag{38.48} $$

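The generalized momenta are

$$ \frac{\partial L}{\partial \dot x_i} = p_i
   = \frac{1}{L} \left( g_{0i} + g_{ik}\, \dot x_k \right) \tag{38.49} $$

and the Hamiltonian is found by the usual rule to be the expression

$$ H = \dot x_i\, p_i - L
   = -\frac{1}{L} \left( g_{00} + g_{0k}\, \dot x_k \right) \tag{38.50} $$

with the velocities $\dot x_i$ expressed in terms of the momenta $p_i$ by (38.49).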

If we put

$$ p_0 = \frac{1}{L} \left( g_{00} + g_{0k}\, \dot x_k \right) \tag{38.51} $$

and observe that

$$ L\, dt = ds \tag{38.52} $$

where s is the length of arc, the four quantities $p_i$, $p_0$ can be uniformly
written as

$$ p_\alpha = g_{\alpha\beta}\, \frac{dx_\beta}{ds} \tag{38.53} $$

The identity

$$ g_{\alpha\beta}\, \frac{dx_\alpha}{ds} \frac{dx_\beta}{ds} = 1
   \tag{38.54} $$

leads to the relation

$$ g^{\mu\nu}\, p_\mu p_\nu = 1 \tag{38.55} $$

which can be regarded as the result of eliminating the three velocities
$\dot x_1$, $\dot x_2$ and $\dot x_3$ from the four equations (38.49) and
(38.51). The Hamiltonian $H = -p_0$ is obtained by solving (38.55) for $p_0$.
The Hamilton-Jacobi equation is obtained by the usual rule of expressing $p_1$,
$p_2$, $p_3$ and H as partial derivatives of S with respect to the spatial
coordinates and to time, as follows:

$$ p_i = \frac{\partial S}{\partial x_i}; \qquad
   H = -\frac{\partial S}{\partial x_0} \tag{38.56} $$

These equations can also be written as

$$ p_\alpha = \frac{\partial S}{\partial x_\alpha} \tag{38.57} $$

Thus the Hamilton-Jacobi form of the equation of a geodesic is

$$ g^{\mu\nu}\, \frac{\partial S}{\partial x_\mu}
   \frac{\partial S}{\partial x_\nu} = 1 \tag{38.58} $$

If a complete integral of the Hamilton-Jacobi equation

$$ S = S(x_0, x_1, x_2, x_3, c_1, c_2, c_3) + c_0 \tag{38.59} $$

is known, which contains three arbitrary constants $c_1$, $c_2$ and $c_3$, not
counting the additive constant $c_0$, the derivatives of S with respect to the
constants,

$$ \frac{\partial S}{\partial c_k} = b_k \qquad (k = 1, 2, 3) \tag{38.60} $$

are also constants, as is proved in mechanics. They are determined from the
conditions of the problem.

† It is convenient to introduce the Lagrangian with opposite sign to the usual
convention in mechanics. As a result, the sign of the energy will be opposite to
the sign of the Hamiltonian.

A comparison of (38.58) with (38.37) shows that the equations of a null- 
geodesic are obtained from (38.58) by replacing the right-hand side by zero. 
For a space-like geodesic the right-hand side of the Hamilton-Jacobi equation 
is a negative constant which can be set equal to −1.
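As a simple illustration of the use of a complete integral, consider the
Galilean metric with c = 1, so that $g^{\mu\nu}$ is diagonal with elements
(1, −1, −1, −1). This choice of units and the particular form of S below are
assumptions made only for this worked example.

$$ \begin{aligned}
&\left(\frac{\partial S}{\partial x_0}\right)^2
 - \sum_{k=1}^{3} \left(\frac{\partial S}{\partial x_k}\right)^2 = 1, \\
&S = x_0 \sqrt{1 + c_1^2 + c_2^2 + c_3^2}
 - (c_1 x_1 + c_2 x_2 + c_3 x_3) + c_0, \\
&\frac{\partial S}{\partial c_k}
 = \frac{c_k\, x_0}{\sqrt{1 + c_1^2 + c_2^2 + c_3^2}} - x_k = b_k
 \quad\Longrightarrow\quad
 x_k = \frac{c_k}{\sqrt{1 + c_1^2 + c_2^2 + c_3^2}}\; x_0 - b_k .
\end{aligned} $$

The first line is (38.58) for this metric, the second is a complete integral of
the form (38.59), and the third applies (38.60). The result is uniform
rectilinear motion with a velocity whose magnitude is less than unity, that is,
less than the speed of light in these units.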


39. Parallel Transport of a Vector 

In Euclidean space it is very easy to define what is meant by saying that 
two vectors at different points are equal and parallel. It simply means that their 
cartesian components are equal. Obviously this same definition holds for
vectors in a plane, and it can immediately be generalized to the case of vectors 
in a curved surface, provided the surface is developable. However, for a general 
non-developable surface parallelism of two vectors in it can be defined only if 
the points of application of the two vectors are infinitely close to each other. 
We can regard a vector on a surface as being a vector in space which is tangential 
to the surface at its point of application. Given a vector on the surface at a 
point P a vector at the adjacent point Q of the surface and parallel to the 
first vector in the sense of the geometry of the surface can be constructed as 
follows. We consider the given vector at P as a vector in space and we construct
at Q the vector that is parallel to it in space in the usual sense. We then
project this vector on to the tangent plane to the surface at Q. The tangential 
vector at Q is taken to be parallel to the given vector at P. 

This construction can be performed analytically as follows. Let $y_1$, $y_2$ and
$y_3$ be cartesian coordinates in Euclidean space and $x_1$ and $x_2$ the
coordinate parameters on the surface. Then the parametric equations of the
surface have the form

$$ y_1 = y_1(x_1, x_2); \qquad y_2 = y_2(x_1, x_2); \qquad
   y_3 = y_3(x_1, x_2) \tag{39.01} $$

and the square of the element of arc on the surface is

$$ ds^2 = g_{11}\, dx_1^2 + 2 g_{12}\, dx_1 dx_2 + g_{22}\, dx_2^2
   \tag{39.02} $$

where

$$ g_{ik} = \sum_{n=1}^{3} \frac{\partial y_n}{\partial x_i}
   \frac{\partial y_n}{\partial x_k} \tag{39.03} $$

Let $A_1$ and $A_2$ be the covariant components of some vector at the point
$P(x_1, x_2)$ on the surface and $A^1$ and $A^2$ its contravariant components.
We can look upon this vector as a vector in space with the cartesian components

$$ Y_n = \frac{\partial y_n}{\partial x_1}\, A^1
   + \frac{\partial y_n}{\partial x_2}\, A^2 \qquad (n = 1, 2, 3)
   \tag{39.04} $$

and we have

$$ A_i = \sum_{n=1}^{3} Y_n\, \frac{\partial y_n}{\partial x_i} \tag{39.05} $$

If on passing to the point $Q(x_1 + dx_1,\, x_2 + dx_2)$ we do not change the
cartesian components $Y_n$ we get a vector in space which is no longer
tangential to the surface. However, its tangential components define a vector
on the surface, namely

$$ A_i + \delta A_i = \sum_{n=1}^{3} Y_n \left(
   \frac{\partial y_n}{\partial x_i}
   + \sum_{k=1}^{2} \frac{\partial^2 y_n}{\partial x_i\, \partial x_k}\, dx_k
   \right) \tag{39.06} $$

and this vector is taken by definition to be the result of parallel transport of
$A_i$ to the point Q, in the sense of the geometry on the surface. The normal
component of $Y_n$ evidently drops out of equation (39.06).

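The expression that appears as an addition to $\partial y_n / \partial x_i$ in
that equation takes account of the change of this quantity in going from P to Q.
Owing to this additional term the component $A_i$ receives the increment

$$ \delta A_i = \sum_{n=1}^{3} \sum_{k=1}^{2} Y_n\,
   \frac{\partial^2 y_n}{\partial x_k\, \partial x_i}\, dx_k \tag{39.07} $$

Insertion of $Y_n$ from (39.04) gives

$$ \delta A_i = \sum_{k,l=1}^{2} \sum_{n=1}^{3}
   \frac{\partial y_n}{\partial x_l}\,
   \frac{\partial^2 y_n}{\partial x_k\, \partial x_i}\, A^l\, dx_k
   \tag{39.08} $$

It is readily shown using the expression (39.03) for the $g_{ik}$ that the sum
over n in (39.08) is

$$ \sum_{n=1}^{3} \frac{\partial y_n}{\partial x_l}\,
   \frac{\partial^2 y_n}{\partial x_k\, \partial x_i}
   = \frac12 \left( \frac{\partial g_{il}}{\partial x_k}
   + \frac{\partial g_{kl}}{\partial x_i}
   - \frac{\partial g_{ik}}{\partial x_l} \right) \tag{39.09} $$

or, using the definition (38.30) for the Christoffel symbols,

$$ \sum_{n=1}^{3} \frac{\partial y_n}{\partial x_l}\,
   \frac{\partial^2 y_n}{\partial x_k\, \partial x_i}
   = \Gamma_{l,ik} \tag{39.10} $$

Thus the increment of the components of any vector in parallel transport is

$$ \delta A_i = \sum_{k,l=1}^{2} \Gamma_{l,ik}\, A^l\, dx_k \tag{39.11} $$

It is important to note that this increment depends only on the internal
properties of the surface, which are determined by the expression (39.02) for
$ds^2$.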

The theory of the parallel transport of vectors was developed by Levi-Civita 
[14] and his pupils; it can be formulated almost without change for the case 
of the four-dimensional space-time manifold. 

Let the coefficients of the quadratic form

$$ ds^2 = g_{\alpha\beta}\, dx_\alpha\, dx_\beta \tag{39.12} $$

be represented as

$$ g_{\alpha\beta} = \sum_{n=1}^{N} e_n\,
   \frac{\partial y_n}{\partial x_\alpha}
   \frac{\partial y_n}{\partial x_\beta} \tag{39.13} $$

where the numbers $e_n$ are ±1 and the

$$ y_n = y_n(x_0, x_1, x_2, x_3) \tag{39.14} $$

are some functions. In the ordinary Theory of Relativity the $g_{\alpha\beta}$
are obtained from the quadratic form (35.09) and can be represented by (35.15);
this corresponds to N = 4. In the most general case we have ten
$g_{\alpha\beta}$ and to represent them in the form (39.13) not more than ten
functions $y_n$ are required. (This can be rigorously proved.) It should be
noted that since the signature of the quadratic form (39.12) is (+ − − −) there
must be at least one positive and at least three negative $e_n$.

We can formally interpret the $y_n$ as cartesian coordinates in some
many-dimensional pseudo-Euclidean space in which the metric is given by

$$ d\eta^2 = \sum_{n=1}^{N} e_n\, dy_n^2 \tag{39.15} $$

Space-time is then a certain hypersurface in this many-dimensional space.

To an ordinary contravariant vector $A^\alpha$ in space-time there will
correspond in the many-dimensional space a vector tangential to the hypersurface
and having the cartesian components

$$ Y_n = \frac{\partial y_n}{\partial x_\alpha}\, A^\alpha \tag{39.16} $$

(Here and in the following summation from 0 to 3 is assumed for Greek indices.)
Using (39.13) we thus obtain the following expression for the covariant vector
components $A_\alpha$:

$$ A_\alpha = \sum_{n=1}^{N} e_n\, Y_n\,
   \frac{\partial y_n}{\partial x_\alpha} \tag{39.17} $$

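In analogy with (39.06) we can define the values of the vector components
after parallel transport to an adjacent point by

$$ A_\alpha + \delta A_\alpha = \sum_{n=1}^{N} e_n\, Y_n \left(
   \frac{\partial y_n}{\partial x_\alpha}
   + \frac{\partial^2 y_n}{\partial x_\alpha\, \partial x_\beta}\,
   \delta x_\beta \right) \tag{39.18} $$

whence

$$ \delta A_\alpha = \sum_{n=1}^{N} e_n\, Y_n\,
   \frac{\partial^2 y_n}{\partial x_\alpha\, \partial x_\beta}\,
   \delta x_\beta \tag{39.19} $$

and, on insertion of the expressions (39.16) for $Y_n$,

$$ \delta A_\alpha = \sum_{n=1}^{N} e_n\,
   \frac{\partial y_n}{\partial x_\gamma}\,
   \frac{\partial^2 y_n}{\partial x_\alpha\, \partial x_\beta}\,
   A^\gamma\, \delta x_\beta \tag{39.20} $$

Now, by analogy with (39.10), it follows from (39.13) that

$$ \sum_{n=1}^{N} e_n\, \frac{\partial y_n}{\partial x_\gamma}\,
   \frac{\partial^2 y_n}{\partial x_\alpha\, \partial x_\beta}
   = \Gamma_{\gamma,\alpha\beta} \tag{39.21} $$

where the $\Gamma_{\gamma,\alpha\beta}$ are the ordinary Christoffel symbols

$$ \Gamma_{\gamma,\alpha\beta} = \frac12 \left(
   \frac{\partial g_{\alpha\gamma}}{\partial x_\beta}
   + \frac{\partial g_{\beta\gamma}}{\partial x_\alpha}
   - \frac{\partial g_{\alpha\beta}}{\partial x_\gamma} \right) \tag{39.22} $$

Thus the formula for the increment of vector components due to parallel
transport has the form

$$ \delta A_\alpha = \Gamma_{\gamma,\alpha\beta}\, A^\gamma\, \delta x_\beta
   \tag{39.23} $$

just as for an ordinary surface in Euclidean space.

Equation (39.23) contains both covariant and contravariant vector components,
but it is simple to state it entirely in terms of one or the other kind
of components. We have

$$ A^\gamma = g^{\gamma\nu} A_\nu \tag{39.24} $$

and

$$ g^{\gamma\nu}\, \Gamma_{\gamma,\alpha\beta} = \Gamma^{\nu}_{\alpha\beta}
   \tag{39.25} $$

so that

$$ \delta A_\alpha = \Gamma^{\nu}_{\alpha\beta}\, A_\nu\, \delta x_\beta
   \tag{39.26} $$

Only covariant components enter these equations. On the other hand we have

$$ \delta A_\alpha = g_{\alpha\gamma}\, \delta A^\gamma
   + A^\gamma\, \delta g_{\alpha\gamma}
   = g_{\alpha\gamma}\, \delta A^\gamma
   + A^\gamma\, \frac{\partial g_{\alpha\gamma}}{\partial x_\beta}\,
   \delta x_\beta \tag{39.27} $$

and it is easily verified that

$$ \Gamma_{\alpha,\gamma\beta} + \Gamma_{\gamma,\alpha\beta}
   = \frac{\partial g_{\alpha\gamma}}{\partial x_\beta} \tag{39.28} $$

Hence

$$ g_{\alpha\gamma}\, \delta A^\gamma
   = -\Gamma_{\alpha,\gamma\beta}\, A^\gamma\, \delta x_\beta \tag{39.29} $$

and therefore in terms of contravariant components the required equation is

$$ \delta A^\nu = -\Gamma^{\nu}_{\alpha\beta}\, A^\alpha\, \delta x_\beta
   \tag{39.30} $$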

We now consider the change in the scalar product of two vectors in parallel
transport. We have

$$ \delta (A^\nu B_\nu) = B_\nu\, \delta A^\nu + A^\alpha\, \delta B_\alpha
   \tag{39.31} $$

We insert the expression for $\delta A^\nu$ given by (39.30) and write
$\delta B_\alpha$ according to (39.26) as

$$ \delta B_\alpha = \Gamma^{\nu}_{\alpha\beta}\, B_\nu\, \delta x_\beta
   \tag{39.32} $$

It is then evident that the two terms on the right-hand side of (39.31) cancel,
giving

$$ \delta (A^\nu B_\nu) = 0 \tag{39.33} $$

Thus the scalar product of two vectors remains unchanged in parallel transport.
In particular, the magnitude of a vector is unaltered.

We have hitherto considered infinitesimal displacements, but by adding
up such displacements we can define the parallel transport along any given
curve. We assume that the coordinates of a point on a curve are given as
functions of some parameter p:

$$ x_\alpha = x_\alpha(p) \tag{39.34} $$

The $\Gamma^{\nu}_{\alpha\beta}$, being functions of the coordinates, will then
also be known functions of p. To obtain the vector $A^\nu$ as a function of p we
have to solve the differential equations

$$ \frac{dA^\nu}{dp} = -\Gamma^{\nu}_{\alpha\beta}\, A^\alpha\,
   \frac{dx_\beta}{dp} \tag{39.35} $$

If the values of $A^\nu$ are given at the initial point of the curve we can
integrate (39.35) and so obtain the values of the $A^\nu$ at the final point of
the curve. In this way we perform the parallel transport of any vector from the
initial to the final point of the curve. Evidently the result depends on the
nature of the curve along which the parallel transport is made.

We compare the equations (39.35) with the previously given equations for
a geodesic

$$ \frac{d^2 x_\nu}{dp^2} + \Gamma^{\nu}_{\alpha\beta}\,
   \frac{dx_\alpha}{dp} \frac{dx_\beta}{dp} = 0 \tag{38.27} $$

The two sets are the same if we put

$$ A^\nu = \frac{dx_\nu}{dp} \tag{39.36} $$

If the geodesic is time-like, so that it describes a point moving with a speed
less than that of light, the proper time $\tau$ can be used as the parameter p
and the vector $A^\nu$ will be the four-dimensional velocity. Thus in this case
the equation of a geodesic can be interpreted (in four-dimensional terms) as the
equation for parallel transport of the velocity vector in the direction given by
this vector itself.
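The dependence of parallel transport on the path, and the invariance of the
magnitude of the transported vector, can be seen on the simplest curved
surface. The sketch below is an illustration under stated assumptions: it
takes the unit sphere with ds² = dθ² + sin²θ dφ², for which Γ^θ_{φφ} =
−sin θ cos θ and Γ^φ_{θφ} = cot θ, and integrates (39.35) once round the circle
θ = θ₀ with φ as the parameter, using a fourth-order Runge-Kutta step. The
function names are hypothetical.

```python
import math


def transport_around_latitude(theta0, a_theta, a_phi, steps=2000):
    """Integrate (39.35) for A^theta, A^phi along theta = theta0, phi in [0, 2 pi]."""
    s, c = math.sin(theta0), math.cos(theta0)

    def rhs(a):
        # dA^theta/dphi = -Gamma^theta_{phi phi} A^phi = sin*cos * A^phi
        # dA^phi/dphi   = -Gamma^phi_{theta phi} A^theta = -cot * A^theta
        return (s * c * a[1], -(c / s) * a[0])

    a = (a_theta, a_phi)
    h = 2.0 * math.pi / steps
    for _ in range(steps):
        k1 = rhs(a)
        k2 = rhs((a[0] + 0.5 * h * k1[0], a[1] + 0.5 * h * k1[1]))
        k3 = rhs((a[0] + 0.5 * h * k2[0], a[1] + 0.5 * h * k2[1]))
        k4 = rhs((a[0] + h * k3[0], a[1] + h * k3[1]))
        a = (a[0] + h * (k1[0] + 2.0 * k2[0] + 2.0 * k3[0] + k4[0]) / 6.0,
             a[1] + h * (k1[1] + 2.0 * k2[1] + 2.0 * k3[1] + k4[1]) / 6.0)
    return a


def norm_squared(theta0, a):
    """g_{ik} A^i A^k on the unit sphere."""
    return a[0] ** 2 + math.sin(theta0) ** 2 * a[1] ** 2


theta0 = math.pi / 3.0
final = transport_around_latitude(theta0, 1.0, 0.0)
print(final, norm_squared(theta0, final))
```

For θ₀ = π/3 the transported vector returns to the starting point rotated
through the angle 2π cos θ₀ = π, so that it is reversed, while its magnitude is
unchanged in accordance with (39.33).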

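The equations for the parallel transport of a tensor of any rank are easily
obtained from those for a vector. As an example we consider the case of a
covariant second rank tensor $T_{\mu\nu}$. We start from the requirement that
the invariant

$$ I = T_{\mu\nu}\, A^\mu B^\nu \tag{39.37} $$

should not change in parallel transport, whatever the vectors $A^\mu$ and
$B^\nu$. With some relabelling of indices we can write $\delta I$ as

$$ \delta I = A^\mu B^\nu \left( \delta T_{\mu\nu}
   - T_{\alpha\nu}\, \Gamma^{\alpha}_{\mu\beta}\, \delta x_\beta
   - T_{\mu\alpha}\, \Gamma^{\alpha}_{\nu\beta}\, \delta x_\beta \right)
   \tag{39.38} $$

Since this expression must vanish for arbitrary $A^\mu$ and $B^\nu$ we must have

$$ \delta T_{\mu\nu} = \left( \Gamma^{\alpha}_{\mu\beta}\, T_{\alpha\nu}
   + \Gamma^{\alpha}_{\nu\beta}\, T_{\mu\alpha} \right) \delta x_\beta
   \tag{39.39} $$

which is the required generalization of the equation of parallel transport.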


40. Covariant Differentiation 

In the case when the $g_{\mu\nu}$ were constants it was possible to treat the
operation of differentiation as a symbolic multiplication by a vector; if, for
instance, $A_\nu$ is a vector given as a point function in some region then, for
constant $g_{\mu\nu}$, the expression $\partial A_\nu / \partial x_\mu$ is a
tensor with the same transformation properties as the product of a covariant
vector with $A_\nu$. Consequently in this special case tensor analysis and
tensor algebra are formally identical.

For variable $g_{\mu\nu}$ this is not so; the derivatives of a vector with
respect to the coordinates no longer form a tensor. However, it is still
possible to construct certain linear combinations of the derivatives of vector
components and of the components themselves which transform as a tensor.

We consider a vector field $A_\nu$ defined in some region of space. The vector
components are then functions of the coordinates. The change of the vector
in going from a point $P(x_\beta)$ to an adjacent point
$Q(x_\beta + \delta x_\beta)$ is

$$ (A_\nu)_Q - (A_\nu)_P = \delta_1 A_\nu
   = \frac{\partial A_\nu}{\partial x_\beta}\, \delta x_\beta \tag{40.01} $$

However, we can also compare the values $(A_\nu)_Q$ of the components $A_\nu$
at Q not with their values at P but with the result $(A_\nu)^{*}_Q$ of their
parallel transport from P to Q. According to (39.26) the change of a vector in
parallel transport is

$$ (A_\nu)^{*}_Q - (A_\nu)_P = \delta_2 A_\nu
   = \Gamma^{\rho}_{\nu\beta}\, A_\rho\, \delta x_\beta \tag{40.02} $$

Subtraction of this expression from the previous one to obtain the difference

$$ (A_\nu)_Q - (A_\nu)^{*}_Q = \delta A_\nu
   = \delta_1 A_\nu - \delta_2 A_\nu \tag{40.03} $$

results in

$$ \delta A_\nu = \left( \frac{\partial A_\nu}{\partial x_\beta}
   - \Gamma^{\rho}_{\nu\beta}\, A_\rho \right) \delta x_\beta \tag{40.04} $$

The quantity $\delta A_\nu$ gives the difference between the actual change of
the vector and the change it would experience in parallel transport. It is at
the same time the difference of two vectors both referring to the same point Q.
Therefore the $\delta A_\nu$ form a vector. For arbitrary $\delta x_\beta$ this
can only be true if the quantities

$$ \nabla_\beta A_\nu = \frac{\partial A_\nu}{\partial x_\beta}
   - \Gamma^{\rho}_{\nu\beta}\, A_\rho \tag{40.05} $$

form a covariant tensor of the second rank.† It is called the tensor derivative
or covariant derivative of the vector and is the required generalization to
variable $g_{\mu\nu}$ of the ordinary derivative of a vector.

By analogous considerations we can derive an expression for the covariant
derivative of a contravariant vector. It has the form

$$ \nabla_\beta A^\nu = \frac{\partial A^\nu}{\partial x_\beta}
   + \Gamma^{\nu}_{\rho\beta}\, A^\rho \tag{40.06} $$
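A simple check of (40.06) is provided by a constant vector field in the
Euclidean plane, written in polar coordinates. The choice of the plane with
$ds^2 = dr^2 + r^2 d\varphi^2$, for which
$\Gamma^{r}_{\varphi\varphi} = -r$ and
$\Gamma^{\varphi}_{r\varphi} = \Gamma^{\varphi}_{\varphi r} = 1/r$, is an
assumption made only for this illustration. The unit vector along the cartesian
x axis has the polar components $A^r = \cos\varphi$,
$A^\varphi = -\sin\varphi / r$, and all its covariant derivatives vanish:

$$ \begin{aligned}
\nabla_r A^r &= \frac{\partial A^r}{\partial r} = 0, \\
\nabla_\varphi A^r &= \frac{\partial A^r}{\partial \varphi}
  + \Gamma^{r}_{\varphi\varphi} A^\varphi
  = -\sin\varphi + (-r)\left(-\frac{\sin\varphi}{r}\right) = 0, \\
\nabla_r A^\varphi &= \frac{\partial A^\varphi}{\partial r}
  + \Gamma^{\varphi}_{\varphi r} A^\varphi
  = \frac{\sin\varphi}{r^2} - \frac{\sin\varphi}{r^2} = 0, \\
\nabla_\varphi A^\varphi &= \frac{\partial A^\varphi}{\partial \varphi}
  + \Gamma^{\varphi}_{r\varphi} A^r
  = -\frac{\cos\varphi}{r} + \frac{\cos\varphi}{r} = 0 .
\end{aligned} $$

The ordinary derivatives of the components do not vanish, but the terms with
the Christoffel symbols compensate them exactly, as must be the case for a
field that is constant in cartesian coordinates.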


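The formulae of covariant differentiation are easily generalized to apply to
arbitrary tensors. We consider first the second rank tensor $T_{\mu\nu}$.
According to (39.39) parallel transport from P to Q changes its components by

$$ \delta_2 T_{\mu\nu} = \left( \Gamma^{\rho}_{\mu\beta}\, T_{\rho\nu}
   + \Gamma^{\rho}_{\nu\beta}\, T_{\mu\rho} \right) \delta x_\beta
   \tag{40.07} $$

Taking the $T_{\mu\nu}$ to be functions of the coordinates their change in going
from P to Q is

$$ \delta_1 T_{\mu\nu} = \frac{\partial T_{\mu\nu}}{\partial x_\beta}\,
   \delta x_\beta \tag{40.08} $$

The difference

$$ \delta T_{\mu\nu} = \delta_1 T_{\mu\nu} - \delta_2 T_{\mu\nu}
   \tag{40.09} $$

becomes

$$ \delta T_{\mu\nu} = \left( \frac{\partial T_{\mu\nu}}{\partial x_\beta}
   - \Gamma^{\rho}_{\mu\beta}\, T_{\rho\nu}
   - \Gamma^{\rho}_{\nu\beta}\, T_{\mu\rho} \right) \delta x_\beta
   \tag{40.10} $$

and is a tensor for arbitrary displacements $\delta x_\beta$. Therefore, the
quantity

$$ \nabla_\beta T_{\mu\nu} = \frac{\partial T_{\mu\nu}}{\partial x_\beta}
   - \Gamma^{\rho}_{\mu\beta}\, T_{\rho\nu}
   - \Gamma^{\rho}_{\nu\beta}\, T_{\mu\rho} \tag{40.11} $$

must be a tensor.

Similarly one proves the tensor character of

$$ \nabla_\beta T^{\mu\nu} = \frac{\partial T^{\mu\nu}}{\partial x_\beta}
   + \Gamma^{\mu}_{\rho\beta}\, T^{\rho\nu}
   + \Gamma^{\nu}_{\rho\beta}\, T^{\mu\rho} \tag{40.12} $$

where $T^{\mu\nu}$ is a contravariant tensor and of

$$ \nabla_\beta T^{\mu}_{\nu} = \frac{\partial T^{\mu}_{\nu}}{\partial x_\beta}
   + \Gamma^{\mu}_{\rho\beta}\, T^{\rho}_{\nu}
   - \Gamma^{\rho}_{\nu\beta}\, T^{\mu}_{\rho} \tag{40.13} $$

where $T^{\mu}_{\nu}$ is a mixed tensor.

† The tensor character of (40.05) can also be proved without use of the notion of
parallel transport. To do this one starts from (37.06) and proves the
transformation law (42.04) for Christoffel symbols. One then transforms (40.05)
to new variables using the two transformation laws (42.04) and (37.06).

By applying the rules of covariant differentiation to either the covariant
or the contravariant components of the metric tensor expressions (40.11) and
(40.12) can be shown to vanish. This fact is known as Ricci’s lemma. Because
of its importance we shall consider its formulation in some detail.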

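If we put $T_{\mu\nu} = g_{\mu\nu}$ equation (40.11) gives

$$\nabla_\sigma g_{\mu\nu} = \frac{\partial g_{\mu\nu}}{\partial x_\sigma} - \Gamma^\rho_{\mu\sigma} g_{\rho\nu} - \Gamma^\rho_{\nu\sigma} g_{\mu\rho} = 0 \tag{40.14}$$

for, written in the form

$$\frac{\partial g_{\mu\nu}}{\partial x_\sigma} = \Gamma_{\nu,\mu\sigma} + \Gamma_{\mu,\nu\sigma} \tag{40.15}$$

this equation is an obvious consequence of the definition (39.22) of the $\Gamma_{\mu,\nu\sigma}$
and was already stated in (39.28).

If we put $T^{\mu\nu} = g^{\mu\nu}$ in (40.12) we get

$$\nabla_\sigma g^{\mu\nu} = \frac{\partial g^{\mu\nu}}{\partial x_\sigma} + \Gamma^\mu_{\rho\sigma} g^{\rho\nu} + \Gamma^\nu_{\rho\sigma} g^{\mu\rho} = 0 \tag{40.16}$$

This can be verified by using the explicit expressions (38.31) for the $\Gamma^\rho_{\mu\nu}$. Then
(40.16) reduces to

$$\frac{\partial g^{\mu\nu}}{\partial x_\sigma} + g^{\mu\alpha} g^{\nu\beta}\,\frac{\partial g_{\alpha\beta}}{\partial x_\sigma} = 0 \tag{40.17}$$

and this is equivalent to

$$g_{\mu\alpha}\frac{\partial g^{\alpha\nu}}{\partial x_\sigma} + g^{\alpha\nu}\frac{\partial g_{\mu\alpha}}{\partial x_\sigma} = \frac{\partial}{\partial x_\sigma}\left(g_{\mu\alpha}\, g^{\alpha\nu}\right) = 0 \tag{40.18}$$

Finally, if $T^{\mu}_{\nu} = \delta^{\mu}_{\nu}$, the expression (40.13) reduces to zero by virtue of the
symmetry of the $\Gamma^\rho_{\mu\nu}$ with respect to their lower indices.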
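As a numerical illustration of Ricci's lemma, the following Python sketch evaluates the right-hand side of (40.14) for one simple metric. The choice of the unit 2-sphere with coordinates (theta, phi), the finite-difference step and all function names are assumptions made for this example only; they do not come from the text.

```python
import math

def metric(x):
    # Illustrative metric (an assumption): unit 2-sphere, x = (theta, phi).
    return [[1.0, 0.0], [0.0, math.sin(x[0]) ** 2]]

def metric_derivatives(x, h=1e-6):
    # dg[s][m][v] approximates d g_{mv} / d x_s by central differences.
    n = len(x)
    dg = []
    for s in range(n):
        xp, xm = list(x), list(x)
        xp[s] += h
        xm[s] -= h
        gp, gm = metric(xp), metric(xm)
        dg.append([[(gp[m][v] - gm[m][v]) / (2 * h) for v in range(n)]
                   for m in range(n)])
    return dg

def inverse_2x2(g):
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    return [[g[1][1] / det, -g[0][1] / det],
            [-g[1][0] / det, g[0][0] / det]]

def christoffel(x):
    # G[r][m][v] = Gamma^r_{mv}
    #            = (1/2) g^{ra} (d_v g_{ma} + d_m g_{va} - d_a g_{mv})
    n = len(x)
    g_inv = inverse_2x2(metric(x))
    dg = metric_derivatives(x)
    return [[[0.5 * sum(g_inv[r][a] * (dg[v][m][a] + dg[m][v][a] - dg[a][m][v])
                        for a in range(n))
              for v in range(n)] for m in range(n)] for r in range(n)]

def covariant_derivative_of_metric(x):
    # nabla[s][m][v] = d_s g_{mv} - Gamma^r_{ms} g_{rv} - Gamma^r_{vs} g_{mr}
    n = len(x)
    g, dg, G = metric(x), metric_derivatives(x), christoffel(x)
    return [[[dg[s][m][v]
              - sum(G[r][m][s] * g[r][v] for r in range(n))
              - sum(G[r][v][s] * g[m][r] for r in range(n))
              for v in range(n)] for m in range(n)] for s in range(n)]

# Largest component of nabla_s g_{mv} at an arbitrary sample point.
residual = max(abs(c)
               for plane in covariant_derivative_of_metric([1.0, 0.3])
               for row in plane for c in row)
print(residual)
```

The residual vanishes up to rounding error because (40.14) is an algebraic consequence of the way the Christoffel symbols are built from the derivatives of the metric, whatever the accuracy of the difference quotients.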

Covariant differentiation of a product of two tensors obeys the same rules 
as ordinary differentiation. This follows directly from our method of deriving 
the expression for a covariant derivative, for the rule was obtained from the 
expression for an infinitesimal increment for which the ordinary rule for 
differentiating a product is valid. 

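Let us verify the product rule in the differentiation of the product of two
vectors. Inserting

$$T_{\mu\nu} = U_\mu V_\nu \tag{40.19}$$

into (40.11) we get

$$\nabla_\sigma (U_\mu V_\nu) = \frac{\partial (U_\mu V_\nu)}{\partial x_\sigma} - \Gamma^\rho_{\mu\sigma}\, U_\rho V_\nu - \Gamma^\rho_{\nu\sigma}\, U_\mu V_\rho \tag{40.20}$$

and, using (40.05),

$$\nabla_\sigma (U_\mu V_\nu) = (\nabla_\sigma U_\mu)\, V_\nu + U_\mu\, \nabla_\sigma V_\nu \tag{40.21}$$

Applying the product rule to the expression

$$U_\mu = g_{\mu\nu}\, U^\nu \tag{40.22}$$

and using the fact that the covariant derivatives of the metric tensor vanish,
we obtain

$$\nabla_\sigma U_\mu = g_{\mu\nu}\, \nabla_\sigma U^\nu \tag{40.23}$$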

Thus we see that in covariant differentiation the $g_{\mu\nu}$ behave as constants so
that they can be taken outside the differentiation symbol. In other words
it is immaterial whether indices are raised (or lowered) before or after covariant
differentiation.

We have just stated explicitly the expression for the covariant derivatives
of a vector and of a second rank tensor. The covariant derivative of a scalar
is the same as the ordinary derivative

$$\nabla_\sigma \Phi = \frac{\partial \Phi}{\partial x_\sigma} \tag{40.24}$$

This quantity is, as we know, a covariant vector. 

For completeness we write down the general expression for the covariant
derivative of a tensor of arbitrary rank

$$U^{(\mu)}_{(\nu)} = U^{\mu_1 \ldots \mu_m}_{\nu_1 \ldots \nu_k} \tag{40.25}$$

which has m contravariant and k covariant indices. We have

$$\nabla_\sigma U^{(\mu)}_{(\nu)} = \frac{\partial U^{(\mu)}_{(\nu)}}{\partial x_\sigma}
+ \Gamma^{\mu_1}_{\rho\sigma}\, U^{\rho\mu_2 \ldots \mu_m}_{(\nu)} + \ldots + \Gamma^{\mu_m}_{\rho\sigma}\, U^{\mu_1 \ldots \mu_{m-1}\rho}_{(\nu)}
- \Gamma^{\rho}_{\nu_1\sigma}\, U^{(\mu)}_{\rho\nu_2 \ldots \nu_k} - \ldots - \Gamma^{\rho}_{\nu_k\sigma}\, U^{(\mu)}_{\nu_1 \ldots \nu_{k-1}\rho} \tag{40.26}$$

In each term of this sum one index of the tensor being differentiated becomes
an index attached in the same position to the Γ-symbol, while in its place the
tensor itself carries a summation index which is repeated in the opposite position
as one of the indices of the accompanying Γ-symbol. One of the lower indices
of Γ is always the label of the coordinate with respect to which one is
differentiating. The terms in which a change in a superfix of the tensor is made enter
with a plus sign and those in which a suffix is involved have a minus.

Sometimes it is convenient to use a special notation for the operation of
covariant differentiation combined with a raising of the suffix. The operation

$$\nabla^\sigma = g^{\sigma\rho}\, \nabla_\rho \tag{40.27}$$

may be called contravariant differentiation.


41. Examples of Covariant Differentiation 

We shall now apply to some special cases the rules of covariant differentiation 
obtained above. 

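First we calculate the divergence of a vector. By (40.06) the expression

$$\nabla_\mu A^\nu = \frac{\partial A^\nu}{\partial x_\mu} + \Gamma^\nu_{\alpha\mu} A^\alpha \tag{41.01}$$

is a mixed second rank tensor. Contraction with respect to $\mu$ and $\nu$ gives the
scalar

$$\operatorname{Div} A = \nabla_\nu A^\nu = \frac{\partial A^\nu}{\partial x_\nu} + \Gamma^\nu_{\alpha\nu} A^\alpha \tag{41.02}$$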

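This is a generalization of the expression (21.24) for the four-dimensional
divergence. It can be simplified as follows. We have

$$\Gamma^\nu_{\alpha\nu} = \frac12\, g^{\mu\nu}\left(\frac{\partial g_{\mu\nu}}{\partial x_\alpha} + \frac{\partial g_{\mu\alpha}}{\partial x_\nu} - \frac{\partial g_{\nu\alpha}}{\partial x_\mu}\right) = \frac12\, g^{\mu\nu}\frac{\partial g_{\mu\nu}}{\partial x_\alpha} \tag{41.03}$$

and if g is the determinant formed from the $g_{\mu\nu}$ and if in it $g\,g^{\mu\nu}$ is the minor of
$g_{\mu\nu}$, then the rule for differentiating a determinant gives

$$dg = g\, g^{\mu\nu}\, dg_{\mu\nu} \tag{41.04}$$

whence

$$\frac{1}{g}\frac{\partial g}{\partial x_\alpha} = g^{\mu\nu}\frac{\partial g_{\mu\nu}}{\partial x_\alpha} \tag{41.05}$$

We note that by (40.18) we also have

$$\frac{1}{g}\frac{\partial g}{\partial x_\alpha} = -g_{\mu\nu}\frac{\partial g^{\mu\nu}}{\partial x_\alpha} \tag{41.06}$$

and therefore

$$\Gamma^\nu_{\alpha\nu} = \frac{1}{2g}\frac{\partial g}{\partial x_\alpha} = \frac{1}{\sqrt{-g}}\frac{\partial \sqrt{-g}}{\partial x_\alpha} \tag{41.07}$$

Insertion into (41.02) gives

$$\nabla_\nu A^\nu = \frac{1}{\sqrt{-g}}\frac{\partial}{\partial x_\nu}\left\{\sqrt{-g}\, A^\nu\right\} \tag{41.08}$$

We see that the divergence of a vector A multiplied by $\sqrt{-g}$ is equal to the
sum of partial derivatives with respect to the coordinates of $\sqrt{-g}$ times the
contravariant vector components.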
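The structure of (41.08) can be tried out numerically. The sketch below is only an illustration under stated assumptions: it uses ordinary three-dimensional space in spherical coordinates (r, theta, phi), where the metric is positive definite so that $\sqrt{g}$ replaces $\sqrt{-g}$, and it takes the radial field with contravariant components (r, 0, 0). The function names and the step size are hypothetical choices made for this example.

```python
import math

def sqrt_g(r, theta, phi):
    # Square root of the determinant of the metric
    # ds^2 = dr^2 + r^2 dtheta^2 + r^2 sin^2(theta) dphi^2.
    return r ** 2 * math.sin(theta)

def position_field(r, theta, phi):
    # Contravariant components (A^r, A^theta, A^phi) of the field A = r e_r.
    return (r, 0.0, 0.0)

def divergence(field, point, h=1e-5):
    # (1 / sqrt(g)) * sum over v of d/dx_v ( sqrt(g) A^v ),
    # the positive-definite analogue of (41.08), by central differences.
    total = 0.0
    for v in range(3):
        plus, minus = list(point), list(point)
        plus[v] += h
        minus[v] -= h
        total += (sqrt_g(*plus) * field(*plus)[v]
                  - sqrt_g(*minus) * field(*minus)[v]) / (2 * h)
    return total / sqrt_g(*point)

div_value = divergence(position_field, (2.0, 0.7, 0.3))
print(div_value)
```

In Cartesian coordinates the same field has components (x, y, z), whose divergence is 3; the curvilinear formula reproduces this value without any explicit use of the Christoffel symbols.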

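If for our vector we take the gradient of some scalar $\varphi$, putting

$$A_\nu = \frac{\partial \varphi}{\partial x_\nu} \tag{41.09}$$

so that

$$A^\nu = g^{\mu\nu}\frac{\partial \varphi}{\partial x_\mu} \tag{41.10}$$

the divergence of this vector is the invariant expression

$$\Box\varphi = \frac{1}{\sqrt{-g}}\frac{\partial}{\partial x_\nu}\left(\sqrt{-g}\, g^{\mu\nu}\frac{\partial\varphi}{\partial x_\mu}\right) \tag{41.11}$$

This is the generalization of the d’Alembertian (21.27).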
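A short numerical experiment shows how an operator of the form (41.11) behaves in curvilinear coordinates. The sketch assumes the Euclidean plane in polar coordinates (r, theta), a diagonal positive-definite metric with $\sqrt{g} = r$ in place of $\sqrt{-g}$, and two hypothetical test functions; none of these choices come from the text.

```python
import math

def sqrt_g(r, theta):
    # Plane polar coordinates: ds^2 = dr^2 + r^2 dtheta^2, so sqrt(g) = r.
    return r

def inverse_metric_diagonal(r, theta):
    # (g^{rr}, g^{theta theta}); the off-diagonal components vanish.
    return (1.0, 1.0 / r ** 2)

def partial(f, point, i, h):
    # Central difference quotient of f with respect to coordinate i.
    plus, minus = list(point), list(point)
    plus[i] += h
    minus[i] -= h
    return (f(*plus) - f(*minus)) / (2 * h)

def laplacian(phi, point, h=1e-4):
    # (1 / sqrt(g)) d/dx_v ( sqrt(g) g^{vv} d(phi)/dx_v ):
    # the analogue of (41.11) for a diagonal metric.
    def flux(i):
        return lambda r, theta: (sqrt_g(r, theta)
                                 * inverse_metric_diagonal(r, theta)[i]
                                 * partial(phi, (r, theta), i, h))
    total = sum(partial(flux(i), point, i, h) for i in range(2))
    return total / sqrt_g(*point)

def harmonic(r, theta):
    # x^2 - y^2 written in polar coordinates.
    return r ** 2 * math.cos(2 * theta)

def radial(r, theta):
    # x^2 + y^2 written in polar coordinates.
    return r ** 2

lap_harmonic = laplacian(harmonic, (1.5, 0.4))
lap_radial = laplacian(radial, (1.5, 0.4))
print(lap_harmonic, lap_radial)
```

The first function is harmonic in Cartesian coordinates and the second has Laplacian 4 there, so the two printed values should be close to 0 and 4 respectively, up to the discretization error of the nested difference quotients.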

The same divergence may also be calculated by a different method. We
form first the covariant derivative of the gradient of $\varphi$. By the general rule
(40.05) we get

$$\varphi_{\mu\nu} = \frac{\partial^2\varphi}{\partial x_\mu\, \partial x_\nu} - \Gamma^\alpha_{\mu\nu}\frac{\partial\varphi}{\partial x_\alpha} \tag{41.12}$$

and the expression is symmetric in $\mu$ and $\nu$. It can be called the covariant
second derivative of the scalar $\varphi$. If we then form the invariant

$$\Box\varphi = g^{\mu\nu}\varphi_{\mu\nu} \tag{41.13}$$

we obtain

$$\Box\varphi = g^{\mu\nu}\frac{\partial^2\varphi}{\partial x_\mu\, \partial x_\nu} - \Gamma^\alpha\frac{\partial\varphi}{\partial x_\alpha} \tag{41.14}$$

where we have put for brevity

$$\Gamma^\alpha = g^{\mu\nu}\, \Gamma^\alpha_{\mu\nu} \tag{41.15}$$

Since both (41.11) and (41.14) give the divergence of one and the same vector
they must be equal. It is immediately obvious that the coefficients of the
second derivatives of $\varphi$ are equal, but equating the coefficients of the first
derivatives leads to the identity

$$\Gamma^\alpha = -\frac{1}{\sqrt{-g}}\frac{\partial}{\partial x_\beta}\left(\sqrt{-g}\, g^{\alpha\beta}\right) \tag{41.16}$$

This identity could also have been verified directly.

We note that in a system of coordinates in which $x_0$, $x_1$, $x_2$ and $x_3$ are four
solutions of the equation $\Box\varphi = 0$, the $\Gamma^\alpha$ will vanish. Coordinates of this kind,
which in addition satisfy certain conditions at infinity to be discussed in a
later chapter, are called harmonic. In ordinary Relativity Theory cartesian
spatial coordinates together with time are harmonic.

If a vector $A_\nu$ is the gradient of some scalar the difference of covariant
derivatives $\nabla_\mu A_\nu - \nabla_\nu A_\mu$ vanishes. In general this difference does not vanish
and can be considered to be the four-dimensional generalization of the curl
or vorticity of the vector field. We put†

$$(\operatorname{Rot} A)_{\mu\nu} = \nabla_\mu A_\nu - \nabla_\nu A_\mu \tag{41.17}$$

By its definition this is an antisymmetric covariant tensor of second rank.
Using equation (40.05) we can easily see that in the difference (41.17) the terms
characteristic of covariant differentiation cancel and we have

$$(\operatorname{Rot} A)_{\mu\nu} = \frac{\partial A_\nu}{\partial x_\mu} - \frac{\partial A_\mu}{\partial x_\nu} \tag{41.18}$$

which is the same as the usual expression for vorticity. The latter is thus valid
in arbitrary coordinates.

† The symbol Rot will be used for the four-dimensional vorticity; the symbol curl is
retained for the three-dimensional case.

Let us now write $A_{\mu\nu}$ for the symmetric part of the covariant derivative of
the vector $A_\nu$. We have

$$A_{\mu\nu} = \frac12\left(\nabla_\mu A_\nu + \nabla_\nu A_\mu\right) \tag{41.19}$$

or, using (40.05)

$$A_{\mu\nu} = \frac12\left(\frac{\partial A_\nu}{\partial x_\mu} + \frac{\partial A_\mu}{\partial x_\nu}\right) - \Gamma^\alpha_{\mu\nu} A_\alpha \tag{41.20}$$

With the aid of the symmetric tensor $A_{\mu\nu}$ we can define the covariant second
derivative of the vector $A_\sigma$ by

$$\nabla_{\mu\nu} A_\sigma = \nabla_\mu A_{\nu\sigma} + \nabla_\nu A_{\mu\sigma} - \nabla_\sigma A_{\mu\nu} \tag{41.21}$$

Using (40.11) to express the derivatives on the right-hand side we get


8x 8x 2r ^ A e° + dx J 




Hv pa 1 y„ ' Hv p/ 

VJ, a v/* v 

(41.22) 

Similarly the covariant second derivative of a contravariant vector is given by 

„ 8A a „ 8A * „ 8A P SFp, , 

+ + r ?« TFT - — + -d 2 A * («-23) 


, d A c 

v A p = -— -2- 
■“ 8x„ dx. 


dx .. ga; ** 0 x 1 0 r 

[i. a a 


We now consider the divergence of a second rank tensor, which we shall
write in contravariant form. According to (40.12) we have

$$\nabla_\nu T^{\mu\nu} = \frac{\partial T^{\mu\nu}}{\partial x_\nu} + \Gamma^\mu_{\rho\nu}\, T^{\rho\nu} + \Gamma^\nu_{\rho\nu}\, T^{\mu\rho} \tag{41.24}$$

Transforming the third term with the aid of (41.07) we can write

$$\nabla_\nu T^{\mu\nu} = \frac{1}{\sqrt{-g}}\frac{\partial}{\partial x_\nu}\left\{\sqrt{-g}\, T^{\mu\nu}\right\} + \Gamma^\mu_{\rho\nu}\, T^{\rho\nu} \tag{41.25}$$

Putting $T^{\mu\nu} = g^{\mu\nu}$ and remembering that the covariant derivatives of the metric
tensor vanish we again get the identity (41.16). If $T^{\mu\nu}$ is antisymmetric the
last term in (41.25) vanishes, so that the divergence of an antisymmetric tensor
is like the divergence of a vector in being a sum of derivative terms. For a
general tensor, on the other hand, such a representation does not exist.

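We now write down the expression for a covariant derivative of a covariant
second rank tensor together with two expressions obtained from the first one
by cyclic permutation of suffixes:

$$\begin{aligned}
\nabla_\sigma F_{\mu\nu} &= \frac{\partial F_{\mu\nu}}{\partial x_\sigma} - \Gamma^\rho_{\mu\sigma} F_{\rho\nu} - \Gamma^\rho_{\nu\sigma} F_{\mu\rho} \\
\nabla_\mu F_{\nu\sigma} &= \frac{\partial F_{\nu\sigma}}{\partial x_\mu} - \Gamma^\rho_{\nu\mu} F_{\rho\sigma} - \Gamma^\rho_{\sigma\mu} F_{\nu\rho} \\
\nabla_\nu F_{\sigma\mu} &= \frac{\partial F_{\sigma\mu}}{\partial x_\nu} - \Gamma^\rho_{\sigma\nu} F_{\rho\mu} - \Gamma^\rho_{\mu\nu} F_{\sigma\rho}
\end{aligned} \tag{41.26}$$

Assuming that $F_{\mu\nu}$ is antisymmetric, i.e.

$$F_{\mu\nu} = -F_{\nu\mu} \tag{41.27}$$

we find that

$$\nabla_\sigma F_{\mu\nu} + \nabla_\mu F_{\nu\sigma} + \nabla_\nu F_{\sigma\mu} = \frac{\partial F_{\mu\nu}}{\partial x_\sigma} + \frac{\partial F_{\nu\sigma}}{\partial x_\mu} + \frac{\partial F_{\sigma\mu}}{\partial x_\nu} \tag{41.28}$$

because the terms not involving derivatives cancel in pairs. The expression

$$F_{\mu\nu\sigma} = \frac{\partial F_{\mu\nu}}{\partial x_\sigma} + \frac{\partial F_{\nu\sigma}}{\partial x_\mu} + \frac{\partial F_{\sigma\mu}}{\partial x_\nu} \tag{41.29}$$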


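which is antisymmetric in all three suffixes is therefore a tensor of third rank
which may be called the cyclic derivative of the antisymmetric tensor $F_{\mu\nu}$.
We have previously encountered such an expression in Section 24.

The cyclic derivative of any antisymmetric tensor is related to the divergence
of the antisymmetric pseudo-tensor dual to it. As in (37.67) we introduce the
dual pseudo-tensor by the relation

$$\overset{*}{F}{}^{\alpha\beta} = \frac12\, E^{\alpha\beta\gamma\delta} F_{\gamma\delta} \tag{41.30}$$

so that

$$\begin{aligned}
\sqrt{-g}\,\overset{*}{F}{}^{10} &= F_{23}; & \sqrt{-g}\,\overset{*}{F}{}^{20} &= F_{31}; & \sqrt{-g}\,\overset{*}{F}{}^{30} &= F_{12} \\
\sqrt{-g}\,\overset{*}{F}{}^{23} &= F_{10}; & \sqrt{-g}\,\overset{*}{F}{}^{31} &= F_{20}; & \sqrt{-g}\,\overset{*}{F}{}^{12} &= F_{30}
\end{aligned} \tag{41.31}$$

We also introduce the pseudo-vector dual to $F_{\mu\nu\sigma}$ as in (37.68):

$$\overset{*}{F}{}^{\rho} = \frac16\, E^{\rho\mu\nu\sigma} F_{\mu\nu\sigma} \tag{41.32}$$

so that

$$\sqrt{-g}\,\overset{*}{F}{}^{0} = -F_{123}; \quad \sqrt{-g}\,\overset{*}{F}{}^{1} = F_{230}; \quad \sqrt{-g}\,\overset{*}{F}{}^{2} = F_{310}; \quad \sqrt{-g}\,\overset{*}{F}{}^{3} = F_{120} \tag{41.33}$$

If $F_{\mu\nu\sigma}$ is the cyclic derivative of $F_{\mu\nu}$ we have

$$\frac{1}{\sqrt{-g}}\frac{\partial\left\{\sqrt{-g}\,\overset{*}{F}{}^{\mu\nu}\right\}}{\partial x_\nu} = \overset{*}{F}{}^{\mu} \tag{41.34}$$

i.e. the pseudo-vector $\overset{*}{F}{}^{\mu}$ is the divergence of the pseudo-tensor $\overset{*}{F}{}^{\mu\nu}$.

The same result is easily obtained without considering individual components
by using the fact that the covariant derivatives of $E^{\alpha\beta\gamma\delta}$ and also of
$E_{\alpha\beta\gamma\delta}$, calculated by the general rule, vanish identically.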
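The component relations (41.31) are a matter of permutation signs and can be checked mechanically. The sketch below assumes that the completely antisymmetric symbol entering $E^{\alpha\beta\gamma\delta}$ is normalized so that its 0123 component equals -1 after multiplication by $\sqrt{-g}$; this normalization is inferred from the signs in (41.31) and (41.33) and is not taken from (37.67) itself. The sample tensor and all function names are hypothetical.

```python
def parity(indices):
    # +1 for an even permutation of (0, 1, 2, 3), -1 for an odd one.
    perm = list(indices)
    sign = 1
    for i in range(len(perm)):
        while perm[i] != i:
            j = perm[i]
            perm[i], perm[j] = perm[j], perm[i]
            sign = -sign
    return sign

def eps_upper(a, b, c, d):
    # sqrt(-g) * E^{abcd} under the assumed convention eps^{0123} = -1.
    indices = (a, b, c, d)
    if len(set(indices)) < 4:
        return 0
    return -parity(indices)

def make_antisymmetric():
    # Arbitrary integer components with F_{ab} = -F_{ba}, for illustration.
    F = [[0] * 4 for _ in range(4)]
    for a in range(4):
        for b in range(a + 1, 4):
            F[a][b] = 10 * a + b + 1
            F[b][a] = -F[a][b]
    return F

def dual(mu, nu, F):
    # sqrt(-g) * F*^{mu nu} = (1/2) eps^{mu nu a b} F_{ab}, as in (41.30).
    # Each index pair is counted twice, so the sum is always even.
    total = sum(eps_upper(mu, nu, a, b) * F[a][b]
                for a in range(4) for b in range(4))
    return total // 2

F = make_antisymmetric()
table = {(1, 0): F[2][3], (2, 0): F[3][1], (3, 0): F[1][2],
         (2, 3): F[1][0], (3, 1): F[2][0], (1, 2): F[3][0]}
matches = all(dual(mu, nu, F) == value for (mu, nu), value in table.items())
print(matches)
```

With the opposite normalization of the antisymmetric symbol every entry of the table would change sign, which is why the convention has to be stated explicitly.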

42. The Transformation Law for Christoffel Symbols and the Locally
Geodesic Coordinate System. Conditions for transforming $ds^2$ to
a Form with Constant Coefficients

Covariant, or tensorial, derivatives differ from ordinary derivatives by
terms involving the Christoffel symbols

$$\Gamma^\nu_{\alpha\beta} = \frac12\, g^{\mu\nu}\left(\frac{\partial g_{\alpha\mu}}{\partial x_\beta} + \frac{\partial g_{\beta\mu}}{\partial x_\alpha} - \frac{\partial g_{\alpha\beta}}{\partial x_\mu}\right) \tag{42.01}$$

If at some point $x_\rho = x_\rho^0$ all the Christoffel symbols vanish, the expressions for
tensorial and ordinary derivatives are the same. We shall show that in the
neighbourhood of any point it is possible to introduce a system of coordinates
such that all the $\Gamma^\nu_{\alpha\beta}$ vanish at the point. Then, by (40.14) and (40.16) all
derivatives of the metric tensor with respect to coordinates will vanish at the
point in question.

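We first establish the law according to which the Christoffel symbols transform
when one passes from the original system of coordinates $(x_0, x_1, x_2, x_3)$
to some new system $(x'_0, x'_1, x'_2, x'_3)$. This law could be derived directly from the
definition (42.01) of the $\Gamma^\nu_{\alpha\beta}$ using also the transformation law for the metric
tensor. It is, however, simpler to argue as follows. We know that the quantities

$$\frac{\partial^2\varphi}{\partial x_\mu\, \partial x_\nu} - \Gamma^\rho_{\mu\nu}\frac{\partial\varphi}{\partial x_\rho} \tag{42.02}$$

represent a tensor. This means that for any function $\varphi$ and any coordinate
transformation the equations

$$\frac{\partial^2\varphi}{\partial x_\mu\, \partial x_\nu} - \Gamma^\rho_{\mu\nu}\frac{\partial\varphi}{\partial x_\rho} = \left(\frac{\partial^2\varphi}{\partial x'_\alpha\, \partial x'_\beta} - (\Gamma^\sigma_{\alpha\beta})'\frac{\partial\varphi}{\partial x'_\sigma}\right)\frac{\partial x'_\alpha}{\partial x_\mu}\frac{\partial x'_\beta}{\partial x_\nu} \tag{42.03}$$

are valid where the $(\Gamma^\sigma_{\alpha\beta})'$ are Christoffel symbols in the primed coordinate
system. Putting $\varphi = x'_\sigma$ we get

$$\frac{\partial^2 x'_\sigma}{\partial x_\mu\, \partial x_\nu} - \Gamma^\rho_{\mu\nu}\frac{\partial x'_\sigma}{\partial x_\rho} = -(\Gamma^\sigma_{\alpha\beta})'\,\frac{\partial x'_\alpha}{\partial x_\mu}\frac{\partial x'_\beta}{\partial x_\nu} \tag{42.04}$$

This is the required transformation law. The presence of the term involving
the second derivative shows that $\Gamma^\rho_{\mu\nu}$ is not a tensor. If the transformation
under consideration is linear the term mentioned is absent and, therefore, for
linear transformations the $\Gamma^\rho_{\mu\nu}$ behave like a tensor.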

Let the values of the $\Gamma^\rho_{\mu\nu}$ at the given point be $(\Gamma^\rho_{\mu\nu})_0$. The primed quantities $(\Gamma^\sigma_{\alpha\beta})'$
will vanish at the point if the transformation of coordinates satisfies the relations

$$\left(\frac{\partial^2 x'_\sigma}{\partial x_\mu\, \partial x_\nu}\right)_0 = (\Gamma^\rho_{\mu\nu})_0 \left(\frac{\partial x'_\sigma}{\partial x_\rho}\right)_0 \tag{42.05}$$

These relations will be satisfied if we put

$$x'_\sigma = x_\sigma - x_\sigma^0 + \frac12\, (\Gamma^\sigma_{\mu\nu})_0\, (x_\mu - x_\mu^0)(x_\nu - x_\nu^0) \tag{42.06}$$

For this transformation

$$\left(\frac{\partial x'_\sigma}{\partial x_\rho}\right)_0 = \delta^\sigma_\rho \tag{42.07}$$

and, therefore, the values at the point of the components of any tensor will be
the same in the primed and the unprimed system. In particular there is no
change in the components of the metric tensor; the derivatives of this tensor
on the other hand all transform to zero. This fact may be utilized to simplify
calculations involving tensors, for it follows that if some quantity is known
to be a tensor and also known to vanish if $\partial g_{\mu\nu} / \partial x_\alpha = 0$ the quantity must
also vanish without that condition.

It can be proved that with a suitable choice of coordinate system the derivatives
of the $g_{\mu\nu}$ may be made to vanish not only at one point, but even along
any given line [14].

A coordinate system in which the derivatives of the $g_{\mu\nu}$ vanish at a given
point is called locally geodesic. This name is justified by the fact that in such
a system the equations of a geodesic reduce at the point in question to the
statement that the second derivatives of the coordinates with respect to the
parameter p vanish. (In the vicinity of the point these derivatives will be
first order infinitesimals.) Hence the coordinates there will be linear functions
of p up to terms of third order.

We now ask the question: What is the condition for the existence of a coordinate
system $(x'_0, x'_1, x'_2, x'_3)$ in which the Christoffel symbols vanish not only
at the given point or along some line, but even in some finite region?

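If such a coordinate system exists the equations

$$\frac{\partial^2\varphi}{\partial x_\mu\, \partial x_\nu} - \Gamma^\rho_{\mu\nu}\frac{\partial\varphi}{\partial x_\rho} = 0 \tag{42.08}$$

must have a solution, because they are satisfied by the functions

$$\varphi = x'_0; \quad \varphi = x'_1; \quad \varphi = x'_2; \quad \varphi = x'_3 \tag{42.09}$$

For all the equations (42.08) to be compatible it is obviously necessary that
expressions for the same third derivative calculated from one or other of the
equations must coincide. We have

$$\begin{aligned}
\frac{\partial}{\partial x_\alpha}\left(\frac{\partial^2\varphi}{\partial x_\mu\, \partial x_\nu}\right) &= \frac{\partial}{\partial x_\alpha}\left(\Gamma^\rho_{\mu\nu}\frac{\partial\varphi}{\partial x_\rho}\right) \\
\frac{\partial}{\partial x_\nu}\left(\frac{\partial^2\varphi}{\partial x_\mu\, \partial x_\alpha}\right) &= \frac{\partial}{\partial x_\nu}\left(\Gamma^\rho_{\mu\alpha}\frac{\partial\varphi}{\partial x_\rho}\right)
\end{aligned} \tag{42.10}$$

Here the left-hand sides are equal, therefore also the right-hand sides. We
equate the latter, perform the differentiations and express the second derivatives
in terms of the first, getting

$$\left(\frac{\partial\Gamma^\rho_{\mu\nu}}{\partial x_\alpha} - \frac{\partial\Gamma^\rho_{\mu\alpha}}{\partial x_\nu} + \Gamma^\sigma_{\mu\nu}\Gamma^\rho_{\sigma\alpha} - \Gamma^\sigma_{\mu\alpha}\Gamma^\rho_{\sigma\nu}\right)\frac{\partial\varphi}{\partial x_\rho} = 0 \tag{42.11}$$

These equations must be valid for $\varphi = x'_0$, $\varphi = x'_1$, $\varphi = x'_2$ and $\varphi = x'_3$. Since
the determinant

$$\frac{D(x'_0, x'_1, x'_2, x'_3)}{D(x_0, x_1, x_2, x_3)} \tag{42.12}$$

is not zero all the coefficients of the $\partial\varphi / \partial x_\rho$ in (42.11) must vanish. These
coefficients are

$$R^\rho_{\mu,\nu\alpha} = \frac{\partial\Gamma^\rho_{\mu\nu}}{\partial x_\alpha} - \frac{\partial\Gamma^\rho_{\mu\alpha}}{\partial x_\nu} + \Gamma^\sigma_{\mu\nu}\Gamma^\rho_{\sigma\alpha} - \Gamma^\sigma_{\mu\alpha}\Gamma^\rho_{\sigma\nu} \tag{42.13}$$

We now prove that the conditions

$$R^\rho_{\mu,\nu\alpha} = 0 \tag{42.14}$$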

are not only necessary but also sufficient to ensure that the system of equations
(42.08) has a solution. To do this we put

$$\varphi_\nu = \frac{\partial\varphi}{\partial x_\nu} \tag{42.15}$$

and write (42.08) as

$$\frac{\partial\varphi_\nu}{\partial x_\alpha} = \Gamma^\rho_{\alpha\nu}\, \varphi_\rho \tag{42.16}$$

Let the values of the $\varphi_\nu$ be given at some point which has the coordinates $x_\alpha^0$.
To get the values for an arbitrary point $x_\alpha$ we join the two points by some curve

$$x_\alpha = \xi^\alpha(p) \tag{42.17}$$

where p is a parameter, and we consider $\varphi_\nu$, and also $\Gamma^\rho_{\alpha\nu}$, as functions of p. The
$\varphi_\nu$ are then determined by the system of ordinary differential equations

$$\frac{d\varphi_\nu}{dp} = \Gamma^\rho_{\alpha\nu}\, \varphi_\rho\, \dot\xi^\alpha \tag{42.18}$$

where the dot denotes differentiation with respect to the parameter p.

This system uniquely determines the values of $\varphi_\nu$ at the end point of the
curve. It remains to show that the values of $\varphi_\nu$ so found do not depend on the
shape of the curve joining the initial and final points. To do this we consider
an infinitesimally differing curve joining the two points:

$$x_\alpha = \xi^\alpha(p) + \delta\xi^\alpha(p) \tag{42.19}$$

Here $\delta\xi^\alpha$ is an infinitesimal vector which becomes zero at the initial and final
points. We denote the values of $\varphi_\nu$ on the varied curve by $\varphi_\nu + \delta\varphi_\nu$.

The equations for $\varphi_\nu + \delta\varphi_\nu$ will evidently be of the form

$$\frac{d}{dp}(\varphi_\nu + \delta\varphi_\nu) = \left(\Gamma^\rho_{\alpha\nu} + \frac{\partial\Gamma^\rho_{\alpha\nu}}{\partial x_\beta}\,\delta\xi^\beta\right)(\varphi_\rho + \delta\varphi_\rho)(\dot\xi^\alpha + \delta\dot\xi^\alpha) \tag{42.20}$$

Subtracting (42.18) from this and neglecting higher order infinitesimals, we
get

$$\frac{d}{dp}(\delta\varphi_\nu) = \Gamma^\rho_{\alpha\nu}\,\delta\varphi_\rho\,\dot\xi^\alpha + \frac{\partial\Gamma^\rho_{\alpha\nu}}{\partial x_\beta}\,\varphi_\rho\,\dot\xi^\alpha\,\delta\xi^\beta + \Gamma^\rho_{\alpha\nu}\,\varphi_\rho\,\delta\dot\xi^\alpha \tag{42.21}$$

Using (42.18) once again and introducing the notation of (42.13) we can put
this into the form

$$\frac{d}{dp}\left(\delta\varphi_\nu - \Gamma^\rho_{\alpha\nu}\,\varphi_\rho\,\delta\xi^\alpha\right) = R^\rho_{\nu,\alpha\beta}\,\varphi_\rho\,\dot\xi^\alpha\,\delta\xi^\beta + \Gamma^\rho_{\alpha\nu}\,\dot\xi^\alpha\left(\delta\varphi_\rho - \Gamma^\sigma_{\beta\rho}\,\varphi_\sigma\,\delta\xi^\beta\right) \tag{42.22}$$

For brevity we write

$$\eta_\nu = \delta\varphi_\nu - \Gamma^\rho_{\alpha\nu}\,\varphi_\rho\,\delta\xi^\alpha \tag{42.23}$$

Then, if (42.14) holds, (42.22) takes the form

$$\frac{d\eta_\nu}{dp} = \Gamma^\rho_{\alpha\nu}\,\eta_\rho\,\dot\xi^\alpha \tag{42.24}$$

Thus the equations for $\eta_\nu$ have the same appearance as those for $\varphi_\nu$; both
represent equations of parallel transport of a vector along the chosen curve.
The $\eta_\nu$, just as the $\varphi_\nu$, are uniquely determined by the initial conditions. But
the initial conditions for the $\eta_\nu$ are simply that they vanish. For the coordinates
of the initial point are fixed, so that there $\delta\xi^\alpha = 0$, and the value of $\varphi_\nu$ at this
point is also fixed, giving $\delta\varphi_\nu = 0$. Hence $\eta_\nu = 0$ at the initial point; with
this condition it follows that along the whole curve

$$\eta_\nu = 0 \tag{42.25}$$

and therefore

$$\delta\varphi_\nu = \Gamma^\rho_{\alpha\nu}\,\varphi_\rho\,\delta\xi^\alpha \tag{42.26}$$

We now consider the final point of the curve. Its coordinates are also fixed,
giving $\delta\xi^\alpha = 0$; consequently, by (42.26) we must also have $\delta\varphi_\nu = 0$. This
means that the function takes on the same value at the final point whether it is
calculated using the initial curve or the infinitesimally varied one. Deforming
the curve continuously we shall then also obtain the same result for any two
curves joining the same two points, not only for curves differing infinitesimally.
It follows that in any simply connected region the $\varphi_\nu$ are single valued point
functions determined by their values at an arbitrary initial point.

By virtue of the differential equations (42.16) we have

$$\frac{\partial\varphi_\mu}{\partial x_\nu} = \frac{\partial\varphi_\nu}{\partial x_\mu} \tag{42.27}$$

because the $\Gamma^\rho_{\mu\nu}$ are symmetric in their lower indices. It follows that the
expression

$$d\varphi = \varphi_\nu\, dx_\nu \tag{42.28}$$

is a total differential. Integrating and fixing the constant by the value of $\varphi$
at the initial point completes the determination of $\varphi$.

We have proved that the necessary and sufficient condition for a solution of
equation (42.08) to exist is the vanishing of (42.13), and that $\varphi$ is determined
apart from an additive constant by the values at one point of its partial
derivatives with respect to the coordinates.

If two, not necessarily different, solutions $\varphi$ and $\psi$ of equations (42.08) are
given, the expression

$$g^{\mu\nu}\,\frac{\partial\varphi}{\partial x_\mu}\frac{\partial\psi}{\partial x_\nu} \tag{42.29}$$

remains constant as a consequence of these equations. To prove this it is
sufficient to form the covariant derivative of the scalar (42.29) and to verify
that it vanishes. (As we saw, (42.16) is just an expression of the fact that the
covariant derivatives of the $\varphi_\nu$ vanish.)

We denote by $x'_0$, $x'_1$, $x'_2$ and $x'_3$ four solutions of (42.08) which are so chosen
that at the initial point one has

$$g^{\mu\nu}\,\frac{\partial x'_\alpha}{\partial x_\mu}\frac{\partial x'_\beta}{\partial x_\nu} = e_\alpha\, \delta_{\alpha\beta} \tag{42.30}$$

(no summation over $\alpha$). Then (42.30) will also be true for all values of the
coordinates. Hence it follows by purely algebraic reasoning that

$$g_{\mu\nu} = \sum_{\alpha} e_\alpha\, \frac{\partial x'_\alpha}{\partial x_\mu}\frac{\partial x'_\alpha}{\partial x_\nu} \tag{42.31}$$

i.e. that $g_{\mu\nu}$ can be represented in the form (35.15) and that therefore $ds^2$ can
be brought to the form (35.09).

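Thus the necessary and sufficient condition for the quadratic form

$$ds^2 = g_{\alpha\beta}\, dx_\alpha\, dx_\beta \tag{42.32}$$

to be reducible to

$$ds^2 = (dx'_0)^2 - (dx'_1)^2 - (dx'_2)^2 - (dx'_3)^2 \tag{42.33}$$

is the vanishing of the expression

$$R^\rho_{\mu,\nu\alpha} = \frac{\partial\Gamma^\rho_{\mu\nu}}{\partial x_\alpha} - \frac{\partial\Gamma^\rho_{\mu\alpha}}{\partial x_\nu} + \Gamma^\sigma_{\mu\nu}\Gamma^\rho_{\sigma\alpha} - \Gamma^\sigma_{\mu\alpha}\Gamma^\rho_{\sigma\nu} \tag{42.13}$$

formed out of the Christoffel symbols belonging to the quadratic form (42.32).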
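The criterion can be illustrated on two familiar two-dimensional metrics. The Python sketch below evaluates the expression (42.13) by central differences from Christoffel symbols that are entered by hand: those of the Euclidean plane in polar coordinates, which is reducible to a form with constant coefficients, and those of the unit sphere, which is not. Both examples are positive definite, unlike the form (42.33), and they, the step size and the function names are assumptions made purely for illustration.

```python
import math

def zero_symbols():
    # G[r][m][v] will hold Gamma^r_{mv} for a two-dimensional space.
    return [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]

def gamma_polar(x):
    # Plane polar coordinates x = (r, theta): ds^2 = dr^2 + r^2 dtheta^2.
    r = x[0]
    G = zero_symbols()
    G[0][1][1] = -r
    G[1][0][1] = G[1][1][0] = 1.0 / r
    return G

def gamma_sphere(x):
    # Unit sphere x = (theta, phi): ds^2 = dtheta^2 + sin^2(theta) dphi^2.
    theta = x[0]
    G = zero_symbols()
    G[0][1][1] = -math.sin(theta) * math.cos(theta)
    G[1][0][1] = G[1][1][0] = math.cos(theta) / math.sin(theta)
    return G

def curvature(gamma, x, h=1e-5):
    # R[r][m][v][a] = d_a Gamma^r_{mv} - d_v Gamma^r_{ma}
    #                 + Gamma^s_{mv} Gamma^r_{sa} - Gamma^s_{ma} Gamma^r_{sv},
    # following the index arrangement of (42.13).
    n = len(x)
    G = gamma(x)
    dG = []
    for a in range(n):
        xp, xm = list(x), list(x)
        xp[a] += h
        xm[a] -= h
        Gp, Gm = gamma(xp), gamma(xm)
        dG.append([[[(Gp[r][m][v] - Gm[r][m][v]) / (2 * h) for v in range(n)]
                    for m in range(n)] for r in range(n)])
    return [[[[dG[a][r][m][v] - dG[v][r][m][a]
               + sum(G[s][m][v] * G[r][s][a] - G[s][m][a] * G[r][s][v]
                     for s in range(n))
               for a in range(n)] for v in range(n)]
             for m in range(n)] for r in range(n)]

def largest_component(R):
    return max(abs(c) for block in R for plane in block
               for row in plane for c in row)

flat_max = largest_component(curvature(gamma_polar, [1.3, 0.4]))
sphere_R = curvature(gamma_sphere, [1.0, 0.2])
print(flat_max, sphere_R[0][1][1][0])
```

For the polar plane every component vanishes to within the discretization error, in agreement with the fact that Cartesian coordinates exist. For the sphere the component with upper index theta and lower indices (phi, phi, theta) equals $\sin^2\theta$, so no coordinate system with constant metric coefficients can exist there.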


43. The Curvature Tensor 

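In the foregoing section we introduced the quantity

$$R^\rho_{\mu,\nu\alpha} = \frac{\partial\Gamma^\rho_{\mu\nu}}{\partial x_\alpha} - \frac{\partial\Gamma^\rho_{\mu\alpha}}{\partial x_\nu} + \Gamma^\sigma_{\mu\nu}\Gamma^\rho_{\sigma\alpha} - \Gamma^\sigma_{\mu\alpha}\Gamma^\rho_{\sigma\nu} \tag{43.01}$$

This quantity plays an important role in general tensor analysis and in the
theory of gravitation. It is thus necessary to investigate its properties in
detail. We first prove that it is a tensor. This can be done in different ways.
The most direct method of proof consists in differentiating the equations

$$\frac{\partial^2 x'_\sigma}{\partial x_\mu\, \partial x_\nu} - \Gamma^\rho_{\mu\nu}\frac{\partial x'_\sigma}{\partial x_\rho} = -(\Gamma^\sigma_{\alpha\beta})'\,\frac{\partial x'_\alpha}{\partial x_\mu}\frac{\partial x'_\beta}{\partial x_\nu} \tag{43.02}$$

which state the transformation law for Christoffel symbols as it was given
in (42.04). The result of differentiating (43.02) with respect to $x_\lambda$ can be written as

$$\begin{aligned}
&\frac{\partial^3 x'_\sigma}{\partial x_\lambda\, \partial x_\mu\, \partial x_\nu} + (\Gamma^\sigma_{\alpha\beta})'\left(\frac{\partial x'_\alpha}{\partial x_\lambda}\frac{\partial^2 x'_\beta}{\partial x_\mu\, \partial x_\nu} + \frac{\partial x'_\alpha}{\partial x_\mu}\frac{\partial^2 x'_\beta}{\partial x_\nu\, \partial x_\lambda} + \frac{\partial x'_\alpha}{\partial x_\nu}\frac{\partial^2 x'_\beta}{\partial x_\lambda\, \partial x_\mu}\right) \\
&\quad = \left(\frac{\partial\Gamma^\rho_{\mu\nu}}{\partial x_\lambda} + \Gamma^\alpha_{\mu\nu}\Gamma^\rho_{\alpha\lambda}\right)\frac{\partial x'_\sigma}{\partial x_\rho} - \left(\frac{\partial(\Gamma^\sigma_{\alpha\beta})'}{\partial x'_\gamma} + (\Gamma^\rho_{\alpha\beta})'\,(\Gamma^\sigma_{\rho\gamma})'\right)\frac{\partial x'_\gamma}{\partial x_\lambda}\frac{\partial x'_\alpha}{\partial x_\mu}\frac{\partial x'_\beta}{\partial x_\nu}
\end{aligned} \tag{43.03}$$

Here the left-hand side is symmetric in $\lambda$, $\mu$ and $\nu$ and therefore the right-hand
side must also have this symmetry. Interchanging $\lambda$ and $\nu$ on the right and
equating the expression obtained to the original right-hand side, we get an
equation which, in terms of (43.01), can be written as

$$R^\rho_{\mu,\nu\lambda}\,\frac{\partial x'_\sigma}{\partial x_\rho} = (R^\sigma_{\alpha,\beta\gamma})'\,\frac{\partial x'_\alpha}{\partial x_\mu}\frac{\partial x'_\beta}{\partial x_\nu}\frac{\partial x'_\gamma}{\partial x_\lambda} \tag{43.04}$$

This equation indeed expresses the fact that $R^\rho_{\mu,\nu\lambda}$ is a fourth rank tensor
which is covariant in $\mu$, $\nu$ and $\lambda$ and contravariant in $\rho$. This tensor is called
the curvature tensor.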

With the aid of the curvature tensor one can give an expression for the
change of a vector in parallel transport around an infinitesimal closed contour.
Let the value of the vector $A_\rho$ at the starting point $x_\alpha^0$ be $(A_\rho)_0$. If it is
transported to an adjacent point its components will take on the values

$$A_\rho = (A_\rho)_0 + (\Gamma^\sigma_{\rho\alpha})_0\, (A_\sigma)_0\, (x_\alpha - x_\alpha^0) \tag{43.05}$$

apart from quantities of second order of smallness† in the $x_\alpha - x_\alpha^0$. It is thus
obvious that the changes $\Delta A_\mu$ undergone by the vector components $A_\mu$ when
going round an infinitesimal closed contour will be at least of second order
in the greatest displacements occurring. These changes can be expressed as
the line integral

$$\Delta A_\mu = \oint \Gamma^\rho_{\mu\nu}\, A_\rho\, dx_\nu \tag{43.06}$$

taken around the contour in question. For an infinitesimal contour the $A_\rho$
under the integral may be replaced by (43.05) and the $\Gamma^\rho_{\mu\nu}$ by

$$\Gamma^\rho_{\mu\nu} = (\Gamma^\rho_{\mu\nu})_0 + \left(\frac{\partial\Gamma^\rho_{\mu\nu}}{\partial x_\alpha}\right)_0 (x_\alpha - x_\alpha^0) \tag{43.07}$$

† Such second order quantities will depend on the nature of the curve along which the
displacement is made, because the vector $A_\rho$ is not considered as point function.

Inserting (43.05) and (43.07) into (43.06) and using the fact that the integral
of a total differential around a closed contour vanishes, we get

$$\Delta A_\mu = \left(\frac{\partial\Gamma^\rho_{\mu\nu}}{\partial x_\alpha} + \Gamma^\sigma_{\mu\nu}\Gamma^\rho_{\sigma\alpha}\right)_0 (A_\rho)_0 \oint (x_\alpha - x_\alpha^0)\, dx_\nu \tag{43.08}$$

The quantities

$$Q^{\nu\alpha} = \oint (x_\alpha - x_\alpha^0)\, dx_\nu = \frac12 \oint \left[(x_\alpha - x_\alpha^0)\, dx_\nu - (x_\nu - x_\nu^0)\, dx_\alpha\right] \tag{43.09}$$

can be interpreted as the projections of the surface enclosed by the contour
on the coordinate “planes”. It is easy to verify that they form a contravariant
antisymmetric tensor. Owing to this antisymmetry equation (43.08)
can be written as

$$\Delta A_\mu = \frac12\, R^\rho_{\mu,\nu\alpha}\, A_\rho\, Q^{\nu\alpha} \tag{43.10}$$


Here we have suppressed the suffix 0 which characterizes the initial point.

By similar arguments the change in the contravariant components of a vector
can be shown to be given by

$$\Delta A^\rho = -\frac12\, R^\rho_{\mu,\nu\alpha}\, A^\mu\, Q^{\nu\alpha} \tag{43.11}$$

We already know that $R^\rho_{\mu,\nu\alpha}$ is a tensor; this can also be seen from (43.10)
and (43.11), for $\Delta A_\mu$ is the difference of two vectors referring to the same
point and is thus itself a vector. On the other hand, if an arbitrary symmetric
part is added to the antisymmetric tensor $Q^{\nu\alpha}$ equation (43.10) does not change.
Therefore the right-hand side of (43.10) is a vector for any tensor $Q^{\nu\alpha}$ and any
vector $A_\rho$. This is possible only if $R^\rho_{\mu,\nu\alpha}$ is a tensor.
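The relation (43.10) lends itself to a direct numerical test. The sketch below is a hedged illustration, not part of the original argument: it assumes the unit sphere with coordinates (theta, phi), transports a covariant vector around a small coordinate rectangle by integrating $dA_\mu = \Gamma^\rho_{\mu\nu} A_\rho\, dx_\nu$ with a Runge-Kutta scheme, and compares the result with the prediction of (43.10). For this rectangle $Q^{\phi\theta} = -Q^{\theta\phi}$ equals the coordinate area, and the two curvature components used in the prediction were evaluated by hand from (43.01). All names and step counts are hypothetical.

```python
import math

def gamma(theta):
    # G[r][m][v] = Gamma^r_{mv} on the unit sphere, with x_0 = theta, x_1 = phi.
    G = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
    G[0][1][1] = -math.sin(theta) * math.cos(theta)
    G[1][0][1] = G[1][1][0] = math.cos(theta) / math.sin(theta)
    return G

def rate(x, A, v):
    # dA_m/dp = Gamma^r_{mn} A_r dx_n/dp for a covariant vector.
    G = gamma(x[0])
    return [sum(G[r][m][n] * A[r] * v[n] for r in range(2) for n in range(2))
            for m in range(2)]

def transport(start, end, A, steps=200):
    # Classical fourth-order Runge-Kutta along a straight coordinate segment.
    v = [end[i] - start[i] for i in range(2)]
    dp = 1.0 / steps

    def pos(p):
        return [start[i] + p * v[i] for i in range(2)]

    def shifted(vec, k, c):
        return [vec[i] + c * k[i] for i in range(2)]

    for step in range(steps):
        p = step * dp
        k1 = rate(pos(p), A, v)
        k2 = rate(pos(p + dp / 2), shifted(A, k1, dp / 2), v)
        k3 = rate(pos(p + dp / 2), shifted(A, k2, dp / 2), v)
        k4 = rate(pos(p + dp), shifted(A, k3, dp), v)
        A = [A[i] + dp / 6 * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i])
             for i in range(2)]
    return A

def loop_change(theta0, phi0, a, b, A0):
    # Rectangle traversed as theta -> theta + a, phi -> phi + b, and back.
    corners = [(theta0, phi0), (theta0 + a, phi0), (theta0 + a, phi0 + b),
               (theta0, phi0 + b), (theta0, phi0)]
    A = list(A0)
    for start, end in zip(corners, corners[1:]):
        A = transport(start, end, A)
    return [A[i] - A0[i] for i in range(2)]

def predicted_change(theta0, a, b, A0):
    # (43.10) with R^theta_{phi,phi theta} = sin^2(theta) and
    # R^phi_{theta,phi theta} = -1 for the unit sphere.
    area = a * b
    return [-A0[1] * area, math.sin(theta0) ** 2 * A0[0] * area]

A0 = [1.0, 0.5]
actual = loop_change(1.0, 0.2, 0.01, 0.01, A0)
predicted = predicted_change(1.0, 0.01, 0.01, A0)
print(actual, predicted)
```

The two results agree to leading order in the size of the contour; the remaining discrepancy is of third order in the side lengths, as is to be expected from the approximations (43.05) and (43.07).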

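In Section 40 we introduced the operation of covariant differentiation. In
general, covariant differentiations do not commute: the second covariant
derivative of a vector or a tensor taken first with respect to $x_\beta$ and then to
$x_\alpha$ is not equal to the second derivative with respect to $x_\alpha$ and then $x_\beta$. We now
consider the difference of these two derivatives for the case of a vector. According
to (40.05) we have

$$\nabla_\beta A_\mu = \frac{\partial A_\mu}{\partial x_\beta} - \Gamma^\nu_{\mu\beta}\, A_\nu \tag{43.12}$$

We calculate $\nabla_\alpha \nabla_\beta A_\mu$ in a locally geodesic coordinate system, which is one in
which the first derivatives of the $g_{\mu\nu}$ vanish at the point considered. We have

$$\nabla_\alpha \nabla_\beta A_\mu = \frac{\partial^2 A_\mu}{\partial x_\alpha\, \partial x_\beta} - \frac{\partial\Gamma^\nu_{\mu\beta}}{\partial x_\alpha}\, A_\nu \tag{43.13}$$

and therefore

$$(\nabla_\alpha \nabla_\beta - \nabla_\beta \nabla_\alpha)\, A_\mu = \left(\frac{\partial\Gamma^\nu_{\mu\alpha}}{\partial x_\beta} - \frac{\partial\Gamma^\nu_{\mu\beta}}{\partial x_\alpha}\right) A_\nu \tag{43.14}$$

In the locally geodesic system the factor of $A_\nu$ on the right-hand side coincides
with the expression for the curvature tensor

$$R^\nu_{\mu,\alpha\beta} = \frac{\partial\Gamma^\nu_{\mu\alpha}}{\partial x_\beta} - \frac{\partial\Gamma^\nu_{\mu\beta}}{\partial x_\alpha} + \Gamma^\rho_{\mu\alpha}\Gamma^\nu_{\rho\beta} - \Gamma^\rho_{\mu\beta}\Gamma^\nu_{\rho\alpha} \tag{43.15}$$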


Therefore, in a locally geodesic system the equation

$$(\nabla_\alpha \nabla_\beta - \nabla_\beta \nabla_\alpha)\, A_\mu = R^\nu_{\mu,\alpha\beta}\, A_\nu \tag{43.16}$$

is valid. Now, both sides of this equation are tensors and therefore (43.16)
must also be true in an arbitrary system of coordinates. The expression (43.16)
for the difference of second derivatives of a vector is thus of general validity.
The corresponding equations for a contravariant vector

$$(\nabla_\alpha \nabla_\beta - \nabla_\beta \nabla_\alpha)\, A^\mu = -R^\mu_{\nu,\alpha\beta}\, A^\nu \tag{43.17}$$

can be derived similarly.

The technique used here of first stating an equation between tensors in a
locally geodesic system and then concluding that it is generally valid is very
useful for abbreviating the derivation of results. The equations (43.16) and
(43.17) are so simple that they can also be obtained quite easily without using this
technique, but there are other cases where the simplification is more substantial.

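Let us, for instance, consider the expression for the difference of second
covariant derivatives of a tensor of arbitrary rank

$$U^{(\mu)}_{(\nu)} = U^{\mu_1 \ldots \mu_m}_{\nu_1 \ldots \nu_k} \tag{43.18}$$

Equation (40.26) for the first derivative can be written in the form

$$\nabla_\beta U^{(\mu)}_{(\nu)} = \frac{\partial U^{(\mu)}_{(\nu)}}{\partial x_\beta} + \sum_{i=1}^{m} \Gamma^{\mu_i}_{\rho\beta}\, U^{\mu_1 \ldots \mu_{i-1}\rho\mu_{i+1} \ldots \mu_m}_{(\nu)} - \sum_{j=1}^{k} \Gamma^{\rho}_{\nu_j\beta}\, U^{(\mu)}_{\nu_1 \ldots \nu_{j-1}\rho\nu_{j+1} \ldots \nu_k} \tag{43.19}$$

Forming the second derivative in a geodesic system gives

$$\nabla_\alpha \nabla_\beta U^{(\mu)}_{(\nu)} = \frac{\partial^2 U^{(\mu)}_{(\nu)}}{\partial x_\alpha\, \partial x_\beta} + \sum_{i=1}^{m} \frac{\partial\Gamma^{\mu_i}_{\rho\beta}}{\partial x_\alpha}\, U^{\mu_1 \ldots \mu_{i-1}\rho\mu_{i+1} \ldots \mu_m}_{(\nu)} - \sum_{j=1}^{k} \frac{\partial\Gamma^{\rho}_{\nu_j\beta}}{\partial x_\alpha}\, U^{(\mu)}_{\nu_1 \ldots \nu_{j-1}\rho\nu_{j+1} \ldots \nu_k} \tag{43.20}$$

Therefore, the difference of second derivatives in a geodesic system coincides
with the expression

$$(\nabla_\alpha \nabla_\beta - \nabla_\beta \nabla_\alpha)\, U^{(\mu)}_{(\nu)} = -\sum_{i=1}^{m} R^{\mu_i}_{\rho,\alpha\beta}\, U^{\mu_1 \ldots \mu_{i-1}\rho\mu_{i+1} \ldots \mu_m}_{(\nu)} + \sum_{j=1}^{k} R^{\rho}_{\nu_j,\alpha\beta}\, U^{(\mu)}_{\nu_1 \ldots \nu_{j-1}\rho\nu_{j+1} \ldots \nu_k} \tag{43.21}$$

But since both sides of this equation are tensors, (43.21) must be correct in all
coordinate systems.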


44. The Basic Properties of the Curvature Tensor 

Side by side with the mixed curvature tensor

$$R^\rho_{\mu,\alpha\beta} \tag{44.01}$$

we shall consider the covariant tensor

$$R_{\mu\nu,\alpha\beta} = g_{\nu\rho}\, R^\rho_{\mu,\alpha\beta} \tag{44.02}$$

According to this definition the mixed tensor is obtained from $R_{\mu\nu,\alpha\beta}$
by raising the second covariant suffix. Therefore, a more precise notation for
it would be

$$R^\rho_{\mu,\alpha\beta} = R_{\mu}{}^{\rho}{}_{,\alpha\beta} \tag{44.03}$$

We note that in older literature the covariant and the mixed tensors used to be
denoted by the symbols

$$R_{\mu\nu,\alpha\beta} = (\mu\nu, \alpha\beta) \tag{44.04}$$

and

$$R^\rho_{\mu,\alpha\beta} = \{\mu\rho, \alpha\beta\} \tag{44.05}$$

which were known as Riemann’s four-index symbols of the first and second
kind.

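Evaluating the covariant curvature tensor by (44.02) we get

$$R_{\mu\nu,\alpha\beta} = \frac{\partial [\mu\alpha, \nu]}{\partial x_\beta} - \frac{\partial [\mu\beta, \nu]}{\partial x_\alpha} + \Gamma^\rho_{\mu\alpha}\left([\rho\beta, \nu] - \frac{\partial g_{\nu\rho}}{\partial x_\beta}\right) - \Gamma^\rho_{\mu\beta}\left([\rho\alpha, \nu] - \frac{\partial g_{\nu\rho}}{\partial x_\alpha}\right) \tag{44.06}$$

or finally

$$R_{\mu\nu,\alpha\beta} = \frac12\left(\frac{\partial^2 g_{\nu\alpha}}{\partial x_\mu\, \partial x_\beta} + \frac{\partial^2 g_{\mu\beta}}{\partial x_\nu\, \partial x_\alpha} - \frac{\partial^2 g_{\nu\beta}}{\partial x_\mu\, \partial x_\alpha} - \frac{\partial^2 g_{\mu\alpha}}{\partial x_\nu\, \partial x_\beta}\right) - \Gamma^\rho_{\mu\alpha}\, [\nu\beta, \rho] + \Gamma^\rho_{\mu\beta}\, [\nu\alpha, \rho] \tag{44.07}$$

Here the last two terms can be expressed in terms of the $\Gamma^\rho_{\mu\nu}$ giving

$$R_{\mu\nu,\alpha\beta} = \frac12\left(\frac{\partial^2 g_{\nu\alpha}}{\partial x_\mu\, \partial x_\beta} + \frac{\partial^2 g_{\mu\beta}}{\partial x_\nu\, \partial x_\alpha} - \frac{\partial^2 g_{\nu\beta}}{\partial x_\mu\, \partial x_\alpha} - \frac{\partial^2 g_{\mu\alpha}}{\partial x_\nu\, \partial x_\beta}\right) - g_{\rho\sigma}\left(\Gamma^\rho_{\mu\alpha}\Gamma^\sigma_{\nu\beta} - \Gamma^\rho_{\mu\beta}\Gamma^\sigma_{\nu\alpha}\right) \tag{44.08}$$

From this expression the following symmetry properties of the tensor can be
deduced:

1. Antisymmetry in its first two suffixes

$$R_{\nu\mu,\alpha\beta} = -R_{\mu\nu,\alpha\beta} \tag{44.09}$$

2. Antisymmetry in its last two suffixes

$$R_{\mu\nu,\beta\alpha} = -R_{\mu\nu,\alpha\beta} \tag{44.10}$$

3. Cyclic symmetry

$$R_{\mu\nu,\alpha\beta} + R_{\mu\alpha,\beta\nu} + R_{\mu\beta,\nu\alpha} = 0 \tag{44.11}$$

The first two properties are obvious. To verify the last it is sufficient to write
the left-hand side of (44.11) in a locally geodesic system and to verify that the
twelve second derivatives that occur cancel in pairs.

From the three stated properties it follows further that we can interchange
the first pair of suffixes with the second, without any change of order within
either pair:

$$R_{\mu\nu,\alpha\beta} = R_{\alpha\beta,\mu\nu} \tag{44.12}$$

This last property also follows directly from the definition (44.08); to show
that it is not independent, but a consequence of the other three, we can add
together equation (44.11) and three other equations obtained from (44.11) by
cyclic permutation of the four suffixes $(\mu, \nu, \alpha, \beta)$. If the antisymmetry
properties 1. and 2. are used eight of the twelve terms in the sum cancel in pairs,
the remaining four giving

$$-2 R_{\mu\alpha,\nu\beta} + 2 R_{\nu\beta,\mu\alpha} = 0 \tag{44.13}$$

This differs from (44.12) only in the naming of suffixes.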

We now count how many independent components the curvature tensor
possesses; we do this on the assumption that each suffix may take on n values,
where actually n = 4. Evidently all four suffixes cannot be equal. All
non-vanishing components having only two distinct suffixes reduce to the type
$R_{\alpha\beta,\alpha\beta}$. Their number is the number of pairs of unequal suffixes, i.e. $\tfrac12 n(n-1)$.
Further, if a triplet of different numbers $\alpha$, $\beta$, $\gamma$ is given (and there are
$\tfrac16 n(n-1)(n-2)$ such triplets) the non-vanishing tensor components that
may be formed from it are $R_{\alpha\beta,\alpha\gamma}$; $R_{\beta\alpha,\beta\gamma}$; $R_{\gamma\alpha,\gamma\beta}$ which respectively have the
first, second or third of the numbers $\alpha$, $\beta$ and $\gamma$ repeated. The number of such
components will be three times the number of triplets, i.e. $\tfrac12 n(n-1)(n-2)$.
Finally, there exist $\tfrac1{24} n(n-1)(n-2)(n-3)$ combinations of four different
numbers, $\alpha$, $\beta$, $\gamma$ and $\delta$. With each such combination one can form components
$R_{\alpha\beta,\gamma\delta}$, $R_{\alpha\delta,\beta\gamma}$ and $R_{\alpha\gamma,\delta\beta}$, while all other combinations of suffixes reduce to
these three. However, these three components are not independent because
they are linked by the cyclic symmetry condition, leaving only two independent
quantities. Therefore, the number of independent tensor components having
four distinct suffixes is twice that of quartets of numbers, i.e. is equal to
$\tfrac1{12} n(n-1)(n-2)(n-3)$. The total number of independent components is

$$\frac12 n(n-1) + \frac12 n(n-1)(n-2) + \frac1{12} n(n-1)(n-2)(n-3) = \frac1{12} n^2(n^2-1) \tag{44.14}$$

In the case of interest n = 4 so that the number is

$$6 + 12 + 2 = 20 \tag{44.15}$$

One might note that in three-dimensional space (n = 3) the number of independent
components of the curvature tensor is 6, which is also the number of
components of a symmetric second rank tensor. In fact, for n = 3, the curvature
tensor can be expressed in terms of a second rank symmetric tensor, as
shown in Appendix E. Finally, the curvature tensor of a two-dimensional
surface (n = 2) has only one component, the Gaussian curvature.
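The bookkeeping behind (44.14) can be reproduced with a few lines of integer arithmetic. The Python sketch below simply evaluates the three partial counts and the closed form for a given n; the function names are hypothetical and serve only this illustration.

```python
def count_by_type(n):
    # Independent components sorted by the number of distinct suffix values.
    two_distinct = n * (n - 1) // 2                        # type R_{ab,ab}
    three_distinct = n * (n - 1) * (n - 2) // 2            # three per triplet
    four_distinct = n * (n - 1) * (n - 2) * (n - 3) // 12  # two per quartet
    return two_distinct, three_distinct, four_distinct

def count_closed_form(n):
    # Right-hand side of (44.14).
    return n * n * (n * n - 1) // 12

for n in (2, 3, 4):
    print(n, count_by_type(n), count_closed_form(n))
```

For n = 4 the three partial counts are 6, 12 and 2, reproducing (44.15), while n = 3 and n = 2 give the totals 6 and 1 quoted in the text.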

We note that in Section 31 we have already encountered quantities with the 
same symmetry properties as the covariant curvature tensor and we observed 
there the connection of such quantities with Krutkov’s symmetric tensor. 

We have studied the properties of the covariant curvature tensor. The
mixed curvature tensor has analogous properties

$$R^\rho_{\mu,\beta\alpha} = -R^\rho_{\mu,\alpha\beta} \tag{44.16}$$

and

$$R^\rho_{\mu,\alpha\beta} + R^\rho_{\alpha,\beta\mu} + R^\rho_{\beta,\mu\alpha} = 0 \tag{44.17}$$

corresponding to (44.10) and (44.11). As regards the property corresponding
to (44.09) it takes on a somewhat more complicated form, namely

$$g_{\rho\nu}\, g^{\mu\sigma}\, R^\rho_{\mu,\alpha\beta} = -R^\sigma_{\nu,\alpha\beta} \tag{44.18}$$

(It means that the raising of the first suffix and the lowering of the superfix
results in a change of sign of the component.) This property can easily be
derived independently of (44.09) by comparing (43.10) and (43.11) or (43.16)
and (43.17) or alternatively by use of the equation

$$(\nabla_\alpha \nabla_\beta - \nabla_\beta \nabla_\alpha)\, g_{\mu\nu} = R^\rho_{\mu,\alpha\beta}\, g_{\rho\nu} + R^\rho_{\nu,\alpha\beta}\, g_{\mu\rho} = 0 \tag{44.19}$$

which follows from the general rule (43.21).

In addition to the algebraic identities just discussed the curvature tensor 
satisfies some differential relations which can be written as 

$$\nabla_{\lambda} R_{\mu\nu,\alpha\beta} + \nabla_{\mu} R_{\nu\lambda,\alpha\beta} + \nabla_{\nu} R_{\lambda\mu,\alpha\beta} = 0 \tag{44.20}$$

and are called the Bianchi identities. To verify them we introduce a locally
geodesic coordinate system, in which covariant and ordinary derivatives can
be identified. The calculations are simplified by the fact that in the expression
(44.08) for $R_{\mu\nu,\alpha\beta}$ the terms not containing second derivatives are quadratic in
the $\Gamma^{\rho}_{\mu\nu}$, so that not only these terms themselves, but also their first derivatives
become zero in a locally geodesic system. In such a system the left-hand side
of (44.20) is equal to



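$$\nabla_{\lambda} R_{\mu\nu,\alpha\beta} + \nabla_{\mu} R_{\nu\lambda,\alpha\beta} + \nabla_{\nu} R_{\lambda\mu,\alpha\beta}
= \frac{1}{2}\left(
\frac{\partial^3 g_{\mu\alpha}}{\partial x_{\nu}\,\partial x_{\lambda}\,\partial x_{\beta}}
+ \frac{\partial^3 g_{\mu\beta}}{\partial x_{\nu}\,\partial x_{\lambda}\,\partial x_{\alpha}}
+ \frac{\partial^3 g_{\nu\alpha}}{\partial x_{\lambda}\,\partial x_{\mu}\,\partial x_{\beta}}
+ \frac{\partial^3 g_{\nu\beta}}{\partial x_{\lambda}\,\partial x_{\mu}\,\partial x_{\alpha}}
+ \frac{\partial^3 g_{\lambda\alpha}}{\partial x_{\mu}\,\partial x_{\nu}\,\partial x_{\beta}}
+ \frac{\partial^3 g_{\lambda\beta}}{\partial x_{\mu}\,\partial x_{\nu}\,\partial x_{\alpha}}
\right) - \frac{1}{2}\left(\ldots\right) \tag{44.21}$$

where the dots denote terms of the same structure as those given but with
$\alpha$ and $\beta$ interchanged. Now, the expression in the bracket is itself symmetric
in $\alpha$ and $\beta$ so that the two brackets are equal and their difference vanishes.
This proves that (44.20) is valid in a locally geodesic system and thus, being
a tensor equation, it must be valid generally.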

By contracting with respect to two indices one can construct a second rank
tensor out of the fourth rank curvature tensor. Contraction with respect to the
first two, or the last two suffixes evidently gives zero, all other contractions
give one and the same result, apart from a sign. In this way we get the tensor

$$R_{\mu\nu} = g^{\alpha\beta} R_{\mu\alpha,\beta\nu} \tag{44.22}$$

which is called the second rank curvature tensor or the Riemann tensor. It is
easy to see that the Riemann tensor is symmetric. For, using (44.09), (44.10)
and (44.12) we get

$$g^{\alpha\beta} R_{\mu\alpha,\beta\nu} = g^{\alpha\beta} R_{\beta\nu,\mu\alpha} = g^{\alpha\beta} R_{\nu\beta,\alpha\mu} = R_{\nu\mu} \tag{44.23}$$

or

$$R_{\mu\nu} = R_{\nu\mu} \tag{44.24}$$

Side by side with the covariant Riemann tensor one considers also the mixed
tensor

$$R^{\mu}_{\nu} = g^{\mu\rho} R_{\rho\nu} \tag{44.25}$$

and the contravariant tensor

$$R^{\mu\nu} = g^{\mu\rho} g^{\nu\sigma} R_{\rho\sigma} \tag{44.26}$$

Further contraction, with respect to $\mu$ and $\nu$, leads to the scalar

$$R = g^{\mu\nu} R_{\mu\nu} = R^{\nu}_{\nu} \tag{44.27}$$

which is called the scalar curvature.

We now calculate the divergence of the Riemann tensor,

$$Y_{\nu} = \nabla_{\mu} R^{\mu}_{\nu} \tag{44.28}$$

Introducing (44.22) for the Riemann tensor we get

(44.29)

To the covariant derivative in the above we apply the Bianchi identities written
in the form

VAa. f Vv, vA L A P = 0 (44.30) 

which is obtained from (44.20) with the aid of (44.12). This gives 

y v = vX - Xf4 (44.31) 

This relation can be rewritten as 


y v = -r v + v v i? 


(44.32) 


for if the summation indices in the first term on the right of (44.31) are renamed 
so as to exchange X with p and p with a it becomes the same as (44.29). In the 
last term of (44.31) the summation may be performed before the covariant
differentiation, reducing that term to $\nabla_{\nu} R$. Thus we find

$$Y_{\nu} = \frac{1}{2}\nabla_{\nu} R = \frac{1}{2}\frac{\partial R}{\partial x_{\nu}} \tag{44.33}$$

in view of the fact that for a scalar the covariant and the ordinary derivative
coincide. Comparing (44.28) and (44.33) we see that the divergence of the
tensor

$$G_{\mu\nu} = R_{\mu\nu} - \frac{1}{2}\, g_{\mu\nu} R \tag{44.34}$$

vanishes identically. For this reason the tensor $G_{\mu\nu}$ is called the conservative
tensor. As it plays an important part in Einstein’s Theory of Gravitation it is
also called the Einstein tensor.

Further transformations of the curvature tensor will be postponed until 
Chapter V, which deals with the theory of gravitation. 



CHAPTER IV 


A FORMULATION OF RELATIVITY THEORY IN 
ARBITRARY COORDINATES 


45. Properties of Space-Time and Choice of Coordinates 

The form of the equations that describe the course of a physical process in
space and time depends not only on the specific nature of the particular process
but also on two further circumstances: the properties of space-time and the
coordinates chosen. The properties of space-time are objective, they are
determined by Nature and do not depend on our choice. In contrast, the use of one
system of coordinates or another is to a very high degree a matter of our choice.
It is true that there is a certain limitation to arbitrariness, in so far as the
existence of certain preferential coordinate systems, namely the Galilean ones,
is only assured by reason of the objective properties of space-time; such
coordinate systems would not exist if these properties were different. However,
it is always possible, by a mathematical transformation, to pass from a
preferential coordinate system to any other arbitrary one. We have studied the
rules for doing this in the preceding chapter.

To detach oneself from the specific characteristics of any particular process
one must consider those equations which are of greatest generality and which
characterize most directly the properties of space-time. The equation that
states the law of propagation of a wave front travelling with the limiting
velocity is of this kind. In the first place, this law is obeyed by light, i.e.
electromagnetic waves in free space, but we have already pointed out that it
should not be regarded as referring specifically to light, but as a general law to
which any disturbance travelling with limiting velocity is subject. The
equation for the propagation of a wave front in free space characterizes not only
the properties of the kind of matter being propagated (e.g. of the electromagnetic
field) but also the properties of space-time itself. (We have repeatedly pointed
out that the practical measurement of great distances is based on triangulation
and on radar, i.e. also on the use of the laws of electromagnetic wave
propagation.) Consequently, the concepts of geometry and the notion of time are very
closely connected with the law of wave front propagation in free space.

In Galilean coordinates

$$x'_0 = ct; \quad x'_1 = x; \quad x'_2 = y; \quad x'_3 = z \tag{45.01}$$

this law is expressed by the equation

$$\left(\frac{\partial\omega}{\partial x'_0}\right)^2 - \left(\frac{\partial\omega}{\partial x'_1}\right)^2 - \left(\frac{\partial\omega}{\partial x'_2}\right)^2 - \left(\frac{\partial\omega}{\partial x'_3}\right)^2 = 0 \tag{45.02}$$

where $\omega = \text{const}$ is the equation of the moving surface forming the wave
front. Equation (45.02) is a mathematical statement of the fact that the
wave surface moves in the direction of its own normal and with the speed of
light. If we identify the normal to a wave surface with a light ray and consider
the point of intersection of ray and front, equation (45.02) permits us to assert
that in Galilean coordinates this front moves along the ray in a straight line
and uniformly with the speed of light.

Side by side with the propagation of a wave front we can consider the
simplest process in which a motion with a speed less than that of light is
realized. This is the free motion of a mass point. In the form due to Hamilton
and Jacobi its equations of motion take on a form similar to (45.02), namely

$$\left(\frac{\partial\omega}{\partial x'_0}\right)^2 - \left(\frac{\partial\omega}{\partial x'_1}\right)^2 - \left(\frac{\partial\omega}{\partial x'_2}\right)^2 - \left(\frac{\partial\omega}{\partial x'_3}\right)^2 = 1 \tag{45.03}$$

where $\omega$ is proportional to the action function. Indeed, equation (45.03) has a
complete integral of the form

$$\omega = a_1 x'_1 + a_2 x'_2 + a_3 x'_3 - \sqrt{1 + a_1^2 + a_2^2 + a_3^2}\;x'_0 \tag{45.04}$$

If the derivatives of $\omega$ with respect to $a_1$, $a_2$, $a_3$ are equated to constants the
resulting equations describe uniform rectilinear motion with less than light
velocity. Thus we can take (45.02) and (45.03) to be the equations which reflect
most directly the properties of space-time.
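The statement about the complete integral can be checked numerically. The sketch below is an illustration only: it assumes (45.04) in the form $\omega = a_1 x'_1 + a_2 x'_2 + a_3 x'_3 - \sqrt{1 + a_1^2 + a_2^2 + a_3^2}\,x'_0$, evaluates the left-hand side of (45.03) by finite differences, and computes the velocity (in units of $c$) that follows from setting $\partial\omega/\partial a_i$ equal to constants. All function names are hypothetical.

```python
import math


def omega(x, a):
    """Assumed complete integral; x = (x0, x1, x2, x3), a = (a1, a2, a3)."""
    root = math.sqrt(1.0 + sum(ai * ai for ai in a))
    return sum(ai * xi for ai, xi in zip(a, x[1:])) - root * x[0]


def gradient(f, x, h=1e-6):
    """Central-difference partial derivatives of f at the point x."""
    grads = []
    for k in range(len(x)):
        up = list(x)
        dn = list(x)
        up[k] += h
        dn[k] -= h
        grads.append((f(up) - f(dn)) / (2.0 * h))
    return grads


def hamilton_jacobi_lhs(x, a):
    """(d omega/dx0)^2 minus the sum of the squared spatial derivatives."""
    g = gradient(lambda y: omega(y, a), x)
    return g[0] ** 2 - sum(gi ** 2 for gi in g[1:])


def velocity(a):
    """dx_i/dx0 from d(omega)/d(a_i) = const: a_i / sqrt(1 + a^2)."""
    root = math.sqrt(1.0 + sum(ai * ai for ai in a))
    return [ai / root for ai in a]


a = [0.7, -1.2, 2.0]
print(hamilton_jacobi_lhs([0.3, 1.0, -2.0, 0.5], a))  # close to 1
print(math.sqrt(sum(v * v for v in velocity(a))))     # below 1
```

Since $|a|/\sqrt{1 + a^2} < 1$ for every choice of the constants, the speed always stays below that of light.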

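The differential expression

$$(\nabla\omega)^2 = \left(\frac{\partial\omega}{\partial x'_0}\right)^2 - \left(\frac{\partial\omega}{\partial x'_1}\right)^2 - \left(\frac{\partial\omega}{\partial x'_2}\right)^2 - \left(\frac{\partial\omega}{\partial x'_3}\right)^2 \tag{45.05}$$

which enters these equations characterizes both the properties of space-time
and the physical significance of the coordinates.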

If we did not know the significance of the variables $(x'_0, x'_1, x'_2, x'_3)$ and knew
only that they were in some way related to space and time (or, mathematically
expressed, that they served to arithmeticize space and time) we would be able
to obtain their physical interpretation by considering processes described
by the equations (45.02) and (45.03). In fact this is how we proceeded in
constructing the Theory of Relativity. We established the physical meaning of the
variables (45.01) in two stages. First we made use of those not completely
accurate concepts of space coordinates and of time which were accepted in
classical pre-relativistic physics. The provisional definition of time linked the
variable $t$ with the motion of a clock or the course of any other periodic process
and the definition of the space coordinates related the variables $x$, $y$ and $z$ to
distances measured with the aid of solid bodies on the basis of Euclidean
geometry, i.e. to a coordinate mesh. But we stressed from the start that these
definitions needed to be made more precise.

Indeed, the use of clocks at different points in space requires a solution of 
the problem of their synchronization and the use of rigid rods for measuring 
large distances is not only impossible in practice but also open to objections of 
principle. 

The definition of physical quantities is never arbitrary; it always reflects
Nature with greater or lesser accuracy. Therefore it is possible to make
definitions more precise only on the basis of a deeper knowledge of Nature. The
particular definitions we are here concerned with, those of the space and time
coordinates, are supported not only by the law of the uniform rectilinear motion
of a free body, already invoked by Newton, but also by the firmly established
law for the front of a light wave, i.e. by (45.02) and (45.03). These were just the
lines we followed in developing the Theory of Relativity which is essentially a
theory of space and time. (The designation “Relativity” has historic reasons
and gives only a one-sided idea of the content of the theory.) The Lorentz
transformations, so fundamental for the Theory of Relativity as the expressions
of the nature of space and time and of the precise meaning of the variables
$x$, $y$, $z$ and $t$, are indeed derived from the law of wave front propagation, together
with the requirement of conservation for uniform rectilinear motion; they are
not given a priori, before this law is established.

Thus both the properties of space-time and the significance of the Galilean
coordinates (45.01) are derived from equations (45.02) and (45.03).

Both the wave front equation and the Hamilton-Jacobi equations for a free
mass point may, however, be written in more general form:

$$g^{\mu\nu}\,\frac{\partial\omega}{\partial x_{\mu}}\,\frac{\partial\omega}{\partial x_{\nu}} = 0 \tag{45.06}$$

and

$$g^{\mu\nu}\,\frac{\partial\omega}{\partial x_{\mu}}\,\frac{\partial\omega}{\partial x_{\nu}} = 1 \tag{45.07}$$

and the statement just made remains true: both the nature of space-time and
the significance of the coordinates $(x_0, x_1, x_2, x_3)$ may be determined from these
equations.

Let us assume that the $g^{\mu\nu}$ are given functions of their variables. Which
properties of these functions reflect properties of space-time itself and which
are determined merely by the particular choice of coordinates?

If we admit that the objective properties of space-time are correctly described 
by the usual Theory of Relativity, these properties may be expressed by the 
statement that there exist Galilean coordinate systems in which equations 
(45.06) and (45.07) assume the forms (45.02) and (45.03). 

This assertion means that there exists a substitution

$$x'_{\alpha} = f_{\alpha}(x_0, x_1, x_2, x_3) \qquad (\alpha = 0, 1, 2, 3) \tag{45.08}$$

which reduces the expression

$$(\nabla\omega)^2 = g^{\mu\nu}\,\frac{\partial\omega}{\partial x_{\mu}}\,\frac{\partial\omega}{\partial x_{\nu}} \tag{45.09}$$

to the form

$$(\nabla\omega)^2 = \sum_{k=0}^{3} e_k \left(\frac{\partial\omega}{\partial x'_k}\right)^2 \tag{45.10}$$

We know from general tensor analysis that a necessary and sufficient
condition for the possibility of reducing the quadratic form (45.09) to one with
constant coefficients is the vanishing of the fourth rank tensor of curvature
formed from the $g^{\mu\nu}$ and the related $g_{\mu\nu}$. In order that the transformed quadratic
form should have one plus sign and three minus signs attached to the squared
partial derivatives, it is necessary, in addition, that the inequalities stated in
Section 35 be satisfied.

Thus, the equations that must be satisfied by the $g^{\mu\nu}$ are

$$R_{\mu\nu,\alpha\beta} = 0 \tag{45.11}$$

and

$$g^{00} > 0; \qquad \sum_{i,k=1}^{3} g^{ik}\,\xi_i\,\xi_k < 0 \tag{45.12}$$

where $\xi_1$, $\xi_2$ and $\xi_3$ are arbitrary numbers. These, then, are the equations which
express, independently of any choice of coordinates, the properties of
space-time in the approximation in which the ordinary Theory of Relativity is valid.

If these equations are satisfied, the form of the required transformation
(45.08) is determined uniquely except for a Lorentz transformation. The
quantities $x'_{\alpha} = f_{\alpha}$ must therefore be interpreted as Galilean coordinates, and
this also gives the interpretation of the variables $(x_0, x_1, x_2, x_3)$, in which the
equations were originally stated.

In the foregoing discussion we assumed that the $g^{\mu\nu}$ are given functions of
the coordinates, but we can also adopt a different point of view and consider
them as unknown functions subject only to the conditions (45.11) and (45.12)
which express the nature of space-time.

Solving these equations one finds for the covariant components of the metric
tensor expressions of the form

$$g_{\alpha\beta} = \sum_{k=0}^{3} e_k\,\frac{\partial f_k}{\partial x_{\alpha}}\,\frac{\partial f_k}{\partial x_{\beta}} \tag{45.13}$$

involving four arbitrary functions $f_k$. (The appearance of arbitrary functions in
the solution is connected with the fact that the equations for the $g_{\alpha\beta}$ are covariant
with respect to arbitrary coordinate transformations.) One or other choice
of the arbitrary functions $f_k$ can in no way influence the physical consequences
of the theory, for it corresponds to a purely mathematical transformation of
the equations from one set of independent variables to another. It is, however,
appropriate to limit the choice of arbitrary functions in such a way that the
permitted transformations form as restricted a group as possible and that the
basic equations appear in the simplest possible form. If such a limitation is
possible (and this does depend on the objective properties of space-time) the
preferred coordinate systems obtained in this way will have a more direct
physical interpretation. In the case here considered a Galilean system is so
preferred; it is obtained if in the general solution of (45.13) we put $f_k = x_k$.

It is, nevertheless, also possible to leave the functions $f_k$ indeterminate and
to formulate the equations for physical processes without presupposing a
solution of the problem of choosing independent variables. Then, for instance,
the law of wave front propagation will be expressed by the system of equations
(45.06), (45.11) and (45.12) for the unknown functions $\omega$ and $g^{\mu\nu}$. The law of
motion of a free mass point is expressible similarly. Other examples of such a
generally covariant formulation follow in the next section.
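As an illustration of how a particular choice of the arbitrary functions fixes the metric, the sketch below evaluates the sum $\sum_k e_k\,(\partial f_k/\partial x_{\alpha})(\partial f_k/\partial x_{\beta})$ numerically. The choice of the $f_k$ (spherical space coordinates with unchanged time) and all function names are assumptions made for the example only, with $e_0 = 1$, $e_1 = e_2 = e_3 = -1$.

```python
import math

E = (1.0, -1.0, -1.0, -1.0)  # signs e_k of the Galilean quadratic form


def galilean_from_spherical(x):
    """Hypothetical choice of the f_k: x = (x0, r, theta, phi)."""
    x0, r, theta, phi = x
    return (x0,
            r * math.sin(theta) * math.cos(phi),
            r * math.sin(theta) * math.sin(phi),
            r * math.cos(theta))


def metric(f, x, h=1e-6):
    """g_ab = sum_k e_k (df_k/dx_a)(df_k/dx_b), derivatives by central differences."""
    n = len(x)
    jac = [[0.0] * n for _ in range(n)]  # jac[k][a] = d f_k / d x_a
    for a in range(n):
        up = list(x)
        dn = list(x)
        up[a] += h
        dn[a] -= h
        f_up, f_dn = f(up), f(dn)
        for k in range(n):
            jac[k][a] = (f_up[k] - f_dn[k]) / (2.0 * h)
    return [[sum(E[k] * jac[k][a] * jac[k][b] for k in range(n))
             for b in range(n)] for a in range(n)]


g = metric(galilean_from_spherical, [0.0, 2.0, 0.8, 0.3])
for row in g:
    print([round(value, 6) for value in row])
```

At $r = 2$ the diagonal comes out close to $1$, $-1$, $-r^2$ and $-r^2\sin^2\theta$ with vanishing off-diagonal terms. This is the familiar flat metric in spherical coordinates, and the space-time it describes is the same as in the Galilean case.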


46. The Equations of Mathematical Physics in Arbitrary Coordinates 

If for some system of differential equations of mathematical physics the
tensor form in Galilean coordinates is known, the corresponding equations in
arbitrary coordinates are obtained simply by replacing ordinary by covariant
derivatives. This rule is even applicable to equations involving second or
higher derivatives because, if $R_{\mu\nu,\alpha\beta} = 0$, covariant differentiations applied
to any vector or tensor are commutative by the general rule (43.21).

We consider first the equations of electrodynamics. 

Since the linear differential form (24.04)

$$\delta\varphi = A_{\nu}\,dx_{\nu} \tag{46.01}$$

is an invariant the components of potential occurring in it form a covariant
vector even for general coordinate transformations. The differential relation
(24.05) which is imposed on the components of potential can be written as

$$\nabla_{\nu} A^{\nu} = 0 \tag{46.02}$$

where the $A^{\nu}$ are the contravariant components

$$A^{\nu} = g^{\nu\mu} A_{\mu} \tag{46.03}$$

According to (41.08) an explicit form of (46.02) is

$$\frac{1}{\sqrt{-g}}\,\frac{\partial}{\partial x_{\nu}}\left(\sqrt{-g}\,A^{\nu}\right) = 0 \tag{46.04}$$
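The explicit divergence formula lends itself to a quick numerical check. The sketch below is an illustration under stated assumptions: the coordinates are taken to be $(x_0, r, \theta, \varphi)$ on flat space-time, for which $\sqrt{-g} = r^2\sin\theta$, the two test fields are invented for the example, and derivatives are approximated by central differences.

```python
import math


def sqrt_minus_g(x):
    """sqrt(-g) for the assumed coordinates (x0, r, theta, phi)."""
    _, r, theta, _ = x
    return r * r * math.sin(theta)


def covariant_divergence(A, x, h=1e-5):
    """(1/sqrt(-g)) * sum over nu of d(sqrt(-g) A^nu)/dx_nu."""
    total = 0.0
    for nu in range(4):
        up = list(x)
        dn = list(x)
        up[nu] += h
        dn[nu] -= h
        total += (sqrt_minus_g(up) * A(up)[nu]
                  - sqrt_minus_g(dn) * A(dn)[nu]) / (2.0 * h)
    return total / sqrt_minus_g(x)


def inverse_square(x):
    """Contravariant components with A^r = 1/r^2 (divergence-free for r > 0)."""
    return (0.0, 1.0 / x[1] ** 2, 0.0, 0.0)


def radial(x):
    """Contravariant components with A^r = r, i.e. the Cartesian field (x, y, z)."""
    return (0.0, x[1], 0.0, 0.0)


point = [0.0, 1.5, 0.9, 0.4]
print(covariant_divergence(inverse_square, point))  # approximately 0
print(covariant_divergence(radial, point))          # approximately 3
```

The first field satisfies the condition $\nabla_{\nu}A^{\nu} = 0$, while the second has the divergence 3 that one finds for $(x, y, z)$ in Cartesian coordinates. The factor $\sqrt{-g}$ is what makes the result independent of the coordinates used.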

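The connection between fields and potentials is given by

$$F_{\mu\nu} = \nabla_{\mu} A_{\nu} - \nabla_{\nu} A_{\mu} \tag{46.05}$$

but, as was already noted in Section 41, the terms in which covariant and
ordinary differentiations differ, cancel when the difference in (46.05) is taken,
so that for the antisymmetric field tensor we have the expression

$$F_{\mu\nu} = \frac{\partial A_{\nu}}{\partial x_{\mu}} - \frac{\partial A_{\mu}}{\partial x_{\nu}} \tag{46.06}$$

which is the same as (24.10). The first set of Maxwell-Lorentz equations,
which in the usual notation has the appearance

$$\operatorname{curl}\mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{H}}{\partial t} = 0; \qquad \operatorname{div}\mathbf{H} = 0 \tag{46.07}$$

can now be written as

$$F_{\lambda\mu\nu} = 0 \tag{46.08}$$

in agreement with (24.17), $F_{\lambda\mu\nu}$ being the totally antisymmetric† third rank
tensor which is expressible in terms of $F_{\mu\nu}$ as

$$F_{\lambda\mu\nu} = \nabla_{\lambda} F_{\mu\nu} + \nabla_{\mu} F_{\nu\lambda} + \nabla_{\nu} F_{\lambda\mu} \tag{46.09}$$

in generalization of (24.12). According to (41.28) this expression is equal to

$$F_{\lambda\mu\nu} = \frac{\partial F_{\mu\nu}}{\partial x_{\lambda}} + \frac{\partial F_{\nu\lambda}}{\partial x_{\mu}} + \frac{\partial F_{\lambda\mu}}{\partial x_{\nu}} \tag{46.10}$$

because $F_{\mu\nu}$ is antisymmetric.

† i.e. antisymmetric in all its indices.

Thus equation (24.12) remains unchanged. We now write down the second
set of Maxwell-Lorentz equations in arbitrary coordinates. In the usual
notation this set appears as

$$\operatorname{div}\mathbf{E} = 4\pi\rho; \qquad \operatorname{curl}\mathbf{H} - \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} = \frac{4\pi}{c}\,\mathbf{j} \tag{46.11}$$

According to (24.20) the left-hand side of (46.11) represents the contravariant
divergence of the field tensor and, according to (41.25), it may be expressed
in general coordinates as

$$\nabla_{\nu} F^{\mu\nu} = \frac{1}{\sqrt{-g}}\,\frac{\partial}{\partial x_{\nu}}\left(\sqrt{-g}\,F^{\mu\nu}\right) \tag{46.12}$$

since the field tensor is antisymmetric. On the right-hand sides of (46.11)
we have $4\pi/c$ times the contravariant current vector, i.e. the vector

$$s^{\mu} = \frac{4\pi}{c}\,\rho^* u^{\mu} \tag{46.13}$$

where $\rho^*$ is the invariant density and $u^{\mu}$ the contravariant velocity of the
charge. Therefore, in general coordinates the equations (46.11) will appear as

$$\frac{1}{\sqrt{-g}}\,\frac{\partial}{\partial x_{\nu}}\left(\sqrt{-g}\,F^{\mu\nu}\right) = s^{\mu} = \frac{4\pi}{c}\,\rho^* u^{\mu} \tag{46.14}$$

and the law of charge conservation follows from them in the form

$$\nabla_{\mu} s^{\mu} = \frac{1}{\sqrt{-g}}\,\frac{\partial}{\partial x_{\mu}}\left(\sqrt{-g}\,s^{\mu}\right) = 0 \tag{46.15}$$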
In order to write the generally covariant equations of motion of a charged
mass point in an external field it is sufficient to find an expression for the
vector of acceleration. If $\tau$ is the proper time, the four-dimensional velocity is

$$u^{\nu} = \frac{dx_{\nu}}{d\tau} \tag{46.16}$$

where

$$u_{\nu} u^{\nu} = c^2 \tag{46.17}$$

The acceleration vector $w^{\nu}$ will be given by the left-hand side of (38.34) so that

$$w^{\nu} = \frac{d^2 x_{\nu}}{d\tau^2} + \Gamma^{\nu}_{\alpha\beta}\,\frac{dx_{\alpha}}{d\tau}\,\frac{dx_{\beta}}{d\tau} \tag{46.18}$$

This quantity is indeed a vector and in Galilean coordinates it goes over into the
ordinary expression for acceleration. As for the expression for the Lorentz
force, we have already stated it in a covariant form in Section 25. Therefore
the equations of motion will be

(46.19)

where the $w_{\nu}$ are the covariant components of acceleration given by

$$w_{\nu} = g_{\nu\mu}\,w^{\mu} \tag{46.20}$$

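In Section 33 we considered the energy tensor of the electromagnetic field
and represented the density of Lorentz force as its (negative) divergence. The
corresponding generally covariant equations are easy to state.

The generalizations of the expressions for the energy tensor of the
electromagnetic field are

$$U_{\mu\nu} = -\frac{1}{4\pi}\,g^{\alpha\beta} F_{\mu\alpha} F_{\nu\beta} + \frac{1}{16\pi}\,g_{\mu\nu} F_{\alpha\beta} F^{\alpha\beta} \tag{46.21}$$

and

$$U^{\mu\nu} = -\frac{1}{4\pi}\,g_{\alpha\beta} F^{\mu\alpha} F^{\nu\beta} + \frac{1}{16\pi}\,g^{\mu\nu} F_{\alpha\beta} F^{\alpha\beta} \tag{46.22}$$

Using the Maxwell-Lorentz equations (46.08) and (46.14) we get, after some
calculation:

$$\nabla_{\nu} U^{\mu\nu} = \frac{\rho^*}{c}\,F^{\mu\nu} u_{\nu} \tag{46.23}$$

On the right-hand side we have here the negative of the Lorentz force density.
The generally covariant equations of motion for a continuous distribution of
charge can be written as in (33.09) in the form

$$\mu^* w^{\rho} = -\frac{\rho^*}{c}\,F^{\rho\sigma} u_{\sigma} \tag{46.24}$$

where $\mu^*$ is the invariant rest-mass density which satisfies the equation

$$\nabla_{\sigma}(\mu^* u^{\sigma}) = 0 \tag{46.25}$$

As for the acceleration $w^{\rho}$, according to (46.18), it can be written as

$$w^{\rho} = \frac{du^{\rho}}{d\tau} + \Gamma^{\rho}_{\alpha\beta}\,u^{\alpha} u^{\beta} \tag{46.26}$$

or

$$w^{\rho} = u^{\sigma}\,\frac{\partial u^{\rho}}{\partial x_{\sigma}} + \Gamma^{\rho}_{\alpha\beta}\,u^{\alpha} u^{\beta} \tag{46.27}$$

The last expression is equal to

$$w^{\rho} = u^{\sigma}\,\nabla_{\sigma} u^{\rho} \tag{46.28}$$

where $\nabla_{\sigma} u^{\rho}$ is a covariant derivative.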

If in analogy to (37.01) we introduce a mass tensor for the charged particles,
namely

$$\Theta^{\rho\sigma} = \frac{1}{c^2}\,\mu^* u^{\rho} u^{\sigma} \tag{46.29}$$

and if we use the law of rest-mass conservation (46.25) we can write

$$\mu^* w^{\rho} = c^2\,\nabla_{\sigma}\Theta^{\rho\sigma} \tag{46.30}$$

In the equations of motion (46.24) we can express the left-hand side in the form
(46.30) and we can use (46.23) on the right. Then we get

$$\nabla_{\sigma} T^{\rho\sigma} = 0 \tag{46.31}$$

where

$$c^2\,T^{\rho\sigma} = \mu^* u^{\rho} u^{\sigma} + U^{\rho\sigma} \tag{46.32}$$

which is $c^2$ times the mass tensor of the system of particles and field.

In a similar way we can write down the generally covariant equation of
motion of a continuous medium of the type of an ideal fluid. Denoting by
$\mu^*$ the invariant rest-mass density, including that part of it which changes as
a result of changes in energy of compression, we get, as in (32.17), for the
complete energy tensor, i.e. for $c^2$ times the mass tensor:

$$c^2\,T^{\rho\sigma} = \left(\mu^* + \frac{p}{c^2}\right) u^{\rho} u^{\sigma} - p\,g^{\rho\sigma} \tag{46.33}$$

This also satisfies the conservation law (46.31). We note that equation (46.25)
will now no longer be satisfied but only a similar equation for a quantity $\rho^*$
related to $\mu^*$ by (32.25) and representing the invariant density of that part
of the rest-mass which is conserved.

In Section 31 we established a general rule according to which the mass
tensor must be a function of the state of the system and cannot depend
explicitly on the coordinates. This rule referred to Galilean coordinates, in
which the $g_{\mu\nu}$ have given constant values. We can retain this rule for arbitrary
coordinates if we agree to include among our functions of state the $g_{\mu\nu}$. (The
identities (40.14) and (40.16), which express Ricci’s lemma, will formally play
the part of field equations for the $g_{\mu\nu}$.)

In all examples here considered, the mass tensor obeys this generalized rule.
For instance, in (46.33) the functions of state involved are the velocity
components $u^{\rho}$, the pressure $p$, the invariant density $\mu^*$ and the components of the
metric tensor $g_{\rho\sigma}$. (Not all these functions are independent.) In the energy
tensor of the electromagnetic field the functions of state are the field components
$F_{\alpha\beta}$ and the $g_{\mu\nu}$.


47. A Variational Principle for the Maxwell-Lorentz System of 
Equations 

Many equations of mathematical physics can be formulated as conditions 
that a certain integral, called the action integral, take on an extremal value. 
One of the simplest examples of such a formulation is given by the equation 
of a geodesic considered in Section 38. We shall now investigate the rather 
more complicated example of the Maxwell-Lorentz system of equations which 
describe the motion of a continuous charged medium, the elements of which 
interact by means of the electromagnetic field. 

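The Maxwell-Lorentz system of equations involves the field components
$F_{\mu\nu}$, the components $u^{\nu}$ of four-dimensional velocity, the invariant rest-mass
density $\mu^*$ and the invariant charge density $\rho^*$. They have the form

$$\frac{\partial F_{\mu\nu}}{\partial x_{\lambda}} + \frac{\partial F_{\nu\lambda}}{\partial x_{\mu}} + \frac{\partial F_{\lambda\mu}}{\partial x_{\nu}} = 0 \tag{47.01}$$

$$\nabla_{\nu} F^{\mu\nu} = \frac{4\pi}{c}\,\rho^* u^{\mu} \tag{47.02}$$

and

$$\mu^* w_{\sigma} + \frac{\rho^*}{c}\,F_{\sigma\alpha} u^{\alpha} = 0 \tag{47.03}$$

Here $w_{\sigma}$ is the covariant component of the acceleration vector of which the
contravariant components are

$$w^{\sigma} = u^{\nu}\,\nabla_{\nu} u^{\sigma} \tag{47.04}$$

The invariant densities $\mu^*$ and $\rho^*$ satisfy the equations

$$\nabla_{\alpha}(\mu^* u^{\alpha}) = 0 \tag{47.05}$$

and

$$\nabla_{\alpha}(\rho^* u^{\alpha}) = 0 \tag{47.06}$$

the second of which is a consequence of (47.02).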

We must seek to construct an action integral in such a way that variations 
with respect to the functions involved will give rise to the equations stated. 

We can simplify the task by the introduction of auxiliary functions chosen 
in such a way as to satisfy some of the equations identically; the variation 
must then give the remaining equations. 

With this aim we introduce the potentials and the positions of the particles
of the medium, and express the remaining functions in terms of these.

Putting

$$F_{\mu\nu} = \frac{\partial A_{\nu}}{\partial x_{\mu}} - \frac{\partial A_{\mu}}{\partial x_{\nu}} \tag{47.07}$$

we satisfy (47.01) identically and we shall regard the potentials $A_{\nu}$ as functions
to be varied.

For the description of the motion of the medium we introduce Lagrange
variables $a_1$, $a_2$, $a_3$ and $a_0 = p$, where $a_1$, $a_2$ and $a_3$ are, for instance, the initial
coordinates of a particle of the medium and $p$ is a parameter of the nature of
a time. The functions to be subjected to variation will be

$$x_{\alpha} = f^{\alpha}(p, a_1, a_2, a_3) \tag{47.08}$$

For constant $a_1$, $a_2$ and $a_3$ and varying $p$ they give the motion of a particular
particle of the fluid. A variation of these functions, i.e. a variation of the
positions, will be called a displacement and denoted by $\xi^{\alpha}$:

$$\xi^{\alpha} = \delta f^{\alpha}(p, a_1, a_2, a_3) \tag{47.09}$$

This notation is justified because such variations represent an infinitesimal
contravariant vector. We shall, in general, think of the $\xi^{\alpha}$ as functions not of the
Lagrange variables $(p, a_1, a_2, a_3)$ but of the coordinates $(x_0, x_1, x_2, x_3)$ which
are related to the Lagrange variables by (47.08).

The components of the four-dimensional velocity can be expressed in terms
of the Lagrange variables as

$$u^{\alpha} = \frac{c\,\dfrac{\partial f^{\alpha}}{\partial p}}{\sqrt{g_{\mu\nu}\,\dfrac{\partial f^{\mu}}{\partial p}\,\dfrac{\partial f^{\nu}}{\partial p}}} \tag{47.10}$$

so that the relation

$$u_{\alpha} u^{\alpha} = c^2 \tag{47.11}$$

is identically satisfied. In (47.10) the $g_{\mu\nu}$ have as their arguments the $x_{\alpha} = f^{\alpha}$,
so that when $f^{\alpha}$ is varied the change in the $g_{\mu\nu}$ must be taken into account.

The introduction of Lagrange variables causes the equation of continuity
(47.05) to be identically satisfied in terms of the expressions (47.10) for the
velocity and the expression for the invariant density determined from the
relation

1 *V(-J)U - (47.12) 

Here $I$ is the absolute value of the Jacobian

$$I = \frac{D(x_0, x_1, x_2, x_3)}{D(p, a_1, a_2, a_3)} \tag{47.13}$$

We denote by $\delta_1 u^{\alpha}$ and $\delta_1\mu^*$ the variations due to the change in the form
of the functions $f^{\alpha}$. The quantity $u^{\alpha} + \delta_1 u^{\alpha}$ is the velocity of the varied motion
at the point $x_{\alpha} + \xi^{\alpha}$. On the other hand, the velocity of the varied motion
at the point $x_{\alpha}$ is

$$u^{\alpha} + \delta u^{\alpha} = u^{\alpha} - \frac{\partial u^{\alpha}}{\partial x_{\sigma}}\,\xi^{\sigma} + \delta_1 u^{\alpha} \tag{47.14}$$

Thus the variation $\delta u^{\alpha}$ which describes the change of the velocity field is

$$\delta u^{\alpha} = \delta_1 u^{\alpha} - \frac{\partial u^{\alpha}}{\partial x_{\sigma}}\,\xi^{\sigma} \tag{47.15}$$

We note that the variation $\delta_1 u^{\alpha}$ is the difference of the values of two vectors
taken at different points and is, therefore, not a vector. In contrast $\delta u^{\alpha}$ is the
difference of the values of two vectors at one and the same point and is,
therefore, a vector. Similarly we can introduce variations $\delta_1\mu^*$ and $\delta\mu^*$ for the
invariant density. They are related by

$$\delta\mu^* = \delta_1\mu^* - \frac{\partial\mu^*}{\partial x_{\sigma}}\,\xi^{\sigma} \tag{47.16}$$


For the calculation of we must find the variations of both sides of (47.12). 
We have 

,i IL m - 1 i 1 %>•'. v- .. 


s| L a A S M _ 1 (\ S A S A |„ 

HV V 1 ” 8 p/i V{9 a &( d fJ 8 P)( d fJ 8 P)}\ 2 dx 0 dp dp iL ° dp dp) 

(47.17) 

Inserting 

81° 81° 8f 

(47 - 18) 

and using the expression (47.10) for the velocities, we get 


(47.18) 




(47.19) 


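Using the relation

$$\nabla_{\nu}\xi^{\sigma} = \frac{\partial\xi^{\sigma}}{\partial x_{\nu}} + \Gamma^{\sigma}_{\nu\rho}\,\xi^{\rho} \tag{47.20}$$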

for the covariant derivative we can write the foregoing equation in the form 

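The variation of the right-hand side of (47.12) will be proportional to this
expression. Further, the varied value of the Jacobian $I$ will be

$$I + \delta_1 I = \frac{D(x_0 + \xi^0, x_1 + \xi^1, x_2 + \xi^2, x_3 + \xi^3)}{D(p, a_1, a_2, a_3)} = \frac{D(x_0 + \xi^0, \ldots)}{D(x_0, \ldots)}\,I = \left(1 + \frac{\partial\xi^{\alpha}}{\partial x_{\alpha}}\right) I \tag{47.22}$$

whence

$$\delta_1 I = \frac{\partial\xi^{\alpha}}{\partial x_{\alpha}}\,I \tag{47.23}$$

We also have

$$\delta_1\sqrt{-g} = \frac{\partial\sqrt{-g}}{\partial x_{\alpha}}\,\xi^{\alpha} \tag{47.24}$$

and, therefore,

$$\delta_1\{\sqrt{-g}\cdot I\} = \frac{\partial(\sqrt{-g}\,\xi^{\alpha})}{\partial x_{\alpha}}\,I \tag{47.25}$$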


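or

$$\delta_1(\sqrt{-g}\,I) = \nabla_{\alpha}\xi^{\alpha}\cdot\sqrt{-g}\,I \tag{47.26}$$

Using (47.21) and (47.26) we can write the variation $\delta_1$ of the logarithms of
both sides in the form

$$\frac{\delta_1\mu^*}{\mu^*} + \nabla_{\sigma}\xi^{\sigma} = \frac{1}{c^2}\,u_{\sigma} u^{\nu}\,\nabla_{\nu}\xi^{\sigma} \tag{47.27}$$

whence

$$\delta_1\mu^* = -\mu^*\,\nabla_{\sigma}\xi^{\sigma} + \frac{\mu^*}{c^2}\,u_{\sigma} u^{\nu}\,\nabla_{\nu}\xi^{\sigma} \tag{47.28}$$

and by (47.16)

$$\delta\mu^* = -\nabla_{\sigma}(\mu^*\xi^{\sigma}) + \frac{\mu^*}{c^2}\,u_{\sigma} u^{\nu}\,\nabla_{\nu}\xi^{\sigma} \tag{47.29}$$

In calculating the variation $\delta_1$ of the expression (47.10) for $u^{\alpha}$ we can use the
result (47.21). The calculation gives

$$\delta_1 u^{\alpha} = u^{\sigma}\,\frac{\partial\xi^{\alpha}}{\partial x_{\sigma}} - \frac{1}{c^2}\,u^{\alpha} u_{\sigma} u^{\nu}\,\nabla_{\nu}\xi^{\sigma} \tag{47.30}$$

and, from (47.15),

$$\delta u^{\alpha} = u^{\sigma}\,\frac{\partial\xi^{\alpha}}{\partial x_{\sigma}} - \xi^{\sigma}\,\frac{\partial u^{\alpha}}{\partial x_{\sigma}} - \frac{1}{c^2}\,u^{\alpha} u_{\sigma} u^{\nu}\,\nabla_{\nu}\xi^{\sigma} \tag{47.31}$$

The last expression can be written in the form

$$\delta u^{\alpha} = u^{\sigma}\,\nabla_{\sigma}\xi^{\alpha} - \xi^{\sigma}\,\nabla_{\sigma} u^{\alpha} - \frac{1}{c^2}\,u^{\alpha} u_{\sigma} u^{\nu}\,\nabla_{\nu}\xi^{\sigma} \tag{47.32}$$

whence it is evident that $\delta u^{\alpha}$ is a vector. We also have

$$u_{\alpha}\,\delta u^{\alpha} = 0 \tag{47.33}$$

in agreement with (47.11).

As the charge density $\rho^*$ satisfies the same form of continuity equation
(47.06) as the mass-density $\mu^*$ its variation will have the same form as (47.29),
namely

$$\delta\rho^* = -\nabla_{\sigma}(\rho^*\xi^{\sigma}) + \frac{\rho^*}{c^2}\,u_{\sigma} u^{\nu}\,\nabla_{\nu}\xi^{\sigma} \tag{47.34}$$

Combining (47.32) and (47.34) and using (47.06) we get

$$\delta(\rho^* u^{\alpha}) = \nabla_{\sigma}(\rho^* u^{\sigma}\xi^{\alpha} - \rho^* u^{\alpha}\xi^{\sigma}) \tag{47.35}$$

or

$$\delta(\rho^* u^{\alpha}) = \frac{1}{\sqrt{-g}}\,\frac{\partial}{\partial x_{\sigma}}\left[\sqrt{-g}\,\rho^*(u^{\sigma}\xi^{\alpha} - u^{\alpha}\xi^{\sigma})\right] \tag{47.36}$$

whence it follows that the varied current vector also satisfies an equation of
continuity, as was to be expected.

We note that if the displacements $\xi^{\sigma}$ are proportional to the velocity, so
that $\xi^{\sigma} = \tau u^{\sigma}$ where $\tau$ is an arbitrary function of the coordinates, the varied
velocity field and also the varied density do not differ from the non-varied
ones. This can be verified by a direct calculation; putting $\xi^{\sigma} = \tau u^{\sigma}$ into our
equations gives $\delta u^{\alpha} = 0$, $\delta\mu^* = 0$ and $\delta\rho^* = 0$.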
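The direct calculation for the velocity can be sketched as follows. The steps assume that the variation of the velocity field has the form $\delta u^{\alpha} = u^{\sigma}\nabla_{\sigma}\xi^{\alpha} - \xi^{\sigma}\nabla_{\sigma}u^{\alpha} - c^{-2}\,u^{\alpha}u_{\sigma}u^{\nu}\nabla_{\nu}\xi^{\sigma}$ and use only $u_{\sigma}u^{\sigma} = c^2$, from which $u_{\sigma}\nabla_{\nu}u^{\sigma} = 0$ follows by differentiation.

```latex
\begin{align*}
u^{\sigma}\nabla_{\sigma}(\tau u^{\alpha})
  &= u^{\alpha}\,u^{\sigma}\partial_{\sigma}\tau + \tau\,u^{\sigma}\nabla_{\sigma}u^{\alpha}, \\
\xi^{\sigma}\nabla_{\sigma}u^{\alpha}
  &= \tau\,u^{\sigma}\nabla_{\sigma}u^{\alpha}, \\
\frac{1}{c^{2}}\,u^{\alpha}u_{\sigma}u^{\nu}\nabla_{\nu}(\tau u^{\sigma})
  &= \frac{1}{c^{2}}\,u^{\alpha}\left(c^{2}\,u^{\nu}\partial_{\nu}\tau
     + \tau\,u^{\nu}u_{\sigma}\nabla_{\nu}u^{\sigma}\right)
   = u^{\alpha}\,u^{\nu}\partial_{\nu}\tau, \\
\delta u^{\alpha}
  &= u^{\alpha}\,u^{\sigma}\partial_{\sigma}\tau + \tau\,u^{\sigma}\nabla_{\sigma}u^{\alpha}
     - \tau\,u^{\sigma}\nabla_{\sigma}u^{\alpha} - u^{\alpha}\,u^{\nu}\partial_{\nu}\tau = 0 .
\end{align*}
```

The densities are treated in the same way: with the continuity equation the term $-\nabla_{\sigma}(\mu^*\tau u^{\sigma})$ reduces to $-\mu^* u^{\sigma}\partial_{\sigma}\tau$, which is cancelled by $c^{-2}\mu^* u_{\sigma}u^{\nu}\nabla_{\nu}(\tau u^{\sigma}) = \mu^* u^{\nu}\partial_{\nu}\tau$.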

Having calculated the variations suffered by all quantities as a result of
displacements it is easy to verify that the field equations (47.02) and the
equations of motion (47.03) represent the conditions that the integral

$$S = \int\left(c^2\mu^* - \frac{\rho^*}{c}\,u^{\alpha} A_{\alpha} + \frac{1}{16\pi}\,F_{\alpha\beta} F^{\alpha\beta}\right)\sqrt{-g}\,(dx) \tag{47.37}$$

should have an extreme value. For brevity we have written

$$(dx) = dx_0\,dx_1\,dx_2\,dx_3 \tag{47.38}$$

The integral (47.37) is to be taken over some four-dimensional region on the
boundaries of which the variations $\xi^{\alpha}$ and $\delta A_{\alpha}$ vanish.

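We calculate first the variation of the last term in (47.37). As is readily
verified

$$\delta(F_{\alpha\beta} F^{\alpha\beta}) = 2F^{\alpha\beta}\,\delta F_{\alpha\beta} \tag{47.39}$$

Therefore

$$\frac{1}{16\pi}\,\delta\int F_{\alpha\beta} F^{\alpha\beta}\sqrt{-g}\,(dx) = \frac{1}{8\pi}\int F^{\alpha\beta}\left(\frac{\partial\,\delta A_{\beta}}{\partial x_{\alpha}} - \frac{\partial\,\delta A_{\alpha}}{\partial x_{\beta}}\right)\sqrt{-g}\,(dx) \tag{47.40}$$

Since the tensor $F^{\alpha\beta}$ is antisymmetric the contributions of both terms in the
bracket are equal, therefore we have

$$\frac{1}{16\pi}\,\delta\int F_{\alpha\beta} F^{\alpha\beta}\sqrt{-g}\,(dx) = \frac{1}{4\pi}\int F^{\alpha\beta}\,\frac{\partial\,\delta A_{\beta}}{\partial x_{\alpha}}\,\sqrt{-g}\,(dx) = -\frac{1}{4\pi}\int\frac{\partial(\sqrt{-g}\,F^{\alpha\beta})}{\partial x_{\alpha}}\,\delta A_{\beta}\,(dx) \tag{47.41}$$

after an integration by parts.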

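The variation of the second term in (47.37) is, by (47.36), equal to

$$-\frac{1}{c}\,\delta\int\rho^* u^{\alpha} A_{\alpha}\sqrt{-g}\,(dx) = -\frac{1}{c}\int\rho^* u^{\alpha}\,\delta A_{\alpha}\sqrt{-g}\,(dx) - \frac{1}{c}\int A_{\alpha}\,\frac{\partial}{\partial x_{\sigma}}\left[\sqrt{-g}\,\rho^*(u^{\sigma}\xi^{\alpha} - u^{\alpha}\xi^{\sigma})\right](dx) \tag{47.42}$$

Performing an integration by parts we obtain from the last integral in (47.42)

$$\frac{1}{c}\int\rho^*(u^{\sigma}\xi^{\alpha} - u^{\alpha}\xi^{\sigma})\,\frac{\partial A_{\alpha}}{\partial x_{\sigma}}\,\sqrt{-g}\,(dx) = -\frac{1}{c}\int\rho^* u^{\alpha}\xi^{\sigma}\left(\frac{\partial A_{\alpha}}{\partial x_{\sigma}} - \frac{\partial A_{\sigma}}{\partial x_{\alpha}}\right)\sqrt{-g}\,(dx) = -\frac{1}{c}\int\rho^* u^{\alpha} F_{\sigma\alpha}\,\xi^{\sigma}\sqrt{-g}\,(dx) \tag{47.43}$$

Thus the second term gives

$$-\frac{1}{c}\,\delta\int\rho^* u^{\alpha} A_{\alpha}\sqrt{-g}\,(dx) = -\frac{1}{c}\int\rho^* u^{\alpha}\left(\delta A_{\alpha} + F_{\sigma\alpha}\,\xi^{\sigma}\right)\sqrt{-g}\,(dx) \tag{47.44}$$

Finally, the variation of the first term in (47.37) is, by (47.29),

$$\delta\int c^2\mu^*\sqrt{-g}\,(dx) = \int\left[-c^2\,\nabla_{\sigma}(\mu^*\xi^{\sigma}) + \mu^* u_{\sigma} u^{\nu}\,\nabla_{\nu}\xi^{\sigma}\right]\sqrt{-g}\,(dx) \tag{47.45}$$

After integration by parts the term on the right proportional to $c^2$ vanishes
and the remaining term gives

$$\delta\int c^2\mu^*\sqrt{-g}\,(dx) = -\int\mu^*(u^{\nu}\nabla_{\nu} u_{\sigma})\,\xi^{\sigma}\sqrt{-g}\,(dx) \tag{47.46}$$

For, as a result of the equation of continuity (47.05), we have

$$\mu^* u_{\sigma} u^{\nu}\,\nabla_{\nu}\xi^{\sigma} = \nabla_{\nu}(\mu^* u_{\sigma} u^{\nu}\xi^{\sigma}) - \mu^*(u^{\nu}\nabla_{\nu} u_{\sigma})\,\xi^{\sigma} \tag{47.47}$$

and the integral of the first term is zero.

Following (47.04) we introduce the notation

$$w_{\sigma} = u^{\nu}\,\nabla_{\nu} u_{\sigma} \tag{47.48}$$

for the acceleration and get

$$\delta\int c^2\mu^*\sqrt{-g}\,(dx) = -\int\mu^* w_{\sigma}\,\xi^{\sigma}\sqrt{-g}\,(dx) \tag{47.49}$$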

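Combining equations (47.41), (47.44) and (47.49) we obtain the following
final expression for the variation of the action integral (47.37):

$$\delta S = -\int \left(\mu^{*}w_{\sigma} + \frac{\rho^{*}}{c}F_{\sigma\alpha}u^{\alpha}\right)\xi^{\sigma}\sqrt{-g}\,(dx)
 + \int \left(-\frac{\rho^{*}}{c}u^{\alpha} + \frac{1}{4\pi}\nabla_{\beta}F^{\alpha\beta}\right)\delta A_{\alpha}\sqrt{-g}\,(dx) \tag{47.50}$$

Here the coefficients of $\xi^{\sigma}$ and of $\delta A_{\alpha}$ are zero by the equations of motion
(47.03) and the field equations (47.02). Thus we have proved that

$$\delta S = 0 \tag{47.51}$$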

Conversely, from the condition that the action integral

$$S = \int \left(c^{2}\mu^{*} - \frac{\rho^{*}}{c}u^{\alpha}A_{\alpha}
 + \frac{1}{16\pi}F_{\alpha\beta}F^{\alpha\beta}\right)\sqrt{-g}\,(dx) \tag{47.37}$$

be an extremum and, using the fact that the variations $\xi^{\sigma}$ and $\delta A_{\alpha}$ are arbitrary,
we can conclude that the equations of motion and the field equations are
satisfied.

The circumstance that a variation of the form $\xi^{\sigma} = \tau u^{\sigma}$ does not change the
velocity field gives rise to the relation

$$u^{\sigma}\left(\mu^{*}w_{\sigma} + \frac{\rho^{*}}{c}F_{\sigma\alpha}u^{\alpha}\right) = 0 \tag{47.52}$$

which is valid identically, i.e. independently of the equations of motion being
satisfied. This relation shows that of the four equations of motion only three
are independent. The fact that a variation of the form
$\delta A_{\alpha} = \partial\varphi/\partial x_{\alpha}$ does not change the electromagnetic field
gives rise to the relation

$$\nabla_{\alpha}\left(-\frac{\rho^{*}}{c}u^{\alpha} + \frac{1}{4\pi}\nabla_{\beta}F^{\alpha\beta}\right) = 0 \tag{47.53}$$

which is also identically fulfilled, i.e. independently of the field equations.
This relation shows that the four field equations are not independent but are
related by (47.53).
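
Both identities can also be checked directly, without reference to the
variational argument. The following sketch assumes the normalization
$u^{\sigma}u_{\sigma} = c^{2}$ and the continuity equation for the charge,
$\nabla_{\alpha}(\rho^{*}u^{\alpha}) = 0$, both taken over from the earlier part of this
section; it is offered as an illustration, not as the author's own derivation.

```latex
% (47.52): the two terms vanish separately
u^{\sigma}w_{\sigma} = u^{\nu}u^{\sigma}\nabla_{\nu}u_{\sigma}
  = \tfrac{1}{2}\,u^{\nu}\,\partial_{\nu}\!\left(u^{\sigma}u_{\sigma}\right) = 0,
\qquad
F_{\sigma\alpha}u^{\sigma}u^{\alpha} = 0 \quad (F_{\sigma\alpha} = -F_{\alpha\sigma}).

% (47.53): charge conservation plus antisymmetry of F^{\alpha\beta}
\nabla_{\alpha}(\rho^{*}u^{\alpha}) = 0,
\qquad
\nabla_{\alpha}\nabla_{\beta}F^{\alpha\beta}
  = \frac{1}{\sqrt{-g}}\,
    \frac{\partial^{2}\left(\sqrt{-g}\,F^{\alpha\beta}\right)}
         {\partial x_{\alpha}\,\partial x_{\beta}} = 0.
```

The last expression vanishes because a symmetric pair of derivatives is
contracted with an antisymmetric tensor.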

48. The Variational Principle and the Energy Tensor 

In Section 45 we pointed out that in the formulation of the Theory of
Relativity in arbitrary coordinates two points of view are possible. From the
first point of view the $g_{\mu\nu}$ are considered as given functions of the coordinates
which are obtained by the usual tensor rule from their Galilean values. From
the other the $g_{\mu\nu}$ are thought of as unknown functions subject to the equation

$$R_{\mu\nu,\alpha\beta} = 0 \tag{48.01}$$

and also to some inequalities which we have stated repeatedly.

When calculating the variation of the action integral we adopted the first
point of view and, accordingly, the form of the functions $g_{\mu\nu}$ was not varied.
We can, however, adopt the second point of view and vary the $g_{\mu\nu}$ also. This
will lead to the appearance of a third term in the expression (47.50) for $\delta S$
in addition to the two terms corresponding to the variations $\xi^{\sigma}$ and $\delta A_{\alpha}$, a
term arising from the variation of the $g_{\mu\nu}$. We shall now calculate this term.

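We shall denote the variations of quantities due to the variation of the
$g_{\mu\nu}$ by the symbol $\delta_{g}$. In (47.12) neither the Jacobian $I$ nor the function $F$
contains the $g_{\mu\nu}$, therefore

$$\frac{1}{\mu^{*}\sqrt{-g}}\,\delta_{g}\{\mu^{*}\sqrt{-g}\}
 = \frac{\dfrac{\partial f^{\mu}}{\partial p}\,\dfrac{\partial f^{\nu}}{\partial p}\,\delta g_{\mu\nu}}
        {2\,g_{\alpha\beta}\,\dfrac{\partial f^{\alpha}}{\partial p}\,\dfrac{\partial f^{\beta}}{\partial p}} \tag{48.02}$$

and consequently

$$\frac{1}{\mu^{*}\sqrt{-g}}\,\delta_{g}\{\mu^{*}\sqrt{-g}\} = \frac{u^{\mu}u^{\nu}}{2c^{2}}\,\delta g_{\mu\nu} \tag{48.03}$$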


Further, if we multiply (47.10) by (47.12) we obtain an expression that does
not contain the $g_{\mu\nu}$; the variation of this expression with respect to $g_{\mu\nu}$ will be
zero. Since the charge density $\rho^{*}$ satisfies the same continuity equation as
the mass density $\mu^{*}$, the variation of the corresponding expression with $\mu^{*}$
replaced by $\rho^{*}$ is also zero. Thus

$$\delta_{g}\{\rho^{*}u^{\alpha}\sqrt{-g}\} = 0 \tag{48.04}$$

and also

$$\delta_{g}\{\rho^{*}u^{\alpha}A_{\alpha}\sqrt{-g}\} = 0 \tag{48.05}$$

as the $A_{\alpha}$ are now not varied.

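Further,

$$\delta_{g}(F_{\alpha\beta}F^{\alpha\beta}) = F_{\alpha\beta}\,\delta_{g}F^{\alpha\beta} \tag{48.06}$$

since the $F_{\alpha\beta}$ are not varied, but only the $F^{\alpha\beta}$, for they are connected with the
$F_{\alpha\beta}$ by the relations

$$F^{\alpha\beta} = g^{\alpha\mu}g^{\beta\nu}F_{\mu\nu} \tag{48.07}$$

which contain the $g^{\mu\nu}$. The calculation gives

$$F_{\alpha\beta}\,\delta_{g}F^{\alpha\beta}
 = 2F_{\mu\alpha}F_{\nu\beta}\,g^{\alpha\beta}\,\delta g^{\mu\nu}
 = -2F^{\mu\alpha}F^{\nu\beta}g_{\alpha\beta}\,\delta g_{\mu\nu} \tag{48.08}$$

Since

$$\frac{\delta\sqrt{-g}}{\sqrt{-g}} = \frac{1}{2}\,g^{\mu\nu}\,\delta g_{\mu\nu} \tag{48.09}$$

we get

$$\frac{1}{\sqrt{-g}}\,\delta_{g}\{F_{\alpha\beta}F^{\alpha\beta}\sqrt{-g}\}
 = \left(-2F^{\mu\alpha}F^{\nu\beta}g_{\alpha\beta} + \frac{1}{2}F_{\alpha\beta}F^{\alpha\beta}g^{\mu\nu}\right)\delta g_{\mu\nu}
 = 8\pi\,U^{\mu\nu}\,\delta g_{\mu\nu} \tag{48.10}$$

where $U^{\mu\nu}$ is the expression (46.22) for the electromagnetic energy tensor.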

Equations (48.03), (48.05) and (48.10) result in the following additional term
in the variation of the action integral

$$\delta_{g}S = \frac{1}{2}\int \left(\mu^{*}u^{\mu}u^{\nu} + U^{\mu\nu}\right)\delta g_{\mu\nu}\sqrt{-g}\,(dx) \tag{48.11}$$

But according to (46.32) the expression in brackets under the integral is $c^{2}$
times the mass tensor of the system of particles and field:

$$c^{2}T^{\mu\nu} = \mu^{*}u^{\mu}u^{\nu} + U^{\mu\nu} \tag{48.12}$$

Thus, the expression for the mass tensor appears automatically when the action
integral is varied with respect to the components of the metric tensor.

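The complete variation of the action integral with respect to all functions
involved in it is

$$\delta S = \frac{1}{2}\int \left(\mu^{*}u^{\mu}u^{\nu} + U^{\mu\nu}\right)\delta g_{\mu\nu}\sqrt{-g}\,(dx)
 - \int \left(\mu^{*}w_{\sigma} + \frac{\rho^{*}}{c}F_{\sigma\alpha}u^{\alpha}\right)\xi^{\sigma}\sqrt{-g}\,(dx)
 + \int \left(-\frac{\rho^{*}}{c}u^{\alpha} + \frac{1}{4\pi}\nabla_{\beta}F^{\alpha\beta}\right)\delta A_{\alpha}\sqrt{-g}\,(dx) \tag{48.13}$$

The variations $\xi^{\sigma}$ and $\delta A_{\alpha}$ were completely arbitrary; equating their coefficients to
zero gave us the equation of motion and the field equations. The variations $\delta g_{\mu\nu}$,
however, are not arbitrary because the functions $g_{\mu\nu}$ are subject to the condition

$$R_{\mu\nu,\alpha\beta} = 0 \tag{48.01}$$

We shall now express the variations $\delta g_{\mu\nu}$ in terms of independent arbitrary
quantities.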

The most general form of functions $g_{\mu\nu}$ which satisfy (48.01) can be obtained
by coordinate transformation starting from the Galilean values of these
quantities, therefore all possible forms of the $g_{\mu\nu}$ compatible with (48.01) can
also be obtained one from the other by coordinate transformations. Thus
the permissible infinitesimal variations of these functions must correspond to
infinitesimal coordinate transformations.

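Let an infinitesimal coordinate transformation be written in the form

$$x'_{\alpha} = x_{\alpha} + \eta^{\alpha} \tag{48.14}$$

where $\eta^{\alpha}$ is an arbitrary infinitesimal vector whose components are functions
of $x_{0}$, $x_{1}$, $x_{2}$ and $x_{3}$. According to the general transformation formula for
tensors we have

$$g'^{\mu\nu}(x') = g^{\alpha\beta}(x)\,\frac{\partial x'_{\mu}}{\partial x_{\alpha}}\,\frac{\partial x'_{\nu}}{\partial x_{\beta}} \tag{48.15}$$

and, ignoring higher order infinitesimals,

$$g'^{\mu\nu}(x') = g^{\mu\nu}(x) + g^{\mu\alpha}\frac{\partial\eta^{\nu}}{\partial x_{\alpha}}
 + g^{\nu\alpha}\frac{\partial\eta^{\mu}}{\partial x_{\alpha}} \tag{48.16}$$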


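To obtain the variation, i.e. the change of the form of the functions $g^{\mu\nu}$, one
must compare $g'^{\mu\nu}$ and $g^{\mu\nu}$ for the same values of the arguments.

As we have

$$g'^{\mu\nu}(x') = g'^{\mu\nu}(x) + \frac{\partial g^{\mu\nu}}{\partial x_{\alpha}}\,\eta^{\alpha} \tag{48.17}$$

(having replaced $g'^{\mu\nu}$ by $g^{\mu\nu}$ in the correction term), it follows that

$$\delta g^{\mu\nu} \equiv g'^{\mu\nu}(x) - g^{\mu\nu}(x)
 = g^{\mu\alpha}\frac{\partial\eta^{\nu}}{\partial x_{\alpha}}
 + g^{\nu\alpha}\frac{\partial\eta^{\mu}}{\partial x_{\alpha}}
 - \frac{\partial g^{\mu\nu}}{\partial x_{\alpha}}\,\eta^{\alpha} \tag{48.18}$$

This equation can be written as

$$\delta g^{\mu\nu} = g^{\mu\alpha}\nabla_{\alpha}\eta^{\nu} + g^{\nu\alpha}\nabla_{\alpha}\eta^{\mu} \tag{48.19}$$

or, more briefly, as

$$\delta g^{\mu\nu} = \nabla^{\mu}\eta^{\nu} + \nabla^{\nu}\eta^{\mu} \tag{48.20}$$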
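In the case of the Maxwell-Lorentz equations we verified this by direct
calculation (see equation (48.12)).

Let us illustrate the above reasoning with another example, namely, the
equations of hydrodynamics. In this case the action integral will be of the
form

$$S = \int \left(\rho^{*}c^{2} + \rho^{*}\Pi\right)\sqrt{-g}\,(dx) \tag{48.28}$$

Here $\rho^{*}$ is the invariant density of that part of the rest mass which does not
change in the motion and which satisfies the conservation law

$$\nabla_{\nu}(\rho^{*}u^{\nu}) = 0 \tag{48.29}$$

(see Section 32) and $\Pi$ is the potential energy per unit mass of elastic
compression of the fluid

$$\Pi = \int_{0}^{p}\frac{dp}{\rho^{*}} - \frac{p}{\rho^{*}}; \qquad d\Pi = \frac{p}{\rho^{*2}}\,d\rho^{*} \tag{48.30}$$

$p$ being the pressure.

It is simple to calculate the variation of the integral $S$ from the expressions
for the variation of density derived above. Using equations (47.29), (48.03)
and (48.09), in which we now write $\rho^{*}$ instead of $\mu^{*}$, the total variation becomes

$$\delta\rho^{*} = -\nabla_{\sigma}(\rho^{*}\xi^{\sigma}) + \frac{\rho^{*}}{c^{2}}\,u_{\sigma}u^{\nu}\nabla_{\nu}\xi^{\sigma}
 + \frac{\rho^{*}}{2}\left(\frac{u^{\mu}u^{\nu}}{c^{2}} - g^{\mu\nu}\right)\delta g_{\mu\nu} \tag{48.31}$$

and, as before,

$$\frac{\delta\sqrt{-g}}{\sqrt{-g}} = \frac{1}{2}\,g^{\mu\nu}\,\delta g_{\mu\nu} \tag{48.32}$$

Putting

$$F(\rho^{*}) = \rho^{*}c^{2} + \rho^{*}\Pi \tag{48.33}$$

we have

$$\delta S = \delta\!\int F(\rho^{*})\sqrt{-g}\,(dx)
 = \int \left[F'(\rho^{*})\,\delta\rho^{*} + F(\rho^{*})\,\frac{\delta\sqrt{-g}}{\sqrt{-g}}\right]\sqrt{-g}\,(dx) \tag{48.34}$$

Inserting (48.31) and (48.32) and integrating by parts we get

$$\delta S = \frac{1}{2}\int \left[(F - \rho^{*}F')\,g^{\mu\nu} + \rho^{*}F'\,\frac{u^{\mu}u^{\nu}}{c^{2}}\right]\delta g_{\mu\nu}\sqrt{-g}\,(dx)
 - \int \left[\frac{\rho^{*}F'}{c^{2}}\,w_{\sigma}
 - \rho^{*}F''\left(\frac{\partial\rho^{*}}{\partial x_{\sigma}} - \frac{u_{\sigma}u^{\nu}}{c^{2}}\frac{\partial\rho^{*}}{\partial x_{\nu}}\right)\right]\xi^{\sigma}\sqrt{-g}\,(dx) \tag{48.35}$$

where $w_{\sigma}$ is the acceleration (47.48). Using the relations

$$\rho^{*}F' - F = p; \qquad \rho^{*}F''\,d\rho^{*} = dp \tag{48.36}$$

which follow from (48.30) and (48.33) we can replace (48.35) by

$$\delta S = \frac{1}{2}\int \left\{\left[\rho^{*} + \frac{1}{c^{2}}(\rho^{*}\Pi + p)\right]u^{\mu}u^{\nu} - p\,g^{\mu\nu}\right\}\delta g_{\mu\nu}\sqrt{-g}\,(dx)
 - \int \left\{\left[\rho^{*} + \frac{1}{c^{2}}(\rho^{*}\Pi + p)\right]w_{\sigma}
 - \frac{\partial p}{\partial x_{\sigma}} + \frac{u_{\sigma}u^{\nu}}{c^{2}}\frac{\partial p}{\partial x_{\nu}}\right\}\xi^{\sigma}\sqrt{-g}\,(dx) \tag{48.37}$$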
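The analogous equation

$$\delta g_{\mu\nu} = -\nabla_{\mu}\eta_{\nu} - \nabla_{\nu}\eta_{\mu} \tag{48.21}$$

can be obtained either independently by the same method or by use of the
relation $g_{\mu\alpha}g^{\alpha\nu} = \delta_{\mu}^{\nu}$.

Rewriting (48.11) as

$$\delta_{g}S = \frac{c^{2}}{2}\int T^{\mu\nu}\,\delta g_{\mu\nu}\sqrt{-g}\,(dx) \tag{48.22}$$

and inserting (48.21) we get

$$\delta_{g}S = -\frac{c^{2}}{2}\int T^{\mu\nu}(\nabla_{\mu}\eta_{\nu} + \nabla_{\nu}\eta_{\mu})\sqrt{-g}\,(dx)
 = -c^{2}\int T^{\mu\nu}\,\nabla_{\nu}\eta_{\mu}\sqrt{-g}\,(dx) \tag{48.23}$$

since the tensor $T^{\mu\nu}$ is symmetric. After integration by parts we get

$$\delta_{g}S = c^{2}\int (\nabla_{\nu}T^{\mu\nu})\,\eta_{\mu}\sqrt{-g}\,(dx) \tag{48.24}$$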
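
The integration by parts leading from (48.23) to (48.24) can be spelled out as
follows. This is a sketch under the assumption, implicit in the text, that the
$\eta_{\mu}$ vanish on the boundary of the region of integration.

```latex
T^{\mu\nu}\nabla_{\nu}\eta_{\mu}
  = \nabla_{\nu}\!\left(T^{\mu\nu}\eta_{\mu}\right)
  - \left(\nabla_{\nu}T^{\mu\nu}\right)\eta_{\mu},
\qquad
\int \nabla_{\nu}\!\left(T^{\mu\nu}\eta_{\mu}\right)\sqrt{-g}\,(dx)
  = \int \frac{\partial}{\partial x_{\nu}}
    \left(\sqrt{-g}\,T^{\mu\nu}\eta_{\mu}\right)(dx) = 0,
% hence
-c^{2}\int T^{\mu\nu}\nabla_{\nu}\eta_{\mu}\sqrt{-g}\,(dx)
  = c^{2}\int \left(\nabla_{\nu}T^{\mu\nu}\right)\eta_{\mu}\sqrt{-g}\,(dx).
```

The middle step uses the fact that the covariant divergence of a vector,
multiplied by $\sqrt{-g}$, is an ordinary divergence.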

If it is assumed that the divergence of the mass tensor vanishes, it can be
concluded from (48.24) that the action integral also has an extremal value for
permissible variations of the $g_{\mu\nu}$. However, a reversal of the argument is also
possible and the equation

$$\nabla_{\nu}T^{\mu\nu} = 0 \tag{48.25}$$

can be deduced from the properties of the action integral, $T^{\mu\nu}$ being the tensor
that appears as the coefficient of $\delta g_{\mu\nu}$ in the expression for the complete variation
of $S$. For the action integral is a scalar, or invariant, and therefore does not
change for an infinitesimal or any other coordinate transformation. The
expression for its complete variation

$$\delta S = c^{2}\int (\nabla_{\nu}T^{\mu\nu})\,\eta_{\mu}\sqrt{-g}\,(dx)
 - \int \left(\mu^{*}w_{\sigma} + \frac{\rho^{*}}{c}F_{\sigma\alpha}u^{\alpha}\right)\xi^{\sigma}\sqrt{-g}\,(dx)
 + \int \left(-\frac{\rho^{*}}{c}u^{\alpha} + \frac{1}{4\pi}\nabla_{\beta}F^{\alpha\beta}\right)\delta A_{\alpha}\sqrt{-g}\,(dx) \tag{48.26}$$

must necessarily vanish, if the variations $\xi^{\sigma}$ and $\delta A_{\alpha}$ occur as a result of the
infinitesimal transformation $\eta^{\mu}$. This conclusion follows from the invariance
property alone and is not dependent on whether $S$ is the action integral or any
other invariant integral. But, if $S$ is the action integral, the coefficients of
$\xi^{\sigma}$ and $\delta A_{\alpha}$ are separately equal to zero. Therefore, the remaining terms in $\delta S$
must also be zero and we have

$$\delta_{g}S = c^{2}\int (\nabla_{\nu}T^{\mu\nu})\,\eta_{\mu}\sqrt{-g}\,(dx) = 0 \tag{48.27}$$

for arbitrary $\eta_{\mu}$. Hence (48.25) follows.

The fact that the tensor $T^{\mu\nu}$, defined as the coefficient of $\delta g_{\mu\nu}$, is the mass
tensor or proportional to it, follows from the uniqueness of the mass tensor
established in Section 31, under the condition that its components are functions
of the state of the system; here the $g_{\mu\nu}$ are to be included among such functions.
Hence we conclude that the equations of motion have the form

$$\left[\rho^{*} + \frac{1}{c^{2}}(\rho^{*}\Pi + p)\right]w_{\sigma}
 = \frac{\partial p}{\partial x_{\sigma}} - \frac{u_{\sigma}u^{\nu}}{c^{2}}\frac{\partial p}{\partial x_{\nu}} \tag{48.38}$$

and that the energy tensor is

$$c^{2}T^{\mu\nu} = \left[\rho^{*} + \frac{1}{c^{2}}(\rho^{*}\Pi + p)\right]u^{\mu}u^{\nu} - p\,g^{\mu\nu} \tag{48.39}$$



We know also that the divergence of this tensor vanishes. 

These expressions are a generalization of (32.29) and (32.28) and reduce to 
them in Galilean coordinates. 
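
The two relations $\rho^{*}F' - F = p$ and $\rho^{*}F''\,d\rho^{*} = dp$ used in the hydrodynamic
example can be checked numerically for a concrete equation of state. The
following is a minimal sketch, assuming a polytropic law $p = k\rho^{*\gamma}$, for which
$\Pi = k\rho^{*\gamma-1}/(\gamma-1)$; the polytropic law, the numerical constants and all
function names are illustrative assumptions and do not come from the text.

```python
# Finite-difference check of the relations
#   rho*F' - F = p   and   rho*F'' = dp/d(rho)
# for F(rho) = rho*c^2 + rho*Pi(rho).
# Assumption (not from the text): polytropic law p = K_POLY * rho**GAMMA.

C = 1.0             # velocity of light in arbitrary units (assumed)
K_POLY = 0.3        # polytropic constant (assumed)
GAMMA = 5.0 / 3.0   # polytropic exponent (assumed)


def pressure(rho):
    """Assumed equation of state p(rho)."""
    return K_POLY * rho ** GAMMA


def potential_energy(rho):
    """Pi = integral_0^p dp/rho - p/rho, in closed form for the polytrope."""
    return K_POLY * rho ** (GAMMA - 1.0) / (GAMMA - 1.0)


def F(rho):
    """The function F(rho) = rho*c^2 + rho*Pi."""
    return rho * C ** 2 + rho * potential_energy(rho)


def derivative(f, x, h=1e-4):
    """Central-difference first derivative."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


def second_derivative(f, x, h=1e-4):
    """Central-difference second derivative."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)


def check_relations(rho):
    """Return the residuals of the two relations at density rho."""
    r1 = rho * derivative(F, rho) - F(rho) - pressure(rho)
    r2 = rho * second_derivative(F, rho) - derivative(pressure, rho)
    return r1, r2


for rho in (0.5, 1.3, 2.0):
    print(rho, check_relations(rho))
```

Both residuals should be zero up to the error of the finite differences; any
other equation of state could be substituted, provided `potential_energy` is
changed consistently with `pressure`.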

In conclusion, we remark that, mathematically speaking, by no means every 
system of equations can be derived from a variational principle. The fact 
that the basic equations of mathematical physics can be so derived reveals a 
remarkable property possessed by them. 


49. The Integral Form of the Conservation Laws in Arbitrary 
Coordinates 

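As we know, the mass tensor $T^{\mu\nu}$ satisfies the equation

$$\nabla_{\nu}T^{\mu\nu} = 0 \tag{49.01}$$

which, for a closed conservative system, must express the conservation laws
for energy and momentum and also the conservation law for angular momentum
and the law of mass centre motion.

In explicit form (49.01) can be written as (see (41.24)):

$$\frac{\partial T^{\mu\nu}}{\partial x_{\nu}} + \Gamma^{\mu}_{\alpha\beta}T^{\alpha\beta} + \Gamma^{\nu}_{\alpha\nu}T^{\mu\alpha} = 0 \tag{49.02}$$

or, if the first and last terms are combined,

$$\frac{1}{\sqrt{-g}}\,\frac{\partial\left(\sqrt{-g}\,T^{\mu\nu}\right)}{\partial x_{\nu}} + \Gamma^{\mu}_{\alpha\beta}T^{\alpha\beta} = 0 \tag{49.03}$$
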

Owing to the fact that in (49.03) there is a term remaining outside the sign 
of differentiation the passage from the differential to the integral form of the 
conservation laws is not quite as obvious in general as in Galilean coordinates 
(see Section 31). Therefore, we shall discuss in more detail the question of the 
existence of integral forms of the conservation laws. 

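We introduce a vector $\varphi_{\mu}$ and integrate the equation

$$\nabla_{\nu}(T^{\mu\nu}\varphi_{\mu}) = \frac{1}{\sqrt{-g}}\,\frac{\partial}{\partial x_{\nu}}\left(\sqrt{-g}\,T^{\mu\nu}\varphi_{\mu}\right) \tag{49.04}$$

over a three-dimensional volume on the boundaries of which $T^{\mu\nu}$ vanishes.
We get

$$\int \nabla_{\nu}(T^{\mu\nu}\varphi_{\mu})\sqrt{-g}\;dx_{1}\,dx_{2}\,dx_{3}
 = \frac{d}{dx_{0}}\int T^{\mu 0}\varphi_{\mu}\sqrt{-g}\;dx_{1}\,dx_{2}\,dx_{3} \tag{49.05}$$

If $T^{\mu\nu}$ is the mass tensor it is symmetric and of vanishing divergence. For such
a tensor the previous equation can be written as

$$\frac{d}{dx_{0}}\int T^{\mu 0}\varphi_{\mu}\sqrt{-g}\;dx_{1}\,dx_{2}\,dx_{3}
 = \frac{1}{2}\int T^{\mu\nu}\left(\nabla_{\nu}\varphi_{\mu} + \nabla_{\mu}\varphi_{\nu}\right)\sqrt{-g}\;dx_{1}\,dx_{2}\,dx_{3} \tag{49.06}$$

The quantity

$$I = \int T^{\mu 0}\varphi_{\mu}\sqrt{-g}\;dx_{1}\,dx_{2}\,dx_{3} \tag{49.07}$$

will be constant, i.e. will be independent of $x_{0}$, the coordinate that has the
character of time, if the vector $\varphi_{\mu}$ satisfies the equations

$$\nabla_{\nu}\varphi_{\mu} + \nabla_{\mu}\varphi_{\nu} = 0 \tag{49.08}$$

We first discuss these equations in the general case that the fourth order
curvature tensor is not necessarily zero.

We put

$$\nabla_{\nu}\varphi_{\mu} = \varphi_{\mu\nu} \tag{49.09}$$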

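By (49.08) the $\varphi_{\mu\nu}$ form an antisymmetric tensor; we calculate its covariant
derivative. By (49.08) we can write

$$\nabla_{\sigma}\varphi_{\mu\nu}
 = \frac{1}{2}(\nabla_{\mu}\nabla_{\nu} - \nabla_{\nu}\nabla_{\mu})\varphi_{\sigma}
 + \frac{1}{2}(\nabla_{\sigma}\nabla_{\nu} - \nabla_{\nu}\nabla_{\sigma})\varphi_{\mu}
 - \frac{1}{2}(\nabla_{\sigma}\nabla_{\mu} - \nabla_{\mu}\nabla_{\sigma})\varphi_{\nu} \tag{49.10}$$
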


Using the relation (43.16) for the difference of covariant derivatives and the 
symmetry properties of the tensor R we get 

V o? |i v = (49- 11 ) 

Thus the equation (49.08) can be replaced by the system (49.09) and (49.11)
which is equivalent to a system of total differential equations.† It can be
shown that this system will be completely integrable in the case that

$$R_{\mu\nu,\alpha\beta} = K\,(g_{\nu\alpha}g_{\mu\beta} - g_{\mu\alpha}g_{\nu\beta}) \tag{49.12}$$

where $K$ is a constant. A space in which the curvature tensor has the form
(49.12) is called a space of constant curvature; it is a four-dimensional
generalization of Lobachevsky space. The constant $K$ is called the constant of curvature.

If the conditions of complete integrability are fulfilled, the values of the
ten functions $\varphi_{\mu}$ and $\varphi_{\mu\nu}$ are determined throughout space by their values at
one point, so that the general solution of the system (49.09) and (49.11) involves
ten constants.

We are interested in the case

$$R_{\mu\nu,\alpha\beta} = 0 \tag{49.13}$$

which is obtained from (49.12) by taking $K = 0$. (For this reason a space for
which $R_{\mu\nu,\alpha\beta} = 0$ is called a space of zero curvature, or a flat space.)

In flat space-time, with which we are concerned in the Theory of Relativity,
we can immediately indicate ten integrals of the system (49.08). Let $x'_{0}$, $x'_{1}$,
$x'_{2}$ and $x'_{3}$ be Galilean coordinates. We can then introduce the four vectors

$$\varphi^{(0)}_{\mu} = \frac{\partial x'_{0}}{\partial x_{\mu}}; \qquad
  \varphi^{(1)}_{\mu} = \frac{\partial x'_{1}}{\partial x_{\mu}}; \qquad
  \varphi^{(2)}_{\mu} = \frac{\partial x'_{2}}{\partial x_{\mu}}; \qquad
  \varphi^{(3)}_{\mu} = \frac{\partial x'_{3}}{\partial x_{\mu}} \tag{49.14}$$

(These quantities are vectors with respect to the unprimed coordinates $x_{0}$, $x_{1}$,
$x_{2}$, $x_{3}$; the Galilean coordinates are to be considered as scalars.) By virtue of
(42.16) each of the four vectors (49.14) satisfies

$$\nabla_{\nu}\varphi^{(\alpha)}_{\mu} = 0; \qquad (\alpha = 0, 1, 2, 3) \tag{49.15}$$

and, therefore, also equation (49.08).

† An exposition of the theory of such systems can be found in Reference [14].


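Further, we can introduce the six vectors

$$\varphi^{(\alpha\beta)}_{\mu} = x'_{\alpha}\,\frac{\partial x'_{\beta}}{\partial x_{\mu}} - x'_{\beta}\,\frac{\partial x'_{\alpha}}{\partial x_{\mu}} \tag{49.16}$$

each of which satisfies

$$\nabla_{\nu}\varphi^{(\alpha\beta)}_{\mu} + \nabla_{\mu}\varphi^{(\alpha\beta)}_{\nu} = 0 \tag{49.17}$$

which is the same as (49.08).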
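
That there are exactly ten such vectors, and that each of them satisfies
(49.08), can be checked numerically in the special case in which the
coordinates themselves are Galilean, so that covariant derivatives reduce to
ordinary ones. The following is a minimal sketch under that assumption; the
helper names (`translation`, `rotation`, `killing_residual`) are hypothetical
and introduced only for this illustration.

```python
# Central-difference check that the four vectors of (49.14) and the six
# vectors of (49.16), written in Galilean coordinates, satisfy
#   d_nu phi_mu + d_mu phi_nu = 0,
# i.e. equation (49.08) with vanishing Christoffel symbols (assumed here).

from itertools import combinations

DIM = 4
H = 1e-5  # finite-difference step


def translation(alpha):
    """Covariant components of phi^(alpha): phi_mu = d x_alpha / d x_mu."""
    return lambda x: [1.0 if mu == alpha else 0.0 for mu in range(DIM)]


def rotation(alpha, beta):
    """Covariant components of x_alpha dx_beta/dx_mu - x_beta dx_alpha/dx_mu."""
    def phi(x):
        comps = [0.0] * DIM
        comps[beta] = x[alpha]
        comps[alpha] = -x[beta]
        return comps
    return phi


def killing_residual(phi, x):
    """Largest |d_nu phi_mu + d_mu phi_nu| at the point x."""
    def d(mu, nu):
        # partial derivative of phi_mu with respect to x_nu
        xp = list(x)
        xm = list(x)
        xp[nu] += H
        xm[nu] -= H
        return (phi(xp)[mu] - phi(xm)[mu]) / (2.0 * H)
    return max(abs(d(mu, nu) + d(nu, mu))
               for mu in range(DIM) for nu in range(DIM))


def all_ten_vectors():
    """Four vectors of type (49.14) and six of type (49.16)."""
    vecs = [translation(a) for a in range(DIM)]
    vecs += [rotation(a, b) for a, b in combinations(range(DIM), 2)]
    return vecs


point = [0.3, -1.2, 2.5, 0.7]  # arbitrary test point
residuals = [killing_residual(phi, point) for phi in all_ten_vectors()]
print(len(residuals), max(residuals))
```

A vector field that is not of this type, for example `lambda x: list(x)`
(a uniform dilation), gives a residual that is clearly non-zero when passed to
`killing_residual`.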



Insertion of the vectors (49.14) into (49.07) and multiplication by $c$ gives
the constant integrals

$$P'^{\alpha} = c\int T^{\mu 0}\,\frac{\partial x'_{\alpha}}{\partial x_{\mu}}\sqrt{-g}\;dx_{1}\,dx_{2}\,dx_{3} \tag{49.18}$$

and insertion of the vectors (49.16) into the same expression gives the further
constant integrals

$$M'^{\alpha\beta} = c\int T^{\mu 0}\left(x'_{\alpha}\,\frac{\partial x'_{\beta}}{\partial x_{\mu}}
 - x'_{\beta}\,\frac{\partial x'_{\alpha}}{\partial x_{\mu}}\right)\sqrt{-g}\;dx_{1}\,dx_{2}\,dx_{3} \tag{49.19}$$

Comparing these expressions with their counterparts in Galilean coordinates
(equations (31.02), (31.03) and (31.08), (31.09)) it is simple to see their physical
meaning. The quantity $P'^{0}$ is the energy divided by $c$, or the mass multiplied
by $c$, the $P'^{i}$ ($i = 1, 2, 3$) are the components of momentum. Further, the
$M'^{i0}$ are the integrals of mass centre motion and the $M'^{ik}$ the integrals of angular
momentum.

The $P'^{\alpha}$ are components of a constant vector and the $M'^{\alpha\beta}$ the components
of a constant antisymmetric tensor with respect to a Galilean coordinate
system. But to a constant vector or tensor in a Galilean system there corresponds
in an arbitrary coordinate system a free vector or tensor, i.e. one all of whose
covariant derivatives vanish. Therefore, in an arbitrary coordinate system
we can relate to the integrals of motion the free vector

$$P_{\nu} = \sum_{\alpha=0}^{3} e_{\alpha}\,P'^{\alpha}\,\frac{\partial x'_{\alpha}}{\partial x_{\nu}} \tag{49.20}$$

and the free antisymmetric tensor

$$M_{\mu\nu} = \sum_{\alpha,\beta=0}^{3} e_{\alpha}e_{\beta}\,M'^{\alpha\beta}\,
  \frac{\partial x'_{\alpha}}{\partial x_{\mu}}\,\frac{\partial x'_{\beta}}{\partial x_{\nu}} \tag{49.21}$$

These quantities will be functions of the space and time coordinates satisfying

$$\nabla_{\sigma}P_{\nu} = 0; \qquad \nabla_{\sigma}M_{\mu\nu} = 0 \tag{49.22}$$

The values of these functions at all points are determined by their values at
any one point. Therefore, the number of constants on which the $P_{\nu}$ and $M_{\mu\nu}$
depend will be equal to the number of these quantities, namely, ten. The
constants of motion $P'^{\alpha}$ and $M'^{\alpha\beta}$ are just these constants.
In the above discussion the role of the Galilean coordinates was purely
auxiliary; they were simply used as variables in which to express the various
vectors corresponding to the different integrals of equation (49.08). It is, of
course, much simpler to conduct the whole argument in terms of Galilean
coordinates, as was done in Chapter II. However, the very aim of the more
complicated discussion was to show that even the integral form of the
conservation laws can be derived directly from equations written in arbitrary
coordinates.

The formulae given in the last two sections of this chapter enable us to
formulate the condition for the space-time to be uniform. This can be done in the
following way.

As stated in the Introduction, space-time is of maximum uniformity if a
ten-parametric transformation group exists that leaves the functional form of the
$g_{\mu\nu}$ invariant. (This means that the $g_{\mu\nu}$ transformed according to the tensor rule
are the same functions of the new variables as the original $g_{\mu\nu}$ were of the old.)
For an infinitesimal transformation (48.14) belonging to this group we have
$\delta g_{\mu\nu} = 0$, where $\delta g_{\mu\nu}$ is given by (48.21). But if in the equations $\delta g_{\mu\nu} = 0$ we
replace the $\eta_{\nu}$ by the $\varphi_{\nu}$ the equations will take on the same form as that of the
equations (49.08), which are related to the conservation laws. For these latter
equations the condition for total integrability is (49.12). Thus this equation
will also be the condition for integrability of the equations $\delta g_{\mu\nu} = 0$.

We see that the necessary and sufficient conditions for space-time to be 
uniform coincide with the conditions for the existence of an integral form of 
the conservation laws for the mass tensor. Therefore the conservation laws for 
those forms of energy which are included in the mass tensor are directly 
connected with the uniformity of space-time. 


49*. Remark on the Relativity Principle and the Covariance of 
Equations 

At the beginning of this book (Section 6) we gave a formulation of the
principle of relativity, which together with the postulate that the velocity of light
has a limiting character, may be made the basis of relativity theory. We shall
now investigate in more detail the question of the connection of the physical
principle of relativity with the requirement that the equations be covariant.
We have already touched on this problem in the Introduction.

In the first place, we shall attempt to give a generally covariant formulation
of the principle of relativity, without as yet making this concept more precise.
In its most general form the principle of relativity states the equivalence of the
coordinate systems (or frames of reference) that belong to a certain class and
are related by transformations of the form

$$x'_{\alpha} = f_{\alpha}(x_{0}, x_{1}, x_{2}, x_{3}) \tag{49*.01}$$

which may be stated more briefly as

$$x' = f(x) \tag{49*.02}$$

It is essential to remember that, in addition to the group of permissible transformations,
the class of coordinate systems must be characterized by certain supplementary conditions.
Thus, for instance, if we consider Lorentz transformations, it is self-evident that these linear
transformations must connect not any arbitrary coordinates, but only the Galilean
coordinates in two inertial reference frames. To consider linear transformations between any other
(non-Galilean) coordinates has no sense, because the Galilean principle of relativity has no
validity in relation to such artificial linear transformations. On the other hand, if one
introduces any other variables in place of the Galilean coordinates, a Lorentz transformation can
evidently be expressed in terms of these variables, but then the transformation formulae
will have a more complicated form.

Let us now state more precisely what is meant in the formulation of the
principle of relativity by equivalence of reference frames. Two reference frames
$(x)$ and $(x')$ may be called physically equivalent if phenomena proceed in the same
way in them. This means that if a possible process is described in the coordinates
$(x)$ by the functions

$$\varphi_{1}(x),\ \varphi_{2}(x),\ \ldots,\ \varphi_{n}(x) \tag{49*.03}$$

then there is another possible process which is describable by the same functions

$$\varphi_{1}(x'),\ \varphi_{2}(x'),\ \ldots,\ \varphi_{n}(x') \tag{49*.04}$$

in the coordinates $(x')$. Conversely any process of the form (49*.04) in the second
system corresponds to a possible process of the form (49*.03) in the first system.
Such a definition of corresponding processes agrees fully with that given in 
Section 6. Thus a relativity principle is a statement concerning the existence of 
corresponding processes in a set of reference frames of a certain class ; the systems 
of this class are then accepted as equivalent. It is clear from this definition 
that both the principle of relativity itself and the equivalence of two reference 
frames are physical concepts, and the statement that the one or the other is 
valid involves a definite physical hypothesis and is not just conventional. In 
addition, it follows that the very notion of a “ principle of relativity ” becomes 
well defined only when a definite class of frames of reference has been singled 
out. In the usual theory of relativity this class is that of inertial systems. 

The functions (49*.03) or (49*.04) describing a physical process will be called
field functions or functions of state. We have already indicated in Section 46
that in a generally covariant formulation of the equations describing physical
processes the components $g_{\mu\nu}$ of the metric tensor must be included among the
functions of state. In the example there discussed we then get the following
collection of field functions:

$$F_{\mu\nu}(x),\quad j^{\nu}(x),\quad g_{\mu\nu}(x) \tag{49*.05}$$

i.e. the electromagnetic field, the current vector and the metric tensor. The
requirement entering the formulation of a principle of relativity that in two
equivalent reference frames corresponding phenomena should proceed in the
same way applies equally to the metric tensor. Thus if we compare two
corresponding phenomena in two physically equivalent reference frames, then for
the first phenomenon, described in the old coordinates, not only the
components of electromagnetic field and of current density, but also the components
of the metric tensor must have the same mathematical form as for the second
phenomenon described in the new coordinates.

What can be concluded further will depend on whether we assume that the
metric is fixed or whether we take into consideration phenomena which
themselves influence the metric. In the usual theory of relativity described in the
previous chapters it is assumed that the metric is given once and for all and
does not depend on any physical processes. The generally covariant formulation
of the theory of relativity given in the present chapter does not change anything
in this. As long as the assumption remains in force that the character of
space-time is Galilean and the $g_{\mu\nu}$ are introduced only to achieve general covariance,
these quantities will depend only on the choice of coordinate system, not on the
nature of the physical process discussed; they are functions of state only in a
formal sense. In the theory of gravitation on the other hand, to the description
of which we turn in the following chapter, a different assumption is made
concerning the nature of space-time. There the $g_{\mu\nu}$ are functions of state, not
only in a formal sense, but in fact: they describe a certain physical field, namely
the field of gravitation. However, when discussing small-scale processes which
do not influence the motion of heavy masses one can also assume that the
metric is fixed (though not Galilean).

To give a definite meaning to the principle of relativity in such circumstances, 
it is essential to specify more closely not only the class of coordinate systems, 
but also the nature of the physical processes for which the principle is being 
formulated. 

We shall first start from the assumption that the metric is fixed (“rigid”),
or else that it may be considered as fixed for a certain class of physical processes.
We return to the above definition of corresponding phenomena in two physically
equivalent coordinate systems, according to which all field functions, including
the components of the metric tensor, must have the same mathematical form
for the first process described in the old coordinates as for the second process
described in the new coordinates. If the $g_{\mu\nu}$ are independent of the nature of
the physical phenomenon, then in relation to these quantities we need not make
a distinction between the first and second process, and need consider only
transformations of the coordinates. But then the quantities

$$g_{\mu\nu}(x) \quad\text{and}\quad g'_{\mu\nu}(x') \tag{49*.06}$$

will be connected by the tensor transformation rule; the requirement of the
relativity principle that they should have one and the same mathematical form
reduces (for infinitesimal coordinate transformations) to the equations $\delta g_{\mu\nu} = 0$
discussed in Sections 48 and 49.

We know that the most general class of transformations that satisfies these
equations contains 10 parameters and is possible only in uniform space-time,
where the relation

$$R_{\mu\nu,\alpha\beta} = K\cdot(g_{\nu\alpha}g_{\mu\beta} - g_{\mu\alpha}g_{\nu\beta}) \tag{49*.07}$$

(see equation (49.12)) is valid. If in these relations $K$ is zero, the space-time
is Galilean and the transformations in question are Lorentz transformations,
except that possibly they may be written down in other (non-Galilean)
coordinates.
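
The requirement that the metric keep the same mathematical form can be
illustrated numerically for constant linear transformations. The following is
a minimal sketch, assuming Galilean coordinates with $x_{0} = ct$ and the
contravariant metric diag(1, -1, -1, -1); the helper names (`boost_x`,
`scaling`, `transformed_metric`, `max_change`) are hypothetical and introduced
only for this illustration. A Lorentz boost leaves the metric unchanged,
whereas a uniform change of scale does not.

```python
import math

# Galilean metric g^{alpha beta} with x_0 = c t (assumption of this sketch)
G = [[1.0, 0.0, 0.0, 0.0],
     [0.0, -1.0, 0.0, 0.0],
     [0.0, 0.0, -1.0, 0.0],
     [0.0, 0.0, 0.0, -1.0]]


def boost_x(beta):
    """Matrix dx'_mu/dx_alpha of a boost along x_1 with velocity beta*c."""
    gamma = 1.0 / math.sqrt(1.0 - beta * beta)
    return [[gamma, -gamma * beta, 0.0, 0.0],
            [-gamma * beta, gamma, 0.0, 0.0],
            [0.0, 0.0, 1.0, 0.0],
            [0.0, 0.0, 0.0, 1.0]]


def scaling(s):
    """Matrix of a uniform change of scale x' = s * x."""
    return [[s if i == j else 0.0 for j in range(4)] for i in range(4)]


def transformed_metric(a):
    """Tensor rule g'^{mu nu} = a[mu][alpha] * a[nu][beta] * g^{alpha beta}."""
    return [[sum(a[m][i] * a[n][j] * G[i][j]
                 for i in range(4) for j in range(4))
             for n in range(4)] for m in range(4)]


def max_change(a):
    """Largest component of g' - g for the constant transformation a."""
    gp = transformed_metric(a)
    return max(abs(gp[m][n] - G[m][n]) for m in range(4) for n in range(4))


print(max_change(boost_x(0.6)))   # boost: metric form preserved
print(max_change(scaling(2.0)))   # change of scale: metric multiplied by s**2
```

The sketch covers only constant matrices; for general coordinate
transformations the comparison has to be made at the same values of the
arguments, as in the derivation of (48.18).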

Thus with the rigidity assumption for the metric, the principle of relativity
implies the uniformity of space-time, and if the additional condition $K = 0$ holds,
we obtain a Galilean metric in appropriate coordinates. The relativity principle
in general form then reduces to the Galilean relativity principle. As for the
condition $K = 0$, it results in an additional uniformity of space-time; if the
scale of the Galilean coordinates is changed, then the scale of the elementary
interval changes in the same proportion.† This property implies in turn that
there is no absolute scale for space-time, unlike the absolute scale that exists
for velocities in terms of the velocity of light; the absence of an absolute scale
for space-time leads conversely to the equation $K = 0$.

If we now go over to discuss phenomena which may themselves influence the 
metric, we must reckon with the possibility that under certain conditions the 
principle of relativity will be valid in non-uniform space also. For this to be so, 
it is necessary that the motion of the masses producing the non-uniformity be 
included in the description of the phenomena. 

Indeed, at the end of this book it will be shown that under the assumption
that space-time is uniform at infinity (where it must be Galilean) one can single 
out a class of coordinate systems that are analogous to inertial systems and 
defined up to a Lorentz transformation. In relation to this class of coordinate 
systems a principle of relativity will hold in the same form as in the usual theory 
of relativity, in spite of the fact that at a finite distance from the masses the 
space is non-uniform. Here however one must bear in mind the essential role 
played by the boundary conditions that require uniformity at infinity. Thus 
in the last analysis the relativity principle is here also a result of uniformity. 

Since the greatest possible uniformity is expressed by Lorentz transformations 
there cannot be a more general principle of relativity than that discussed in 
ordinary relativity theory. All the more, there cannot be a general principle of 
relativity, as a physical principle, which would hold with respect to arbitrary 
frames of reference. 

In order to make this fact clear, it is essential to distinguish sharply between
a physical principle that postulates the existence of corresponding phenomena
in different frames of reference and the simple requirement that equations
should be covariant in the passage from one frame of reference to another.
It is clear that a principle of relativity implies a covariance of equations, but
the converse is not true: covariance of differential equations is possible also
when no principle of relativity is satisfied. Quite apart from the fact that not
all laws of nature reduce to differential equations, even fields described by
differential equations require for their definitions not only these equations, but
also all kinds of initial, boundary and other conditions. These conditions are
not covariant. Therefore the preservation of their physical content requires a
change in their mathematical form and, conversely, preservation of their
mathematical form implies a change of their physical content. But the
realizability of a process with a new physical content is an independent question
which cannot be solved a priori. If within a given class of reference systems
“corresponding” physical processes are possible, then a principle of relativity
holds. In the opposite case it does not. It is clear, however, that such a model
representation of physical processes, and in particular such a model
representation of the metric, is possible at most for a narrow class of reference systems, and
certainly cannot be unlimited. This argument shows once again (without
invoking the concept of uniformity) that a general principle of relativity, as a
physical principle, holding in relation to arbitrary frames of reference, is impossible.

† It should be remembered that in the general case the expression for $ds^{2}$, though always
a homogeneous function of the coordinate differentials, may also depend in a
non-homogeneous way on the coordinates themselves.

But as a motivation of the requirement of covariance of the equations a
general principle of relativity is also unnecessary. The covariance requirement
can be justified independently. It is a self-evident, purely logical requirement
that in all cases in which the coordinate system is not fixed in advance, equations
written down in different coordinate systems should be mathematically
equivalent. The class of transformations with respect to which the equations must be
covariant must correspond to the class of coordinate systems considered. Thus
if one deals with inertial systems related by Lorentz transformations and if
Galilean coordinates are used, it is sufficient to require covariance with respect
to Lorentz transformations (as was done in Chapters I and II of this book).
If, however, arbitrary coordinates are employed, it is necessary to demand
general covariance (Chapter IV).

It should be noted that covariance of coordinate systems acquires a definite
physical meaning if, and only if, a principle of relativity exists for the class of
reference frames used. Such is the covariance with respect to Lorentz
transformations. This concept was so fruitful in the formulation of physical laws
because it contains concrete chrono-geometric elements (rectilinearity and
uniformity of motion) and also dynamic elements (the concept of inertia in the
mechanical and the electromagnetic senses; Section 5). Because of this, it is
related to the physical principle of relativity and itself becomes concrete and
physical. If, however, in place of the Lorentz transformations one discusses
arbitrary transformations, one ceases to single out that class of coordinate
systems relative to which the principle of relativity exists, and by doing this
one destroys the connection between physics and the concept of covariance.
There remains a purely logical side to the concept of covariance as a consistency
requirement on equations written in different coordinate systems. Naturally
this requirement is necessary, but it is always satisfiable.

In dealing with classes of reference frames that are more general than that 
relative to which a principle of relativity holds, the necessity arises of replacing 
the explicit formulation of the principle by some other statement. The explicit 
formulation consists in indicating a class of physically equivalent frames of 
reference; the new formulation must express those properties of space and time 
by virtue of which the principle of relativity is possible. With the assumption
of a rigid metric this is achieved by introducing an additional equation (49*.07).
We saw in this chapter that with the additional assumption of the absence of
a universal scale ($K = 0$) these equations lead to a generally covariant
formulation of the theory of relativity, without any alteration of its physical content.
The Galileo-Lorentz principle of relativity is then maintained to its full extent. 

The very possibility of formulating the ordinary theory of relativity in a 
generally covariant form shows particularly clearly the difference between the 
principle of relativity as a physical principle and the covariance of the equations 
as a logical requirement. In addition, such a formulation opens the way to 
generalizations based on a relaxation of the assumption of a rigid metric. This 
relaxation provides the possibility of replacing the supplementary conditions 
(49*.07) by others which reflect better the properties of space and time. This 
leads us to Einstein’s theory of gravitation, which will be discussed in the 
following chapters. 



CHAPTER V 


THE PRINCIPLES OF THE THEORY OF GRAVITATION 


50. The Generalization of Galileo’s Law 


The most essential characteristic of the gravitational field by which it differs 
from all other fields known to physics reveals itself in the effect of the field on 
the motion of a freely moving body or mass point. In a gravitational field all 
otherwise free bodies move in the same manner, provided the initial conditions 
of their motion, i.e. their initial positions and velocities, are the same. This 
fundamental law may be thought of as a generalization of Galileo’s law that 
in the absence of resistance all bodies fall equally fast. 

It is appropriate to recall at this point the definitions of inertial mass and of 
gravitational mass. Inertial mass is the measure of the ability of a body to 
resist acceleration; for a given force the acceleration is inversely proportional 
to the inertial mass. Gravitational mass is the measure of the ability of a body 
to produce a gravitational field and to suffer the action of such a field; in a 
given field the force experienced by a body is proportional to the gravitational 
mass. 

Using these definitions the aforementioned generalization of Galileo’s law 
can be formulated as a statement that the inertial and the gravitational masses 
of any body are equal. 

According to Newton the gravitational field can be characterized by the 
gravitational potential U(x, y, z). The gravitational potential produced by an 
isolated spherically symmetric mass $M$ at points exterior to itself is

$$ U = \frac{\gamma M}{r} $$   (50.01)


where $r$ is the distance from the centre of the mass. The quantity $\gamma$ is the
Newtonian constant of gravitation; in c.g.s. units it has the value

$$ \gamma = \frac{1}{15\,000\,000}\ \frac{\mathrm{cm}^3}{\mathrm{g\,sec}^2} $$   (50.02)

Thus $U$ has the dimensions of the square of a velocity. We note immediately
that in all cases encountered in Nature, even on the surface of the Sun or of
super-dense stars, the quantity $U$ is very small compared to the square of the
speed of light:

$$ U \ll c^2 $$   (50.03)

In the general case of an arbitrary mass distribution the Newtonian potential
$U$ it produces satisfies Poisson’s equation

$$ \Delta U = -4\pi\gamma\rho $$   (50.04)

where $\rho$ is the mass density. The Newtonian potential is fully determined by
Poisson’s equation together with continuity and boundary conditions, which
are as follows: the function $U$ and its first derivatives must be finite,
single-valued and continuous throughout space and must tend to zero at infinity.
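The smallness condition (50.03) can be checked numerically. The following is a minimal sketch, assuming round astronomical values for the masses and radii of the Sun and the Earth (these figures and the function names are illustrative and do not come from the text); only the value of $\gamma$ is taken from (50.02).

```python
# Constants in c.g.s. units. GAMMA follows the value quoted in (50.02);
# the masses, radii and the speed of light are assumed round figures.
GAMMA = 1.0 / 15_000_000      # cm^3 / (g sec^2)
C = 2.998e10                  # speed of light, cm/sec

def potential(mass_g, r_cm):
    """Newtonian potential U = gamma * M / r of (50.01), in cm^2/sec^2."""
    return GAMMA * mass_g / r_cm

def smallness(mass_g, r_cm):
    """Dimensionless ratio U / c^2 that must be small according to (50.03)."""
    return potential(mass_g, r_cm) / C**2

sun = smallness(1.989e33, 6.96e10)      # surface of the Sun (assumed data)
earth = smallness(5.97e27, 6.371e8)     # surface of the Earth (assumed data)

print(f"Sun:   U/c^2 = {sun:.2e}")
print(f"Earth: U/c^2 = {earth:.2e}")
```

Both ratios come out several orders of magnitude below unity, which is the sense in which $U \ll c^2$ is meant.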

Let us assume that the Newtonian potential $U$ is given. The force experienced
by a body (mass point) of gravitational mass $(m)_{gr}$ in a gravitational field of
potential $U$ is

$$ \mathbf{F} = (m)_{gr}\,\operatorname{grad} U $$   (50.05)

On the other hand, by Newton’s laws of motion, we have

$$ (m)_{in}\,\mathbf{w} = \mathbf{F} $$   (50.06)

where $\mathbf{w}$ is the acceleration of the body. Therefore

$$ (m)_{in}\,\mathbf{w} = (m)_{gr}\,\operatorname{grad} U $$   (50.07)

By the generalization of Galileo’s law the motion of the body in a given
gravitational field cannot depend on its mass. Therefore, the ratio of inertial mass
$(m)_{in}$ to gravitational mass $(m)_{gr}$ must be the same for all bodies; it is thus a
universal constant whose value can only depend on the choice of units for the
two masses. In the units quite generally accepted one has simply

$$ (m)_{in} = (m)_{gr} = m $$   (50.08)

so that inertial and gravitational masses are equal.

The equality of inertial and gravitational mass is such a familiar fact that it 
is usually accepted as something obvious. However, the matter is not so 
simple : their equality is a separate and very important law of Nature, closely 
connected with the generalization of Galileo’s law. 

As a result of the equality of inertial and gravitational mass the equation 
of motion 

w = grad U (50.09) 

has universal character, and thus formally expresses the generalization of 
Galileo’s law. 

We note that the equations of motion (50.09) can be obtained from the 
variational principle 

$$ \delta \int \left( \tfrac{1}{2} v^2 + U \right) dt = 0 $$   (50.10)

This fact will be a guide to us in constructing the theory of gravitation. 
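The step from the variational principle (50.10) to the equations of motion (50.09) can be written out as follows. This is a sketch that assumes Cartesian coordinates $x_i$ and treats the integrand of (50.10) as a Lagrangian $L$ per unit mass; the symbols $L$ and $x_i$ are introduced here for illustration only.

```latex
\begin{align*}
L &= \tfrac{1}{2} v^2 + U(x_1, x_2, x_3),
  \qquad v^2 = \dot{x}_1^2 + \dot{x}_2^2 + \dot{x}_3^2 \\
\frac{d}{dt}\frac{\partial L}{\partial \dot{x}_i} - \frac{\partial L}{\partial x_i}
  &= \ddot{x}_i - \frac{\partial U}{\partial x_i} = 0
  \quad\Longrightarrow\quad \mathbf{w} = \operatorname{grad} U
\end{align*}
```

No mass appears in $L$, which is how the universality of the law of motion shows itself in the variational formulation.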

51. The Square of the Interval in Newtonian Approximation 

The phenomenon of universal gravitation forces us to widen the framework 
of the theory of space and time which was the subject of the foregoing chapters. 
The necessity of this widening becomes clear from the following considerations. 

It follows from the equation of wave front propagation, which can be stated 
in the form 

$$ \frac{1}{c^2}\left(\frac{\partial\omega}{\partial t}\right)^2 - \left[\left(\frac{\partial\omega}{\partial x}\right)^2 + \left(\frac{\partial\omega}{\partial y}\right)^2 + \left(\frac{\partial\omega}{\partial z}\right)^2\right] = 0 $$   (51.01)

that light is propagated in straight lines. But light possesses energy and by the
law of proportionality of mass and energy all energy is indissolubly connected
with mass. Therefore light must possess mass. On the other hand, by the
law of universal gravitation, any mass located in a gravitational field must
experience the action of that field and in general its motion will therefore
not be rectilinear.† Hence it follows that in a gravitational field the law of
wave front propagation must have a form somewhat different from the one
given above. But the equation of wave front propagation is a basic
characteristic of the properties of space and time. Hence it follows that the presence of
the gravitational field must affect the properties of space and time and their 
metric is then not a rigid one. This is indeed the conclusion reached in the 
theory of gravitation which we now begin to construct. 

As was shown in Chapter I, the equation of wave front propagation (51.01), 
with some additional assumptions, leads to the following expression for the 
square of the interval: 

$$ ds^2 = c^2\,dt^2 - (dx^2 + dy^2 + dz^2) $$   (51.02)

The influence of the gravitational field on the properties of space and time 
must have the consequence that the coefficients in the equation of wave front 
propagation and in the expression for the square of the interval will differ 
from the constant values appearing in (51.01) and (51.02). We must now find 
an approximate form for the square of the interval in a gravitational field of 
Newtonian potential U, relying on the generalization of Galileo’s law to guide 
us. The fundamental fact that the law of motion for a body moving freely 
in a gravitational field is a universal one which does not depend on the nature 
of the body permits us to find the relation between the law of motion and 
the metric of space-time. 

The equations of a geodesic in a space-time with given metric were studied 
in Section 38. We shall now try to find a metric such that these equations 
coincide approximately with the Newtonian equations of motion for a free 
body in a given gravitational field. If this attempt is successful it will enable 
us to introduce the hypothesis that in a space-time with given metric a free 
body (mass point) moves along a geodesic ; in this way the connection between 
the law of motion and the metric will be established. 


As we know, the equation of a geodesic may be derived from the variational 
principle 


$$ \delta \int ds = 0 $$   (51.03)

If the squared interval is of the form (51.02) we have

$$ ds = \sqrt{c^2 - v^2}\,dt $$   (51.04)

or, for small velocities,

$$ ds = \left( c - \frac{v^2}{2c} \right) dt $$   (51.05)


Inserting (51.04) or (51.05) into (51.03) gives equations that describe motion
with constant velocity, which indeed is free motion in the absence of a
gravitational field. We can now assume that for small velocities and weak gravitational
fields ($U \ll c^2$) the expression for the interval takes the form

$$ ds = \sqrt{c^2 - 2U - v^2}\,dt $$   (51.06)

or

$$ ds = \left[ c - \frac{1}{c}\left( \tfrac{1}{2} v^2 + U \right) \right] dt $$   (51.07)

in place of (51.04) or (51.05). Since neither an additive constant nor a constant
multiplier is of any importance in a Lagrangian, the variational principle
(51.03), with $ds$ taken from (51.07), gives the same result as the variational
principle

$$ \delta \int \left( \tfrac{1}{2} v^2 + U \right) dt = 0 $$   (51.08)

which was formulated at the end of Section 50 and which does indeed describe the
free motion of a body in a gravitational field.

† The theory of the deflection of light in a gravitational field is given in Section 59 below.
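The passage from the square-root form of the interval to the form linear in $\tfrac{1}{2} v^2 + U$ is a first-order expansion in the small quantities $U/c^2$ and $v^2/c^2$. A short worked version, using only the symbols already defined in this section:

```latex
\begin{align*}
\sqrt{c^2 - 2U - v^2}
  &= c\,\sqrt{1 - \frac{2U + v^2}{c^2}}
  \approx c\left( 1 - \frac{2U + v^2}{2c^2} \right)
  = c - \frac{1}{c}\left( \tfrac{1}{2} v^2 + U \right)
\end{align*}
```

Dropping the additive constant $c$ and the constant factor $-1/c$, neither of which affects the equations of motion, leaves the Lagrangian $\tfrac{1}{2} v^2 + U$.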

It is true that, just because additive constants and multiplicative factors in a
Lagrangian are immaterial, equation (51.08) could be obtained from (51.03) and
(51.06) with any sufficiently large value of the constant $c$; but we must require
that in the absence of gravitation, when $U = 0$, the expression (51.06) for the
interval shall go over into the Galilean form (51.04) whatever the value of $v^2$.
This requirement fixes the constant $c$ in (51.06) to be equal to the speed of light.
These arguments give us good reason to assume that under the conditions

$$ U \ll c^2, \qquad v^2 \ll c^2 $$   (51.09)


the square of the interval differs little from the form 

$$ ds^2 = (c^2 - 2U)\,dt^2 - (dx^2 + dy^2 + dz^2) $$   (51.10)

Here the relative error in the coefficient of $dt^2$ will certainly be of higher order
than the term $2U/c^2$ which is included. As regards the coefficient of the purely
spatial part of the interval, it may differ from unity by a quantity of the
same order as $2U/c^2$. Indeed, the theory of gravitation to be developed in the
following sections gives the more exact expression

$$ ds^2 = (c^2 - 2U)\,dt^2 - \left( 1 + \frac{2U}{c^2} \right)(dx^2 + dy^2 + dz^2) $$   (51.11)

Under the conditions (51.09) the difference between (51.10) and (51.11) is 
negligible, as it should be. 

In principle the value found for the coefficient of $dt^2$ is capable of experimental
verification. 

Let us assume that at some point $(x_1, y_1, z_1)$, at which the gravitational potential
is $U_1$, there is some emitter of radiation of proper period $T_0$. The wave radiated
by it will have a time dependence of the form

$$ \exp\left( i\,\frac{2\pi t}{T_1} \right) $$   (51.12)

where $T_1$ is not equal to $T_0$ but is related to it in the same way as $dt$ is related
to $d\tau$, the differential of the proper time of the emitter. If, for simplicity, the
emitter is assumed to be at rest in the frame of reference chosen, we have
approximately

$$ d\tau = \left( 1 - \frac{U_1}{c^2} \right) dt $$   (51.13)

and therefore

$$ T_1 = \left( 1 + \frac{U_1}{c^2} \right) T_0 $$   (51.14)
In this problem the dependence of gravitational potential on time may be
ignored, so that the gravitational field can be treated as static. Then the 
wave being propagated from the emitter will retain its time dependence (51.12) 
throughout space. 
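The relation between coordinate time and the proper time of an emitter at rest follows directly from the approximate interval (51.10). The sketch below assumes the usual definition of proper time, $ds = c\,d\tau$, and keeps only terms of first order in $U_1/c^2$.

```latex
\begin{align*}
c^2\,d\tau^2 &= ds^2 = (c^2 - 2U_1)\,dt^2
  \qquad (dx = dy = dz = 0) \\
d\tau &= \sqrt{1 - \frac{2U_1}{c^2}}\;dt \approx \left( 1 - \frac{U_1}{c^2} \right) dt \\
\frac{T_0}{T_1} &= \frac{d\tau}{dt}
  \quad\Longrightarrow\quad
  T_1 \approx \left( 1 + \frac{U_1}{c^2} \right) T_0
\end{align*}
```

To this order the period measured in coordinate time exceeds the proper period by the fraction $U_1/c^2$.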

Let us now assume further that at some other point $(x_2, y_2, z_2)$, where the
gravitational potential has a different value $U_2$, there is a second identical
emitter, e.g. another atom of the same element. The wave emitted by it will
have a time dependence throughout space of the form

$$ \exp\left( i\,\frac{2\pi t}{T_2} \right) $$   (51.15)

where

$$ T_2 = \left( 1 + \frac{U_2}{c^2} \right) T_0 $$   (51.16)

Thus the two waves emitted by identical sources but originating from places of
different gravitational potential have periods differing by

$$ T_2 - T_1 = \frac{U_2 - U_1}{c^2}\,T_0 $$   (51.17)

If $U_2$ is the potential on the Sun and $U_1$ the potential on the Earth we have
$U_2 > U_1$ and the numerical value of the factor of $T_0$ in (51.17) is approximately
equal to

$$ \frac{U_2 - U_1}{c^2} = 2 \times 10^{-6} $$   (51.18)

Thus the wave lengths of spectral lines originating on the Sun must be
displaced relative to the corresponding lines produced on the Earth by two parts
in a million towards the red end of the spectrum.

However, one must note that the emission of spectral lines on the Sun takes 
place in physical conditions different from those on Earth and that the change 
of period due to the difference of gravitational potentials is to a great extent 
masked by other corrections. 

There are, however, certain super-dense stars, such as the companion of 
Sirius, which have a density tens of thousands of times greater than that of 
water. On their surfaces the value of the gravitational potential is significantly
greater than on the surface of the Sun (twenty times greater in the case of the
companion of Sirius). For such stars the correction due to the difference in
gravitational potential becomes very appreciable and can be detected
experimentally.
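The order of magnitude of the shift for the Sun, and for a star whose surface potential is twenty times larger, can be estimated with a few lines of arithmetic. This is a rough sketch: the astronomical data and the function name are assumptions made for illustration, while $\gamma$ is the c.g.s. value quoted in Section 50 and the factor of twenty is the figure given in the text.

```python
# c.g.s. units. GAMMA is the value quoted in (50.02); all other numbers
# are assumed round astronomical data that do not come from the text.
GAMMA = 1.0 / 15_000_000                 # cm^3 / (g sec^2)
C = 2.998e10                             # speed of light, cm/sec
M_SUN, R_SUN = 1.989e33, 6.96e10         # g, cm
M_EARTH, R_EARTH = 5.97e27, 6.371e8      # g, cm
AU = 1.496e13                            # Sun-Earth distance, cm

def relative_shift(u_source, u_observer):
    """Factor multiplying the proper period: (U2 - U1) / c^2."""
    return (u_source - u_observer) / C**2

u_sun_surface = GAMMA * M_SUN / R_SUN
# Potential at the Earth's surface: the Earth's own field plus the Sun's at 1 AU.
u_earth = GAMMA * M_EARTH / R_EARTH + GAMMA * M_SUN / AU

solar = relative_shift(u_sun_surface, u_earth)
dense_star = relative_shift(20 * u_sun_surface, u_earth)   # "twenty times greater"

print(f"Sun:                  {solar:.2e}")
print(f"Star with 20x larger U: {dense_star:.2e}")
```

The potential at the Earth turns out to be a correction of well under one per cent to the solar value, so the shift is governed almost entirely by the potential at the source.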

Recently (1959) the influence of the gravitational potential on the frequency 
of emitted light was successfully revealed in terrestrial conditions by making 
use of the extremely sharp lines corresponding to nuclear levels. These sharp 
lines are obtained in the emission of gamma-quanta by atoms in crystals in those
emission processes which are not accompanied by emission or absorption of
phonons. In such radiation the recoil momentum is transferred to the whole
crystal, as a result of which the energy transfer to the crystal proves to be
negligibly small and the emitted gamma-quantum possesses practically the same
frequency as it would have had without recoil. This phenomenon was
discovered by Mössbauer in 1958 [46]. We shall not discuss the relevant theory
here, but will mention that the crystal temperature has a decisive influence on
the probability of “phononless” radiation, but not on its frequency or line
breadth. Conversely, in radiation accompanied by phonons (which also occurs) 
a considerable line broadening takes place as a result of the Doppler effect, 
connected with the thermal motion of the atoms in the crystal. 

The frequency of the gamma-radiation in the phononless process is in
resonance with the absorption frequency of the same line in the absorber crystal,
and this resonance is so sharp that a relative change of one of the frequencies
by a quantity of order $10^{-16}$ is sufficient to destroy the resonance. With such
a sensitivity the change in the potential of terrestrial gravitation in a rise of,
say, 10 m above the surface of the Earth becomes detectable; for, putting
$g = 9.8$ m/sec$^2$ and $h = 10$ m we get

$$ \frac{U_2 - U_1}{c^2} = \frac{gh}{c^2} = 11 \times 10^{-16} $$   (51.19)

Thus, if the absorber atom is above the emitting atom its resonance frequency 
will be greater than the frequency of the gamma-quantum reaching it from below 
(as it rises, the gamma-quantum, so to speak, loses energy); at a height of 10 m 
the relative frequency difference is given by (51.19). This difference can be
compensated, and therefore measured, by using the Doppler effect produced by
moving the crystal as a whole. Here one has to use the transverse (quadratic) 
Doppler effect, because the longitudinal (linear) effect would be too coarse a 
means of compensating the frequency shift in such precision experiments. 

The theory of the Doppler effect is given in Section 13; the formula for the
transverse effect is found from (13.16) for $v_r = 0$. As is clear from the
discussion of Section 13, the frequency shift in the transverse Doppler effect occurs
as a result of the relation between proper time $\tau$ and coordinate time $t$, while
there is no correction due to the need to express the instant $t$ of emission in
terms of the instant $t^*$ of signal reception, provided the radial velocity is zero.
Therefore equation (51.07) may be used directly to derive the condition that the
gravitational effect be compensated by the transverse Doppler effect. This
formula can be written as a relation between the frequencies; we put $\nu_0$ for the
proper frequency of the quantum and $\nu$ for the local frequency that corresponds
to coordinate time. We then get

$$ \frac{\nu}{\nu_0} = 1 - \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + U \right) $$   (51.20)

If the lower point is the position of the stationary crystal and has the potential
$U_2$, while the potential at the upper point is $U_1$ and the crystal there is moving
with the transverse velocity $v$, then the atoms in both crystals will be in
resonance ($\nu_2 = \nu_1$) if the condition

$$ \tfrac{1}{2} v^2 + U_1 = U_2 $$   (51.21)

is satisfied. Since we have $U_2 - U_1 = gh$, this gives

$$ v^2 = 2gh $$   (51.22)
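For the figures used in the text ($g = 9.8$ m/sec$^2$, $h = 10$ m) the relative shift and the compensating transverse velocity are easy to evaluate. The snippet below is an illustrative sketch; the variable names and the value taken for the speed of light are assumptions.

```python
import math

G_ACCEL = 9.8      # m/sec^2, value used in the text
HEIGHT = 10.0      # m, value used in the text
C = 2.998e8        # speed of light in m/sec (assumed standard value)

# Relative frequency difference over the height h: g*h / c^2.
relative_shift = G_ACCEL * HEIGHT / C**2

# Transverse velocity that compensates it, from v^2 = 2*g*h.
v_compensating = math.sqrt(2 * G_ACCEL * HEIGHT)

# The quadratic Doppler term (1/2) v^2 / c^2 should reproduce the shift.
doppler_term = 0.5 * v_compensating**2 / C**2

print(f"g*h/c^2        = {relative_shift:.3e}")
print(f"v              = {v_compensating:.1f} m/sec")
print(f"(1/2) v^2/c^2  = {doppler_term:.3e}")
```

A velocity of the order of ten metres per second is enough, which illustrates why the quadratic effect gives much finer control than the linear one.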

We see that the experimentally established presence of a frequency shift in 
the gravitational field and the possibility of compensating it by a transverse 
Doppler effect provide a direct verification of equation (51.07), and by the
same token, of equation (51.10) for the squared interval.

Thus, one can consider that the value we have found for the coefficient of $dt^2$
in the metric form is in agreement with experimental findings.

52. Einstein’s Gravitational Equations 

Einstein’s theory of gravitation in its restricted, non-cosmological, form has 
the following basic idea. 

The geometrical properties of real physical space and time correspond not 
to Euclidean but to Riemannian geometry. In Chapter III we discussed the
basic postulates of this geometry. Any deviation of geometrical properties
from their Euclidean, or to be precise, pseudo-Euclidean form appears in
Nature as a gravitational field. The geometrical properties are indissolubly
linked with the distribution and motion of ponderable matter. This
relationship is mutual. On the one hand the deviations of geometrical properties from
the Euclidean are determined by the presence of gravitating masses, on the 
other, the motion of masses in the gravitational field is determined by these 
deviations. In short, masses determine the geometrical properties of space 
and time, and these properties determine the movement of the masses. 

We shall now attempt to formulate these ideas mathematically. 

In the previous section we saw that in a certain coordinate system, which 
for practical purposes coincides with an inertial frame of Newtonian mechanics, 
the Newtonian potential of gravitation $U$ enters the coefficient of $dt^2$ in the
expression for the square of the interval, i.e. the coefficient $g_{00}$ of the general
expression

$$ ds^2 = g_{\mu\nu}\,dx_\mu\,dx_\nu $$   (52.01)

On the other hand, in Newtonian approximation the gravitational potential 
U satisfies Poisson’s equation 

$$ \Delta U = -4\pi\gamma\rho $$   (52.02)

The required generalization of Newton’s theory of gravitation must be covariant
with respect to arbitrary coordinate transformations. Therefore it is impossible
to regard as a generalization of the Newtonian potential a term in the coefficient
$g_{00}$ or this whole coefficient; instead the whole set of coefficients $g_{\mu\nu}$ must be
taken into consideration and must appear as the generalization of Newton’s
potential. The fundamental metric tensor must satisfy a set of equations that
are generally covariant and in the Newtonian approximation one of them must
go over into Poisson’s equation for the potential $U$. The total number of
equations must, generally speaking, be equal to the number of unknown
functions, i.e. to the number of components of the tensor $g_{\mu\nu}$, which is ten.

On the left-hand side of Poisson’s equation there is a second order differential
operator, the Laplace operator, acting on $U$. Therefore, the simplest generally
covariant generalization of this left-hand side will be a tensor which involves
linearly the second derivatives of the metric tensor $g_{\mu\nu}$.

Such tensors are the curvature tensors (either of second or fourth rank). 
The fourth rank curvature tensor $R_{\mu\nu,\alpha\beta}$ is unsuitable because its components
do not contain expressions which could be generalizations of the Laplace operator
acting on $U$. Also, it has too many components, the number being twenty,
twice as many as there are unknown functions.† Therefore, there remains the
second rank curvature tensor which has just the right number of components.
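The component counts quoted here can be reproduced from two standard counting formulas, which are not stated in the text and are used below as assumptions: a symmetric second rank tensor in $n$ dimensions has $n(n+1)/2$ independent components, and the fourth rank curvature tensor has $n^2(n^2-1)/12$. The function names are illustrative.

```python
def symmetric_tensor_components(n):
    """Independent components of a symmetric second rank tensor in n dimensions."""
    return n * (n + 1) // 2

def curvature_tensor_components(n):
    """Independent components of the fourth rank curvature tensor in n dimensions
    (standard counting formula n^2 (n^2 - 1) / 12, assumed here)."""
    return n * n * (n * n - 1) // 12

unknowns = symmetric_tensor_components(4)        # components of the metric tensor
fourth_rank = curvature_tensor_components(4)     # components of the curvature tensor

print(unknowns, fourth_rank)
```

In four dimensions the second rank symmetric tensor therefore supplies exactly as many equations as there are unknown metric components, while the fourth rank tensor supplies twice as many.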

On the right-hand side of Poisson’s equation the mass density $\rho$ appears.
A generalization of the mass density which has the required tensor character
is the mass tensor $T^{\mu\nu}$ whose invariant is equal to the invariant mass density.

We are thus led to the conclusion that the required generalization of Poisson’s
equation must be a relation between the second rank curvature tensor $R^{\mu\nu}$
and the mass tensor $T^{\mu\nu}$.

In the previous chapters we saw that in the absence of gravitational fields
the divergence of the tensor $T^{\mu\nu}$ must vanish:

$$ \nabla_\nu T^{\mu\nu} = 0 $$   (52.03)

We shall retain this equation for the general case, postponing the discussion 
of questions connected with it (questions of the energy of a gravitational field, 
of the integral form of conservation laws, etc.) until Chapter VII. 

But we established at the end of Chapter III that the tensor 

$$ G^{\mu\nu} = R^{\mu\nu} - \tfrac{1}{2} g^{\mu\nu} R $$   (52.04)

which is known as Einstein’s tensor, or the conservative tensor, has the
remarkable property that its divergence is identically zero:

$$ \nabla_\nu G^{\mu\nu} \equiv 0 $$   (52.05)

Therefore, if we put

$$ R^{\mu\nu} - \tfrac{1}{2} g^{\mu\nu} R = -\kappa T^{\mu\nu} $$   (52.06)

where $\kappa$ is a constant, equations (52.03) for the mass tensor will be a
consequence of (52.06).

As we know, the metric tensor $g^{\mu\nu}$ itself also satisfies (52.05); therefore
we could add to the conservative tensor on the left-hand side of (52.06) a tensor
of the form $\lambda g^{\mu\nu}$, where $\lambda$ is a constant, without violating (52.03).

We saw in Section 31* that if one imposes purely local conditions on the mass 
tensor (the condition that it should depend on the field components or other 
functions of state, but not on the coordinates; the vanishing of its divergence 
by virtue of the field equations) it is only determined apart from two constants. 
To be more precise, if the tensor $T^{\mu\nu}$ satisfies the conditions stated, these are
also satisfied by

$$ T'^{\mu\nu} = \alpha T^{\mu\nu} + \beta g^{\mu\nu} $$   (52.07)

Here the constant $\alpha$ depends on the choice of energy unit, the constant $\beta$ also
on the conditions at infinity. If in supplementation of the local conditions one
demands that at infinity, where the field vanishes, the mass tensor should also
be zero, then $\beta$ will be zero and the mass tensor is determined uniquely.

† It is true that even an excessive number of equations for the tensor $g_{\mu\nu}$ might prove
to be compatible, as is the case for a space of constant curvature (equation (49.12)), but in
that case the equations permit the metric to be determined purely locally, i.e. without using
boundary conditions. They therefore have a character different from that of Poisson’s
equation for which boundary conditions are essential.

These results were established for the case of a Euclidean metric, but one may
expect that they will be true also for the general case, which differs from that 
discussed in Section 31* in two respects. In the first place, the ordinary
derivatives with respect to the coordinates in the field equations will be replaced
by tensor derivatives. In the second place, the quantities $g^{\mu\nu}$ will be added
to the functions of state (with the statements that their tensor derivatives
vanish taken, formally, to be their field equations). However, neither of these
facts can essentially change our conclusion on the degree of uniqueness of the
mass tensor determined from local conditions. The possibility of introducing
two constants $\alpha$ and $\beta$ is directly evident in the general case too; one may thus
expect that our conclusion on the uniqueness of the mass tensor also remains
valid (of course, it would be desirable to have a formal proof).†

Replacement of the tensor $T^{\mu\nu}$ by the linear function (52.07) of itself
corresponds to replacing the gravitational equations (52.06) by

$$ R^{\mu\nu} - \tfrac{1}{2} g^{\mu\nu} R = -\kappa T^{\mu\nu} - \lambda g^{\mu\nu} $$   (52.08)

The constant $\lambda$ is called the cosmological constant. It is clear from these
remarks that the question of the value of $\lambda$ acquires a definite meaning only after
the conditions are formulated by which the tensor $T^{\mu\nu}$ is to be defined uniquely.
Such conditions will necessarily be of non-local character; they can therefore 
only be formulated starting from definite assumptions about the character of 
space-time as a whole. 

At the beginning of this section we stated the basic assumptions of Einstein’s 
theory of gravitation in its limited (non-cosmological) formulation. According 
to these the concept of spatial infinity retains its meaning, space-time at infinity 
being pseudo-Euclidean (Galilean). Deviations from Euclidean character are 
observed only at a finite distance from massive bodies. But in this case the 
mass tensor may continue to be subjected to the requirement stated for the 
case when the whole of space-time was assumed pseudo-Euclidean. We can
demand that at infinity, where the field vanishes, the mass tensor should also
become zero. It then has a uniquely defined meaning to inquire after the value
of the cosmological constant and the answer can be based on the following
consideration. According to our basic postulates, the absence of a gravitational field
signifies the absence of deviations of the geometry of space-time from the
Euclidean, and therefore also the vanishing of the curvature tensor $R^{\mu\nu}$ and of
its invariant $R$. On the other hand, the gravitational field will be absent if the
mass tensor $T^{\mu\nu}$ is zero everywhere. Therefore the equations $T^{\mu\nu} = 0$ and
$R^{\mu\nu} = 0$ must certainly be compatible and this is only possible if the equations
relating $R^{\mu\nu}$ to $T^{\mu\nu}$ do not contain the term $\lambda g^{\mu\nu}$ (i.e. if $\lambda = 0$).

Thus, given the assumptions formulated at the beginning of this section and 
given our definition of the mass tensor, the appropriate generalization of 
Poisson’s equation for the potential will just be equations (52.06). 
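The compatibility argument for the vanishing of the cosmological constant can be condensed into one line. The sketch below simply inserts the conditions of empty, pseudo-Euclidean space-time into the gravitational equations with the cosmological term; $\kappa$ and $\lambda$ denote the two constants of those equations.

```latex
\begin{align*}
R^{\mu\nu} - \tfrac{1}{2} g^{\mu\nu} R &= -\kappa T^{\mu\nu} - \lambda g^{\mu\nu} \\
T^{\mu\nu} = 0,\quad R^{\mu\nu} = 0,\quad R = 0
  &\quad\Longrightarrow\quad 0 = -\lambda g^{\mu\nu}
  \quad\Longrightarrow\quad \lambda = 0
\end{align*}
```

The last step uses only the fact that the metric tensor itself cannot vanish.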

As for equations (52.08), they should be used if the problem is stated
cosmologically, in which case the concept of spatial infinity is inapplicable and the
tensor $T^{\mu\nu}$ contains an unknown constant $\beta$, even after units have been fixed.
According to the value of this constant the value of the so-called cosmological
constant $\lambda$ must be selected; it is evidently related to $\beta$. The choice of some
particular value of $\lambda$ for a given normalization of $T^{\mu\nu}$ represents a special
hypothesis, which must be introduced explicitly; this is true also for the value
$\lambda = 0$.

† A proof of uniqueness for the electrodynamic and the electromagnetic cases is given in
the papers by C. Jankiewicz [47].

Returning to the non-cosmological case and to equations (52.06) we may
assert that under the conditions stated (correspondence with Poisson’s equation,
general covariance, linearity in the second derivatives of the $g_{\mu\nu}$, identical
vanishing of the left-hand side of (52.05) and Euclidean character in the absence
of masses) these equations are unique. 

The equations (52.06) are called Einstein’s gravitational equations ; they play 
a fundamental part in the theory of gravitation. They will be examined in the 
following sections. 

53. The Characteristics of Einstein’s Equations. The Speed of
Propagation of Gravitation

We begin our discussion of Einstein’s gravitational equations 

$$ R^{\mu\nu} - \tfrac{1}{2} g^{\mu\nu} R = -\kappa T^{\mu\nu} $$   (53.01)

with the derivation of the first order equation for their characteristics. From a
physical point of view the equation of the characteristics represents the
propagation law for the wave front of a gravitational wave.

Multiplying (53.01) by $g_{\mu\nu}$ and summing we obtain the relation

$$ R = \kappa T $$   (53.02)

connecting the invariants of the curvature tensor and of the mass tensor.
This relation enables us to write the gravitational equations in the form

$$ R^{\mu\nu} = -\kappa \left( T^{\mu\nu} - \tfrac{1}{2} g^{\mu\nu} T \right) $$   (53.03)
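The contraction that leads to the relation between the invariants, and from it to the second form of the gravitational equations, can be spelled out. The only additional fact used is that in four dimensions $g_{\mu\nu} g^{\mu\nu} = 4$; $R$ and $T$ denote the invariants $g_{\mu\nu} R^{\mu\nu}$ and $g_{\mu\nu} T^{\mu\nu}$.

```latex
\begin{align*}
g_{\mu\nu}\left( R^{\mu\nu} - \tfrac{1}{2} g^{\mu\nu} R \right)
  &= R - \tfrac{1}{2}\cdot 4\,R = -R = -\kappa T
  \quad\Longrightarrow\quad R = \kappa T \\
R^{\mu\nu} &= -\kappa T^{\mu\nu} + \tfrac{1}{2} g^{\mu\nu} R
  = -\kappa\left( T^{\mu\nu} - \tfrac{1}{2} g^{\mu\nu} T \right)
\end{align*}
```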

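In Appendix B the contravariant curvature tensor $R^{\mu\nu}$ is shown to be expressible as

$$ R^{\mu\nu} = -\tfrac{1}{2} g^{\alpha\beta} \frac{\partial^2 g^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} - \Gamma^{\mu\nu} + \Gamma^{\mu,\alpha\beta}\,\Gamma^{\nu}_{\alpha\beta} $$   (53.04)

where $\Gamma^{\mu,\alpha\beta}$ is the quantity obtained from $\Gamma^{\mu}_{\alpha\beta}$ by raising suffixes:

$$ \Gamma^{\mu,\alpha\beta} = g^{\alpha\rho} g^{\beta\sigma}\,\Gamma^{\mu}_{\rho\sigma} $$   (53.05)

Therefore, the last term in (53.04) does not involve second derivatives but is a
homogeneous quadratic form in the $\Gamma^{\mu}_{\alpha\beta}$ and hence also in the first derivatives
of the metric tensor.

Second derivatives appear in the first term and also in the $\Gamma^{\mu\nu}$, but the
latter dependence is only through first derivatives of the quantities

$$ \Gamma^{\nu} = g^{\alpha\beta}\,\Gamma^{\nu}_{\alpha\beta} $$   (53.06)

which were introduced in Section 41. We recall that the d’Alembertian of any
function $\psi$ may be written in the form

$$ \Box\psi = g^{\alpha\beta} \frac{\partial^2 \psi}{\partial x_\alpha\,\partial x_\beta} - \Gamma^{\nu} \frac{\partial \psi}{\partial x_\nu} $$   (53.07)

or, alternatively, as

$$ \Box\psi = \frac{1}{\sqrt{-g}} \frac{\partial}{\partial x_\alpha} \left( \sqrt{-g}\, g^{\alpha\beta} \frac{\partial \psi}{\partial x_\beta} \right) $$   (53.08)

whence

$$ \Gamma^{\alpha} = -\frac{1}{\sqrt{-g}} \frac{\partial}{\partial x_\beta} \left( \sqrt{-g}\, g^{\alpha\beta} \right) $$   (53.09)

and also

$$ \Gamma^{\alpha} = -\Box x_\alpha $$   (53.10)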
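The last relation follows on applying the first form of the d’Alembertian to a coordinate itself. In the sketch below $\psi$ is set equal to $x_\alpha$, treated as a function of the coordinates, and $\delta_{\alpha\nu}$ is the Kronecker symbol (a notation introduced here for illustration).

```latex
\begin{align*}
\frac{\partial^2 x_\alpha}{\partial x_\mu\,\partial x_\nu} = 0,
  \qquad \frac{\partial x_\alpha}{\partial x_\nu} = \delta_{\alpha\nu}
  \quad\Longrightarrow\quad
  \Box x_\alpha = -\Gamma^{\nu}\,\delta_{\alpha\nu} = -\Gamma^{\alpha}
\end{align*}
```

Hence the quantities $\Gamma^{\alpha}$ vanish exactly when each coordinate satisfies $\Box x_\alpha = 0$.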

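The $\Gamma^{\mu\nu}$ are obtained from the $\Gamma^{\nu}$ by a rule which is formally identical with the
rule for forming the symmetrical contravariant derivative of a vector:

$$ \Gamma^{\mu\nu} = \tfrac{1}{2} \left( \nabla^\mu \Gamma^\nu + \nabla^\nu \Gamma^\mu \right) $$   (53.11)

or in detail

$$ \Gamma^{\mu\nu} = \tfrac{1}{2} \left( g^{\mu\alpha} \frac{\partial \Gamma^{\nu}}{\partial x_\alpha} + g^{\nu\alpha} \frac{\partial \Gamma^{\mu}}{\partial x_\alpha} - \frac{\partial g^{\mu\nu}}{\partial x_\alpha}\,\Gamma^{\alpha} \right) $$   (53.12)

Of course, since $\Gamma^{\nu}$ is not a vector the $\Gamma^{\mu\nu}$ are not a tensor. This circumstance
proves very useful in simplifying Einstein’s equations.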

Einstein’s equations are generally covariant and therefore permit coordinate 
transformations involving four arbitrary functions. Suppose the equations
are solved in some arbitrary system of coordinates. We can then go over to
other coordinates by taking as independent variables four solutions of the
equation $\Box\psi = 0$. These solutions may be chosen in such a way as to satisfy
the inequalities to which the $g^{\mu\nu}$ must be subject, according to Section 35, and
they may also be subjected to some additional conditions. But as long as each
of the coordinates $x_0$, $x_1$, $x_2$ and $x_3$ satisfies the equation $\Box x_\alpha = 0$ we shall
have in that system

$$ \Gamma^{\nu} = 0 $$   (53.13)

and therefore also

$$ \Gamma^{\mu\nu} = 0 $$   (53.14)


We shall call such a coordinate system harmonic. At the moment we are not 
interested in the question of the uniqueness of the harmonic coordinate system 
or rather in the additional conditions which could guarantee uniqueness. This 
question will be considered in Section 93. Here it is only important to note that 
the equations (53.13) are compatible with Einstein’s equations and that they 
do not impose any essential limitation on the solutions of the latter, serving 
only to narrow down the class of permissible coordinate systems.†
Under the conditions (53.13) the expression for the $R^{\mu\nu}$ simplifies, becoming

$$ R^{\mu\nu} = -\tfrac{1}{2} g^{\alpha\beta} \frac{\partial^2 g^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} + \Gamma^{\mu,\alpha\beta}\,\Gamma^{\nu}_{\alpha\beta} $$   (53.15)

Hence second derivatives only appear combined in the d’Alembert operator
acting on the single quantity $g^{\mu\nu}$ which has the same indices as the particular
$R^{\mu\nu}$ on the left-hand side.

The form of the equation of the characteristics for any given system of
equations depends only on the terms containing the highest occurring order
of derivatives. In the case of the system (53.01) and (53.13) these terms are
just those involving the d’Alembertian.

† The conditions $\Gamma^{\nu} = 0$ were first introduced by de Donder [16] and by Lanczos [17].

Therefore the system of equations of gravitation will have the same characteristics
as d’Alembert’s equation,

$$\Box\psi = 0 \tag{53.16}$$

and these are easily found. As shown in Appendix C they have the form

$$g^{\mu\nu}\, \frac{\partial\omega}{\partial x_{\mu}}\, \frac{\partial\omega}{\partial x_{\nu}} = 0 \tag{53.17}$$

where

$$\omega(x_0, x_1, x_2, x_3) = \text{const} \tag{53.18}$$

is the equation of a wave front, i.e. the equation of a moving surface on which
any discontinuities of the gravitational field must lie.

The equation (53.17) for the propagation of a gravitational wave-front is the
same as the corresponding equation for the front of a light wave in empty
space on which the whole theory of space and time was based.† Briefly one
can say that gravitation is propagated with the speed of light.

That in Einstein’s theory gravitation is propagated with the speed of light 
is a fact of fundamental significance. It shows that the assumed form of the 
gravitational equations is in agreement with the general postulate of the 
Theory of Relativity according to which there exists a limiting velocity for 
the propagation of all types of action, namely the velocity of light in free space. 
The existence of a finite propagation velocity for gravity removes the contra¬ 
diction inherent in Newton’s theory of gravitation with its admission of 
instantaneous action at a distance. 
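
As a numerical illustration of the characteristic equation (53.17), the following sketch checks that a spherical front expanding with the speed of light satisfies it, while a front expanding at twice that speed does not. It is only a sketch under stated assumptions: the Galilean values $g^{00} = 1/c^2$, $g^{ik} = -\delta_{ik}$ are used (with $x_0 = t$), and the function names, the sample point and the step sizes are hypothetical choices made for the illustration.

```python
import math

C = 299_792_458.0  # speed of light, m/s


def front(t, x, y, z, speed):
    """Phase function omega = t - r/speed of a spherical front expanding at `speed`."""
    return t - math.sqrt(x * x + y * y + z * z) / speed


def normalized_residual(speed, point=(1.0, 3.0e8, 4.0e8, 1.2e9), ht=1.0e-3, hx=1.0e3):
    """c^2 * g^{mu nu} d_mu(omega) d_nu(omega) for Galilean g^{mu nu} (assumed values).

    Derivatives are estimated by central differences; the result is a pure number
    that should vanish on a true characteristic surface.
    """
    t, x, y, z = point
    d_t = (front(t + ht, x, y, z, speed) - front(t - ht, x, y, z, speed)) / (2.0 * ht)
    d_x = (front(t, x + hx, y, z, speed) - front(t, x - hx, y, z, speed)) / (2.0 * hx)
    d_y = (front(t, x, y + hx, z, speed) - front(t, x, y - hx, z, speed)) / (2.0 * hx)
    d_z = (front(t, x, y, z + hx, speed) - front(t, x, y, z - hx, speed)) / (2.0 * hx)
    # g^{00} = 1/c^2 and g^{ik} = -delta_ik, multiplied through by c^2
    return d_t**2 - C**2 * (d_x**2 + d_y**2 + d_z**2)


print(normalized_residual(C))        # close to zero: a genuine characteristic
print(normalized_residual(2.0 * C))  # close to 0.75: not a characteristic
```

The second value follows from $1 - (c/2c)^2 = 3/4$, so only fronts moving with the speed $c$ can carry discontinuities of the field.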


54. A Comparison with the Statement of the Problem in Newtonian 
Theory. Boundary Conditions 

In Newton’s theory of gravitation the gravitational potential $U$ satisfies the
equation

$$\Delta U = -4\pi\gamma\rho \tag{54.01}$$

and tends to zero at infinity in such a way that

$$\lim_{r \to \infty} rU = \gamma M \tag{54.02}$$

where $M$ is the total mass of all the bodies of the system in question and is
equal to

$$M = \int \rho \, dx \, dy \, dz \tag{54.03}$$
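
The limiting condition (54.02) can be illustrated numerically. The sketch below replaces the continuous density by a few point masses, so that the potential becomes a finite sum, and evaluates $rU/\gamma M$ at increasing distances along a fixed direction. The masses, positions and direction are hypothetical values chosen only for the illustration.

```python
import math

GAMMA = 6.674e-11  # Newtonian constant of gravitation, m^3 kg^-1 s^-2

# Hypothetical insular system: (mass in kg, position in m)
BODIES = [
    (2.0e30, (0.0, 0.0, 0.0)),
    (6.0e24, (1.5e11, 0.0, 0.0)),
    (1.9e27, (0.0, 7.8e11, 0.0)),
]
TOTAL_MASS = sum(mass for mass, _ in BODIES)


def potential(point):
    """U = gamma * sum m_k / |r - r_k|, the point-mass form of the potential."""
    return sum(GAMMA * mass / math.dist(point, pos) for mass, pos in BODIES)


def ratio(r, direction=(0.6, 0.0, 0.8)):
    """r U / (gamma M) at distance r along a fixed unit vector."""
    point = tuple(r * n for n in direction)
    return r * potential(point) / (GAMMA * TOTAL_MASS)


for r in (1.0e12, 1.0e14, 1.0e16):
    print(f"r = {r:.0e} m   rU/(gamma M) = {ratio(r):.10f}")
```

The ratio approaches unity as $r$ grows, the deviation being governed by the size of the system compared with $r$.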

Einstein’s theory, which is based on the gravitational equations

$$R^{\mu\nu} - \frac{1}{2}\, g^{\mu\nu} R = -\kappa T^{\mu\nu} \tag{54.04}$$

must, in first approximation, give the same result as Newton’s theory. Newtonian
theory is applicable to such mass distributions for which the total mass,
given by the integral (54.03) taken over all space, remains finite. This condition
is in particular satisfied by any mass distribution of insular character. We
use this term to describe the case in which all the masses of the system studied
are concentrated within some finite volume which is separated by very great
distances from all other masses not forming part of the system. When these
other masses are sufficiently far away one can neglect their influence on the
given system of masses, which then may be treated as isolated.

† When we derived that law from Maxwell’s equations in Section 3, we assumed space-time
to be Euclidean. But, according to a remark at the end of Appendix E, the same result can
be obtained without this assumption, starting from the generally covariant form of Maxwell’s
equations given in Section 46.

In formulating Einstein’s theory we shall likewise start from the assumption 
that the mass distribution is insular. This assumption makes it possible to 
impose definite limiting conditions at infinity as for Newtonian theory, and so 
makes the mathematical problem a determined one. 

Theoretically, other assumptions are also admissible. For instance, one can 
assume a mass distribution which on the average is uniform throughout space. 
Such a point of view is appropriate to the study of distances so enormous that 
in comparison even the distances between galaxies are taken to be very small. 
Very little is known of the mass distribution over such great distances and 
therefore a theory dealing with them will necessarily be less reliable and less 
capable of experimental verification than the theory of smaller scale
astronomical phenomena.

The bulk of this book will be devoted to the case of insular distributions of 
masses. The assumption of uniform distribution will be considered only in 
Sections 94 and 95, where we give the theory of Friedmann-Lobachevsky space 
to which this assumption leads. 

We shall thus now assume that space-time is in the main Euclidean, or
rather pseudo-Euclidean, and that any deviation of space-time geometry from
Euclidean geometry is a result of the presence of a gravitational field. Wherever
there is no gravitational field, geometry must be Euclidean. For an insular
distribution of masses the gravitational field must tend to zero at infinity and
therefore we have to assume that at points far removed from the masses the
geometry of space-time becomes Euclidean. However, when geometry is
Euclidean there exist Galilean coordinates $x$, $y$, $z$ and $t$, in terms of which the
square of the interval has the form

$$ds^2 = c^2\, dt^2 - (dx^2 + dy^2 + dz^2) \tag{54.05}$$

Since experiment shows that the geometry of space-time nowhere deviates 
greatly from Euclidean geometry one may expect that there should exist in 
the whole of space variables in terms of which the square of the interval deviates 
but little from (54.05). In the following we shall give a more precise definition 
of these quasi-Galilean coordinates. 

We note that Newton’s theory is simplest to formulate in just these Galilean 
coordinates, i.e. in an inertial frame of reference. Consequently Einstein’s 
theory, which is its generalization, should be compared with it in terms of 
coordinates with as similar properties as possible. 

Newtonian theory is non-relativistic and in passing from a relativistic theory
to a non-relativistic one, it is essential to single out the speed of light as a large
parameter. Therefore we shall no longer introduce the quantity $c$ into the
definition of the time-coordinate; instead of (35.03) we shall now write

$$x_0 = t; \quad x_1 = x; \quad x_2 = y; \quad x_3 = z \tag{54.06}$$

Thus henceforth the variable $x_0$ will mean simply the time $t$ and not $ct$ as
previously.

The expression (54.05) for the square of the interval now appears as

$$ds^2 = c^2\, dx_0^2 - (dx_1^2 + dx_2^2 + dx_3^2) \tag{54.07}$$

This must be valid at sufficiently large distance from the masses, where the
geometry is Euclidean.

Comparing with the general expression

$$ds^2 = g_{\mu\nu}\, dx_{\mu}\, dx_{\nu} \tag{54.08}$$

we find that the $g_{\mu\nu}$ must have the following limiting values at infinity

$$(g_{00})_{\infty} = c^2; \quad (g_{0i})_{\infty} = 0; \quad (g_{ik})_{\infty} = -\delta_{ik} \quad (i, k = 1, 2, 3) \tag{54.09}$$

The corresponding limiting values of the contravariant components of the
metric tensor will be

$$(g^{00})_{\infty} = \frac{1}{c^2}; \quad (g^{0i})_{\infty} = 0; \quad (g^{ik})_{\infty} = -\delta_{ik} \tag{54.10}$$

These are then to be considered as the boundary conditions on the metric
tensor.

However, the number of boundary conditions so far stated is insufficient;
some additional ones must be added which characterize the asymptotic behaviour
of the differences $g^{\mu\nu} - (g^{\mu\nu})_{\infty}$ at large distances from the masses.

In the previous section we saw that, at least if $\Gamma^{\nu} = 0$, Einstein’s equations
are of the type of the wave equation, because their main terms involve the
d’Alembert operator. Outside the mass distribution the tensor $T^{\mu\nu}$ vanishes
and the equations take on the form

$$R^{\mu\nu} = 0 \tag{54.11}$$

where, provided $\Gamma^{\nu} = 0$, the tensor $R^{\mu\nu}$ has the form

$$R^{\mu\nu} = -\frac{1}{2}\, g^{\alpha\beta}\, \frac{\partial^2 g^{\mu\nu}}{\partial x_{\alpha}\, \partial x_{\beta}} + \Gamma^{\mu,\alpha\beta}\, \Gamma^{\nu}_{\alpha\beta} \tag{54.12}$$

We assume that at large distances the differences $g^{\mu\nu} - (g^{\mu\nu})_{\infty}$ and their first
and second derivatives tend to zero as $1/r$, where $r = \sqrt{x_1^2 + x_2^2 + x_3^2}$. (This
assumption will be justified in the following.) Then at large distances the
second term of (54.12), being a homogeneous quadratic form in the first derivatives,
will tend to zero as $1/r^2$. As for the term involving the d’Alembertian,
the coefficients in it can be replaced by their limiting values to the same
approximation. After these simplifications we get

$$R^{\mu\nu} = -\frac{1}{2c^2}\, \frac{\partial^2 g^{\mu\nu}}{\partial x_0^2} + \frac{1}{2} \left( \frac{\partial^2 g^{\mu\nu}}{\partial x_1^2} + \frac{\partial^2 g^{\mu\nu}}{\partial x_2^2} + \frac{\partial^2 g^{\mu\nu}}{\partial x_3^2} \right) \tag{54.13}$$


A more complete investigation of the asymptotic behaviour of the $g^{\mu\nu}$ will
be given in Section 87. It shows that the asymptotic form of $g^{\mu\nu}$ is influenced
by the terms of order $1/r^2$ omitted from (54.13) but that qualitatively the
behaviour of the difference $g^{\mu\nu} - (g^{\mu\nu})_{\infty}$ will be the same as the behaviour of a
function $\psi$ satisfying the wave equation

$$\Delta\psi - \frac{1}{c^2}\, \frac{\partial^2 \psi}{\partial t^2} = 0 \tag{54.14}$$

where $\Delta$ is the usual Euclidean Laplace operator.

We are interested in solutions of the wave equation (54.14) which correspond
to outgoing waves dying off at infinity. They have the asymptotic form

$$\psi = \frac{1}{r}\, f\!\left(t - \frac{r}{c},\, \mathbf{n}\right) \tag{54.15}$$

where $\mathbf{n}$ is a unit vector with the components

$$n_x = \frac{x}{r}; \quad n_y = \frac{y}{r}; \quad n_z = \frac{z}{r} \tag{54.16}$$

and $f$ is an arbitrary function. The function $f$ and its derivatives with respect
to all its arguments are assumed finite. The argument $\mathbf{n}$ gives the dependence
of $f$ on the direction along which a point recedes to infinity.

Other possible solutions of the wave equations must be discarded for physical
reasons, for in our statement of the problem the system is considered to be
isolated. This means that no waves impinge on it from outside, all waves have
bodies of the system as their sources and, since in a system of insular type all
bodies are concentrated in a finite region, all waves originate in this region
and so have the asymptotic form (54.15) at large distances from the region.

The condition that a solution of the wave equations should at large distances
have the form indicated can be stated in the differential form

$$\lim_{r \to \infty} \left( \frac{\partial (r\psi)}{\partial r} + \frac{1}{c}\, \frac{\partial (r\psi)}{\partial t} \right) = 0 \tag{54.17}$$

This condition must hold for $r \to \infty$ and all values of $t_0 = t + r/c$ in an
arbitrary fixed interval. It can be called the condition of outward radiation.
It ensures the uniqueness of the solution provided it is associated with the
requirement that the function $\psi$ and its first derivatives with respect to $x$, $y$, $z$
and $t$ should be everywhere bounded and should die off at infinity as $1/r$ (see
Section 92).
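
The condition of outward radiation (54.17) can be tested numerically on spherically symmetric waves. In the sketch below the pulse shape `profile` is a hypothetical choice, the dependence on the direction $\mathbf{n}$ is dropped, and the left-hand side of (54.17), multiplied by $c$ to make it dimensionless, is estimated by central differences for an outgoing and for an incoming wave.

```python
import math

C = 299_792_458.0  # speed of light, m/s


def profile(s):
    """Hypothetical smooth pulse shape f(s); any differentiable function would do."""
    return math.exp(-s * s)


def psi_outgoing(t, r):
    """Outgoing wave of the form (54.15), without angular dependence."""
    return profile(t - r / C) / r


def psi_incoming(t, r):
    """Incoming wave, which the radiation condition is meant to exclude."""
    return profile(t + r / C) / r


def radiation_residual(psi, t, r, ht=1.0e-6, hr=1.0e2):
    """c * [d(r psi)/dr + (1/c) d(r psi)/dt], estimated by central differences."""
    d_r = ((r + hr) * psi(t, r + hr) - (r - hr) * psi(t, r - hr)) / (2.0 * hr)
    d_t = r * (psi(t + ht, r) - psi(t - ht, r)) / (2.0 * ht)
    return C * d_r + d_t


R = 3.0e8  # about one light-second from the source, in metres
print(radiation_residual(psi_outgoing, 1.5, R))   # close to zero
print(radiation_residual(psi_incoming, -0.5, R))  # clearly different from zero
```

For the outgoing wave $r\psi$ depends only on $t - r/c$, so the two terms cancel identically; for the incoming wave they add.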

We note that the above considerations refer strictly speaking to the ordinary
wave equation (54.14) and not to Einstein’s equations. Therefore the asymptotic
form of the difference

$$g^{\mu\nu} - (g^{\mu\nu})_{\infty} \tag{54.18}$$

will, in fact, differ somewhat from (54.15). However, a slightly modified condition
of outward radiation written in the differential form (54.17) will still be
valid for (54.18).

Summing up, we can say that in our statement of the problem the metric 
tensor must satisfy the condition of being Euclidean at infinity and also the 
condition of outward radiation. 


55. Solution of Einstein’s Gravitational Equations in First Approximation
and Determination of the Constant

To compare the gravitational theories of Einstein and of Newton we must
first of all determine the constant $\kappa$ which enters Einstein’s gravitational
equations

$$R^{\mu\nu} - \frac{1}{2}\, g^{\mu\nu} R = -\kappa T^{\mu\nu} \tag{55.01}$$

The value of this constant can be found by comparing the expression for the 
square of the interval derived in Newtonian approximation in Section 51 with 
that obtainable by approximately solving Einstein’s equations. 

For the mass tensor on the right-hand side of (55.01) we may take the
approximate expression corresponding to a Euclidean metric that was discussed
in Section 32. There the mass term was given explicitly for the case
of an elastic body. In taking over the equations of that section we must remember
that we now understand $x_0$ to mean simply the time $t$ and not $ct$ as
previously. Therefore the previous $T^{00}$ will in the new notation be equal to
$c^2 T^{00}$ and the previous $T^{0i}$ to the new $cT^{0i}$, the meaning of $T^{ik}$ being unaltered.
Thus if $x_0 = t$ the equations (32.34) become

$$\begin{aligned}
c^2 T^{00} &= \rho + \frac{1}{c^2} \left( \tfrac{1}{2}\rho v^2 + \rho\Pi \right) \\
c^2 T^{0i} &= \rho v_i + \frac{1}{c^2} \left\{ v_i \left( \tfrac{1}{2}\rho v^2 + \rho\Pi \right) - \sum_{k=1}^{3} p_{ik} v_k \right\} \\
c^2 T^{ik} &= \rho v_i v_k - p_{ik}
\end{aligned} \tag{55.02}$$

In the present approximation we must disregard the terms corresponding to the
density and current of energy (Umov’s scalar and vector) and we write simply

$$c^2 T^{00} = \rho; \quad c^2 T^{0i} = \rho v_i \tag{55.03}$$

To the same accuracy to which this is valid we may replace the invariant of
the mass tensor by the value

$$T = \rho \tag{55.04}$$

Equations (55.03) and (55.04) enable us to calculate the approximate values
of the tensor components entering the right-hand side of Einstein’s equations
written in the form

$$R^{\mu\nu} = -\kappa \left( T^{\mu\nu} - \tfrac{1}{2}\, g^{\mu\nu} T \right) \tag{55.05}$$

which was given in (53.03). Using the Galilean values of the $g^{\mu\nu}$ we get

$$\begin{aligned}
T^{00} - \tfrac{1}{2}\, g^{00} T &= \frac{1}{2c^2}\, \rho \\
T^{0i} - \tfrac{1}{2}\, g^{0i} T &= \frac{1}{c^2}\, \rho v_i \\
T^{ik} - \tfrac{1}{2}\, g^{ik} T &= \tfrac{1}{2}\, \rho\, \delta_{ik}
\end{aligned} \tag{55.06}$$

On the other hand, according to (54.13), if we use harmonic coordinates
we have approximately,

$$R^{\mu\nu} = \frac{1}{2}\, \Delta g^{\mu\nu} - \frac{1}{2c^2}\, \frac{\partial^2 g^{\mu\nu}}{\partial t^2} \tag{55.07}$$

where $\Delta$ is the usual Euclidean Laplace operator. As we shall be interested in
a quasi-static solution we can discard the term involving the second time
derivative. Inserting (55.07) and (55.06) into (55.05) we have

$$\begin{aligned}
\Delta g^{00} &= -\frac{\kappa}{c^2}\, \rho \\
\Delta g^{0i} &= -\frac{2\kappa}{c^2}\, \rho v_i \\
\Delta g^{ik} &= -\kappa \rho\, \delta_{ik}
\end{aligned} \tag{55.08}$$

We now refer back to the expression for the squared interval in Newtonian
approximation. According to (51.10) we have then

$$g_{00} = c^2 - 2U \tag{55.09}$$

where $U$ is the Newtonian potential. In this approximation the remaining
components of the metric tensor are to be replaced by their Galilean values.
Using the relation

$$g_{00}\, g^{00} + \sum_{i=1}^{3} g_{0i}\, g^{0i} = 1 \tag{55.10}$$

and the fact that the products $g_{0i}\, g^{0i}$ are very small compared to unity† we can
take

$$g_{00}\, g^{00} = 1 \tag{55.11}$$

and therefore

$$g^{00} = \frac{1}{c^2} + \frac{2U}{c^4} \tag{55.12}$$

Since $U$ satisfies the equation

$$\Delta U = -4\pi\gamma\rho \tag{55.13}$$

we have hence

$$\Delta g^{00} = -\frac{8\pi\gamma}{c^4}\, \rho \tag{55.14}$$

Comparing this with the first equation in (55.08) we see that the two are
coincident, if Einstein’s gravitational constant $\kappa$ is related to Newton’s constant
$\gamma$ by the relation

$$\kappa = \frac{8\pi\gamma}{c^2} \tag{55.15}$$
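
For orientation, the relation (55.15) may be evaluated numerically. The sketch below assumes SI values for $\gamma$ and $c$; the function name is a hypothetical choice. Because the mass tensor is used here rather than the energy tensor, the constant has the dimensions of length divided by mass.

```python
import math

GAMMA = 6.674e-11   # Newtonian constant of gravitation, m^3 kg^-1 s^-2 (assumed value)
C = 299_792_458.0   # speed of light, m/s


def einstein_constant(gamma=GAMMA, c=C):
    """kappa = 8 pi gamma / c^2, the relation (55.15)."""
    return 8.0 * math.pi * gamma / c**2


kappa = einstein_constant()
print(f"kappa = {kappa:.4e} m/kg")

# Consistency of the first equation of (55.08) with (55.14):
# the coefficient kappa / c^2 must equal 8 pi gamma / c^4.
print(math.isclose(kappa / C**2, 8.0 * math.pi * GAMMA / C**4))
```

The value is of the order of $10^{-26}$ m/kg, which shows how weakly ordinary mass densities affect the metric.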


The Newtonian potential $U$ is that solution of (55.13) which satisfies the
appropriate boundary conditions at infinity. As is well known, that solution
can be written in the form of a volume integral:

$$U = \gamma \int \frac{\rho'}{|\mathbf{r} - \mathbf{r}'|}\, dx'\, dy'\, dz' \tag{55.16}$$

Side by side with this Newtonian potential we introduce the functions

$$U_i = \gamma \int \frac{(\rho v_i)'}{|\mathbf{r} - \mathbf{r}'|}\, dx'\, dy'\, dz' \tag{55.17}$$

which satisfy

$$\Delta U_i = -4\pi\gamma \rho v_i \tag{55.18}$$

and also the conditions at infinity. In analogy with the corresponding electromagnetic
quantities these functions may be called gravitational vector potentials.
Now the solution of (55.08) can be written as

$$g^{00} = \frac{1}{c^2} + \frac{2U}{c^4}; \quad g^{0i} = \frac{4}{c^4}\, U_i; \quad g^{ik} = -\left( 1 - \frac{2U}{c^2} \right) \delta_{ik} \tag{55.19}$$

We have used (55.15) to eliminate $\kappa$.

† An estimate of these terms will be given later.

We note that $U$ has the dimensions of a velocity squared and the $U_i$ the
dimensions of a velocity cubed. In estimating the orders of magnitude of
quantities we can take $U$ to be of the order $q^2$ and the $U_i$ of the order $q^3$, where
$q$ is some speed much smaller than the speed of light.

It is now purely a matter of algebra to obtain from the contravariant components
of the metric tensor its covariant components, its determinant, etc.

To simplify the algebraic manipulations we introduce a system of quantities

$$a_{ik} = -g_{ik} + \frac{g_{0i}\, g_{0k}}{g_{00}}; \quad a^{ik} = -g^{ik} \tag{55.20}$$

where $i, k = 1, 2, 3$. It is easy to verify that

$$\sum_{m=1}^{3} a_{im}\, a^{mk} = \delta_{ik} \tag{55.21}$$

The set of quantities $a_{ik}$ may be interpreted as a three-dimensional spatial
metric tensor, but only its algebraic properties are of importance to us here.

If we put

$$a = \operatorname{Det} a_{ik} \tag{55.22}$$

and therefore

$$\frac{1}{a} = \operatorname{Det} a^{ik} \tag{55.23}$$

we get

$$g = -a\, g_{00} \tag{55.24}$$

It follows directly from the definitions (55.20) that

$$g_{00}\, g^{0k} = \sum_{m=1}^{3} a^{mk}\, g_{m0} \tag{55.25}$$

and also

$$g_{i0} = g_{00} \sum_{k=1}^{3} a_{ik}\, g^{0k} \tag{55.26}$$

If the $g^{\mu\nu}$ have the values (55.19) it follows that

$$a^{ik} = \left( 1 - \frac{2U}{c^2} \right) \delta_{ik} \tag{55.27}$$

and therefore

$$a_{ik} = \left( 1 + \frac{2U}{c^2} \right) \delta_{ik} \tag{55.28}$$

Noting the form of $g_{00}$ we get

$$g_{00}\, a_{ik} = c^2\, \delta_{ik} \tag{55.29}$$

with an error of order higher than $U/c^2$.

Hence, with the same relative error, we have

$$g_{i0} = c^2 g^{0i} \tag{55.30}$$

Using these results we obtain for the covariant components of the metric tensor

$$g_{00} = c^2 - 2U; \quad g_{0i} = \frac{4}{c^2}\, U_i; \quad g_{ik} = -\left( 1 + \frac{2U}{c^2} \right) \delta_{ik} \tag{55.31}$$

Knowing the approximate values of the $g_{0i}$ and the $g^{0i}$ we can now verify the
accuracy to which the use of (55.11) was justified. We have, approximately

$$g_{00}\, g^{00} = 1 - \frac{16}{c^6} \sum_{i=1}^{3} U_i^2 \tag{55.32}$$

If the $U_i$ are of order $q^3$ the above expression differs from unity by quantities
of order $q^6/c^6$. Therefore (55.11) may be used not only in this but also in the
next higher approximation in $U/c^2$ or $v^2/c^2$. We note that (55.26) leads to

$$g_{00}\, g^{00} = 1 - g_{00} \sum_{i,k=1}^{3} a_{ik}\, g^{0i} g^{0k} \tag{55.33}$$

Here $g_{00}$ is positive and the double sum is a positive definite quadratic form,
therefore we shall always have, quite rigorously

$$g_{00}\, g^{00} \le 1 \tag{55.34}$$

though, as we have seen, the deviation from unity of the left-hand side is
exceedingly small.
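
The consistency of the first-order covariant and contravariant components can be checked numerically: their product should reproduce the unit matrix up to terms of second order in $U/c^2$. The sketch below assumes the first-order forms $g_{00} = c^2 - 2U$, $g_{0i} = 4U_i/c^2$, $g_{ik} = -(1 + 2U/c^2)\delta_{ik}$ and $g^{00} = 1/c^2 + 2U/c^4$, $g^{0i} = 4U_i/c^4$, $g^{ik} = -(1 - 2U/c^2)\delta_{ik}$. It works in units with $c = 1$, and the sample values of $U$ and $U_i$ are hypothetical.

```python
def first_order_metric(u, u_vec, c=1.0):
    """Return (covariant, contravariant) 4x4 components in first approximation."""
    cov = [[0.0] * 4 for _ in range(4)]
    con = [[0.0] * 4 for _ in range(4)]
    cov[0][0] = c**2 - 2.0 * u
    con[0][0] = 1.0 / c**2 + 2.0 * u / c**4
    for i in range(1, 4):
        cov[0][i] = cov[i][0] = 4.0 * u_vec[i - 1] / c**2
        con[0][i] = con[i][0] = 4.0 * u_vec[i - 1] / c**4
        cov[i][i] = -(1.0 + 2.0 * u / c**2)
        con[i][i] = -(1.0 - 2.0 * u / c**2)
    return cov, con


def max_deviation_from_identity(cov, con):
    """Largest |sum_a cov[m][a] * con[a][n] - delta_mn| over all index pairs."""
    worst = 0.0
    for m in range(4):
        for n in range(4):
            s = sum(cov[m][a] * con[a][n] for a in range(4))
            worst = max(worst, abs(s - (1.0 if m == n else 0.0)))
    return worst


U = 2.12e-6                   # U/c^2, roughly the value at the surface of the Sun
U_VEC = (2.0e-11, 0.0, 0.0)   # hypothetical U_i/c^3 for slow motion
cov, con = first_order_metric(U, U_VEC)
print(max_deviation_from_identity(cov, con))  # of the order of (U/c^2)^2
```

The deviation is of the order of $4U^2/c^4$, in agreement with the statement that the error is of higher order than $U/c^2$.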

We now state the value of the determinant $g$ and of $\sqrt{-g}$ times the contravariant
components of the metric tensor. We shall write

$$\mathfrak{g}^{\mu\nu} = \sqrt{-g}\; g^{\mu\nu} \tag{55.35}$$

We then have

$$-g = c^2 + 4U \tag{55.36}$$

and therefore

$$\sqrt{-g} = c + \frac{2U}{c} \tag{55.37}$$

Hence, using (55.35)

$$\mathfrak{g}^{00} = \frac{1}{c} + \frac{4U}{c^3}; \quad \mathfrak{g}^{0i} = \frac{4}{c^3}\, U_i; \quad \mathfrak{g}^{ik} = -c\, \delta_{ik} \tag{55.38}$$

We must now estimate the magnitude of the neglected terms in $R^{\mu\nu}$ which
are quadratic in the first derivatives.

These terms are of the form $\Gamma^{\mu,\alpha\beta}\, \Gamma^{\nu}_{\alpha\beta}$. To calculate approximate values of
the Christoffel symbols we could use the approximate form of the metric tensor
that has just been derived. However, we shall not perform these calculations
here as the quadratic terms will be determined in detail in Chapter VI where
we shall solve the gravitational equations in the next approximation. Here
we only need the order of magnitude of the quadratic terms. It turns out
that the terms in $R^{00}$ and $R^{0i}$ will be of sixth and those in $R^{ik}$ of fourth order
in $1/c$. In our present approximation these terms have no influence.

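It remains to verify whether the conditions which ensure that the coordinates
are harmonic,

$$\Gamma^{\nu} = -\frac{1}{\sqrt{-g}}\, \frac{\partial \left( \sqrt{-g}\; g^{\mu\nu} \right)}{\partial x_{\mu}} = 0 \tag{55.39}$$

are satisfied to the approximation required. Let us first make it clear to what
accuracy we require them to hold. If we do not omit the terms in $\Gamma^{\mu\nu}$ in the
expression (53.04) for $R^{\mu\nu}$ but instead retain them to the accuracy corresponding
to the approximation (55.07) for the other terms, we obtain in place of (55.07)

$$\begin{aligned}
R^{00} &= \frac{1}{2}\, \Delta g^{00} - \frac{1}{2c^2}\, \frac{\partial^2 g^{00}}{\partial t^2} - \frac{1}{c^2}\, \frac{\partial \Gamma^0}{\partial t} \\
R^{0i} &= \frac{1}{2}\, \Delta g^{0i} - \frac{1}{2c^2}\, \frac{\partial^2 g^{0i}}{\partial t^2} + \frac{1}{2} \left( \frac{\partial \Gamma^0}{\partial x_i} - \frac{1}{c^2}\, \frac{\partial \Gamma^i}{\partial t} \right) \\
R^{ik} &= \frac{1}{2}\, \Delta g^{ik} - \frac{1}{2c^2}\, \frac{\partial^2 g^{ik}}{\partial t^2} + \frac{1}{2} \left( \frac{\partial \Gamma^i}{\partial x_k} + \frac{\partial \Gamma^k}{\partial x_i} \right)
\end{aligned} \tag{55.40}$$

In order that the previously neglected terms in $\Gamma^{\nu}$ should really be small compared
to terms of the type $\Delta g^{\mu\nu}$ which were retained, it is necessary that $\Gamma^0$
should be of a higher order of smallness in $1/c$ than $1/c^4$ and $\Gamma^i$ of a higher order
than $1/c^2$. These conditions are indeed satisfied. For it is directly evident
from (55.38) that the $\Gamma^i$ will be of fourth order in $1/c$. As for $\Gamma^0$, the terms of
fourth order in it are

$$\Gamma^0 = -\frac{4}{c^4} \left( \frac{\partial U}{\partial t} + \sum_{i=1}^{3} \frac{\partial U_i}{\partial x_i} \right) \tag{55.41}$$

These must vanish.

Therefore the equation

$$\frac{\partial U}{\partial t} + \sum_{i=1}^{3} \frac{\partial U_i}{\partial x_i} = 0 \tag{55.42}$$

must hold. As is evident from the definition of $U$ and the $U_i$ (either by means
of differential equations with boundary conditions or in terms of volume
integrals) this equation is indeed satisfied as a consequence of the relation

$$\frac{\partial \rho}{\partial t} + \sum_{i=1}^{3} \frac{\partial (\rho v_i)}{\partial x_i} = 0 \tag{55.43}$$

which expresses the law of mass conservation in Newtonian approximation.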
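
The step from mass conservation to the harmonic condition can be made explicit. The following sketch assumes that $U$ and the $U_i$ are given by the volume integrals $U = \gamma\int \rho'\,|\mathbf{r}-\mathbf{r}'|^{-1}\,dx'\,dy'\,dz'$ and $U_i = \gamma\int (\rho v_i)'\,|\mathbf{r}-\mathbf{r}'|^{-1}\,dx'\,dy'\,dz'$, that the mass distribution is insular so that surface terms vanish on integrating by parts, and that primes denote quantities taken at the integration point $\mathbf{r}'$:

```latex
\begin{aligned}
\frac{\partial U}{\partial t} + \sum_{i=1}^{3} \frac{\partial U_i}{\partial x_i}
 &= \gamma \int \left[ \frac{1}{|\mathbf{r}-\mathbf{r}'|}\,\frac{\partial \rho'}{\partial t}
    + \sum_{i=1}^{3} (\rho v_i)'\, \frac{\partial}{\partial x_i}\, \frac{1}{|\mathbf{r}-\mathbf{r}'|} \right] dx'\,dy'\,dz' \\
 &= \gamma \int \left[ \frac{1}{|\mathbf{r}-\mathbf{r}'|}\,\frac{\partial \rho'}{\partial t}
    - \sum_{i=1}^{3} (\rho v_i)'\, \frac{\partial}{\partial x_i'}\, \frac{1}{|\mathbf{r}-\mathbf{r}'|} \right] dx'\,dy'\,dz' \\
 &= \gamma \int \frac{1}{|\mathbf{r}-\mathbf{r}'|} \left[ \frac{\partial \rho'}{\partial t}
    + \sum_{i=1}^{3} \frac{\partial (\rho v_i)'}{\partial x_i'} \right] dx'\,dy'\,dz' = 0 .
\end{aligned}
```

The second line uses the fact that $|\mathbf{r}-\mathbf{r}'|^{-1}$ depends only on the difference of the coordinates, the third line an integration by parts, and the final equality the equation of continuity taken at the point $\mathbf{r}'$.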

Thus the expressions just derived for the metric tensor satisfy to first approximation
not only the gravitational equations but also the “harmonic conditions”.
In addition they obviously satisfy the boundary conditions at
infinity. The expression for the square of the elementary interval corresponding
to the expressions derived is

$$ds^2 = (c^2 - 2U)\, dt^2 - \left( 1 + \frac{2U}{c^2} \right) (dx_1^2 + dx_2^2 + dx_3^2) + \frac{8}{c^2}\, (U_1\, dx_1 + U_2\, dx_2 + U_3\, dx_3)\, dt \tag{55.44}$$

Usually the terms involving the products $dx_i\, dt$ are of no importance. Omitting
them we get the expression

$$ds^2 = (c^2 - 2U)\, dt^2 - \left( 1 + \frac{2U}{c^2} \right) (dx_1^2 + dx_2^2 + dx_3^2) \tag{55.45}$$

which involves only the Newtonian potential. This expression has already
been quoted without proof in Section 51, equation (51.11).
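
As an illustration of how an interval of the form $ds^2 = (c^2 - 2U)\,dt^2 - (1 + 2U/c^2)(dx_1^2 + dx_2^2 + dx_3^2)$ is used, the sketch below computes the rate $d\tau/dt = (1/c)\, ds/dt$ of a clock at potential $U$ moving with speed $v$, and compares a clock at rest on the Earth's surface with one in a high circular orbit. The numerical values (the product $\gamma M$ for the Earth, the radii and the orbital speed) are assumed round figures, the Earth's rotation is ignored, and the function names are hypothetical.

```python
import math

C = 299_792_458.0        # speed of light, m/s
GM_EARTH = 3.986004e14   # gamma * M for the Earth, m^3/s^2 (assumed value)


def newtonian_potential(r, gm=GM_EARTH):
    """U = gamma M / r outside a spherical body (positive, as in the text)."""
    return gm / r


def proper_time_rate(u, v, c=C):
    """d(tau)/dt for a clock at potential u moving with coordinate speed v."""
    return math.sqrt(1.0 - 2.0 * u / c**2 - (1.0 + 2.0 * u / c**2) * v**2 / c**2)


ground = proper_time_rate(newtonian_potential(6.371e6), 0.0)
orbit = proper_time_rate(newtonian_potential(2.656e7), 3.874e3)
drift_per_day = (orbit - ground) * 86_400.0
print(f"orbiting clock gains about {drift_per_day * 1e6:.1f} microseconds per day")
```

The gain is positive because the smaller potential at the greater height outweighs the slowing due to the orbital motion.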

56. The Gravitational Equations in the Static Case and Conformal 
Space 

The metric tensor is called static if its components do not depend on the
time coordinate $x_0 = t$, so that

$$\frac{\partial g_{\mu\nu}}{\partial t} = 0 \tag{56.01}$$

and if, in addition,

$$g_{0i} = 0 \quad (i = 1, 2, 3) \tag{56.02}$$

It is evident from physical considerations that if several masses are present
they must be in motion.† Therefore a static metric tensor can only occur in
the case of a single mass. In spite of limited applicability, the static case is of
some physical interest, first of all because it permits a deeper insight into the
analogy with the Newtonian theory of gravitation (which is also a static theory)
and also because in the static case it is easy to discuss the question of the
uniqueness of the solution. Also, rigorous solutions of Einstein’s equations can
be found in this case.

Under the conditions (56.01) and (56.02) the time coordinate is determined
uniquely, while the space coordinates permit a group of transformations among
themselves. Therefore it is natural, in this problem, to use the apparatus of
three-dimensional tensor analysis and to write the gravitational equations
accordingly. Three-dimensional tensor analysis can be applied either directly
to the spatial part of $ds^2$, or else to this spatial part multiplied by some factor.
Remembering the approximate form of (55.29) obtained from Einstein’s
equations, we introduce into the spatial part a factor inversely proportional to
the factor in the time part, putting

$$ds^2 = c^2 V^2\, dt^2 - \frac{1}{V^2}\, h_{ik}\, dx_i\, dx_k \tag{56.03}$$

We shall consider the quantity $V^2$ to be a three-dimensional scalar and the
quadratic form

$$d\sigma^2 = h_{ik}\, dx_i\, dx_k \tag{56.04}$$

to be the squared length of arc in a certain auxiliary space, which we shall call
conformal space. Three-dimensional tensor analysis will be applied to this
conformal space. As may be seen by comparing (56.03) with (55.45), in the
approximation in which the latter holds we can assume the conformal space
to be Euclidean and the quantity $V^2$ to be related to the Newtonian potential $U$.

† The problem of the motion of a system of masses is considered in detail in Chapter VI.

Thus we shall have

$$g_{00} = c^2 V^2, \quad g_{0i} = 0, \quad g_{ik} = -\frac{h_{ik}}{V^2} \tag{56.05}$$

and also

$$g^{00} = \frac{1}{c^2 V^2}, \quad g^{0i} = 0, \quad g^{ik} = -V^2 h^{ik} \tag{56.06}$$

Here the quantities $h_{ik}$ and $h^{ik}$ are connected by

$$h_{im}\, h^{mk} = h_i^k \qquad (h_i^k = \delta_i^k) \tag{56.07}$$

this relation being analogous to (37.18) for the $g_{\mu\nu}$. Denoting by $h$ the determinant
of the $h_{ik}$ we easily obtain

$$\sqrt{-g} = \frac{c}{V^2}\, \sqrt{h} \tag{56.08}$$

Therefore we have

$$\sqrt{-g}\; g^{00} = \frac{\sqrt{h}}{c V^4}; \quad \sqrt{-g}\; g^{ik} = -c \sqrt{h}\; h^{ik} \tag{56.09}$$

and the d’Alembert operator (41.11) applied to a function $\psi$ may be written as

$$\Box\psi = \frac{1}{c^2 V^2}\, \frac{\partial^2 \psi}{\partial t^2} - V^2 (\Delta\psi)_h \tag{56.10}$$

where $(\Delta\psi)_h$ denotes the Laplace operator in the conformal space:

$$(\Delta\psi)_h = \frac{1}{\sqrt{h}}\, \frac{\partial}{\partial x_i} \left( \sqrt{h}\; h^{ik}\, \frac{\partial \psi}{\partial x_k} \right) \tag{56.11}$$

Hence we see that spatial coordinates that are harmonic in the four-dimensional
sense will also be harmonic in the three-dimensional conformal space.

We denote the four-dimensional Christoffel symbols (formed from the metric
tensor $g_{\mu\nu}$) by $(\Gamma^{\rho}_{\mu\nu})_g$ and the three-dimensional Christoffel symbols (formed
from the metric tensor $h_{ik}$) by $(\Gamma^{l}_{ik})_h$. Similarly we shall attach suffixes $g$
and $h$ respectively to quantities that are tensors in relation to the metrics
$(g_{\mu\nu})$ and $(h_{ik})$.

The four-dimensional Christoffel symbols and the curvature tensor can be
expressed in terms of the corresponding three-dimensional quantities.

These expressions will involve derivatives of the three-dimensional scalar $V$,
which we will denote by

$$V_k = \frac{\partial V}{\partial x_k}, \qquad (V^i)_h = h^{ik} V_k \tag{56.12}$$

The suffix $h$ attached to the $V^i$ will sometimes be omitted for brevity.
The Christoffel symbols with purely spatial indices will be

$$(\Gamma^{l}_{ik})_g = (\Gamma^{l}_{ik})_h - \frac{1}{V} \left( h_i^l V_k + h_k^l V_i - h_{ik} V^l \right) \tag{56.13}$$

If one or all three indices are zero, the Christoffel symbols become zero:

$$(\Gamma^{0}_{ik})_g = 0, \quad (\Gamma^{i}_{0k})_g = 0, \quad (\Gamma^{0}_{00})_g = 0 \tag{56.14}$$

Finally, if two of the indices are zero we get

$$(\Gamma^{i}_{00})_g = c^2 V^3 (V^i)_h, \qquad (\Gamma^{0}_{0i})_g = \frac{V_i}{V} \tag{56.15}$$

Using the general formula

$$R^{\rho}_{\mu,\alpha\beta} = \frac{\partial \Gamma^{\rho}_{\mu\alpha}}{\partial x_{\beta}} - \frac{\partial \Gamma^{\rho}_{\mu\beta}}{\partial x_{\alpha}} + \Gamma^{\sigma}_{\mu\alpha} \Gamma^{\rho}_{\sigma\beta} - \Gamma^{\sigma}_{\mu\beta} \Gamma^{\rho}_{\sigma\alpha} \tag{56.16}$$

we can express the four-dimensional curvature tensor in terms of a three-dimensional
tensor and the covariant derivatives of the three-dimensional
scalar $V$. Leaving out elementary, though rather tedious, calculations we obtain
for the components with four spatial indices

$$\begin{aligned}
(R^{l}_{i,mk})_g = (R^{l}_{i,mk})_h &+ h_{im}\, \frac{1}{V}\, (V^{l}_{k})_h - h_{ik}\, \frac{1}{V}\, (V^{l}_{m})_h \\
&+ h^{l}_{k}\, \frac{1}{V}\, (V_{im})_h - h^{l}_{m}\, \frac{1}{V}\, (V_{ik})_h - \left( h_{im} h^{l}_{k} - h_{ik} h^{l}_{m} \right) \frac{1}{V^2}\, (V_j V^j)_h
\end{aligned} \tag{56.17}$$

where $(V_{ik})_h$ is the second covariant derivative of $V$ with respect to the
metric $(h)$:

$$(V_{ik})_h = \frac{\partial^2 V}{\partial x_i\, \partial x_k} - (\Gamma^{l}_{ik})_h\, \frac{\partial V}{\partial x_l} \tag{56.18}$$

and $(V^{l}_{k})_h$ is the same derivative with a suffix raised:

$$(V^{l}_{k})_h = h^{li}\, (V_{ik})_h \tag{56.19}$$

If only one index is zero, we have

$$(R^{0}_{i,mk})_g = 0, \quad (R^{l}_{0,mk})_g = 0 \tag{56.20}$$

If two indices are zero we get

$$(R^{0}_{0,mk})_g = 0 \tag{56.21}$$

and also

$$(R^{0}_{i,0k})_g = \frac{1}{V}\, (V_{ik})_h + \frac{2}{V^2}\, V_i V_k - h_{ik}\, \frac{1}{V^2}\, (V_j V^j)_h \tag{56.22}$$

and finally

$$(R^{i}_{0,k0})_g = -c^2 V^2 \left\{ V V^{i}_{k} + 2 V^i V_k - h^{i}_{k}\, (V_j V^j) \right\}_h \tag{56.23}$$

It is now simple to form the contracted curvature tensor which enters
Einstein’s gravitational equations.

Using the formula

$$(R_{ik})_g = (R^{0}_{i,0k})_g + (R^{l}_{i,lk})_g \tag{56.24}$$

we obtain from (56.17) and (56.22)

$$(R_{ik})_g = (R_{ik})_h - h_{ik}\, \frac{1}{V}\, (\Delta V)_h + \frac{2}{V^2}\, V_i V_k + h_{ik}\, \frac{1}{V^2}\, (V_j V^j)_h \tag{56.25}$$

where $(\Delta V)_h$ is the Laplace operator applied to $V$:

$$(\Delta V)_h = (V^{j}_{j})_h = \frac{1}{\sqrt{h}}\, \frac{\partial}{\partial x_i} \left( \sqrt{h}\; h^{ik}\, \frac{\partial V}{\partial x_k} \right) \tag{56.26}$$

On the other hand the formula

$$(R_{00})_g = (R^{k}_{0,k0})_g = -(R^{k}_{0,0k})_g \tag{56.27}$$

gives, using (56.23),

$$(R_{00})_g = -c^2 V^2 \left\{ V (\Delta V)_h - (V_j V^j)_h \right\} \tag{56.28}$$

As for the mixed components of $R_{\mu\nu}$, they vanish as a consequence of (56.20):

$$(R_{i0})_g = 0 \tag{56.29}$$

By virtue of the relations (56.06) between the four- and three-dimensional
metric tensors, equations (56.25), (56.28) and (56.29) lead to the following
expression for the invariant $(R)_g$:

$$(R)_g = -V^2 (R)_h + 2V (\Delta V)_h - 4 (V_j V^j)_h \tag{56.30}$$

We denote by $G_{\mu\nu}$ the conservative Einstein tensor

$$G_{\mu\nu} = (R_{\mu\nu})_g - \tfrac{1}{2}\, g_{\mu\nu} (R)_g \tag{56.31}$$

and by $H_{ik}$ the conservative tensor in the conformal space

$$H_{ik} = (R_{ik})_h - \tfrac{1}{2}\, h_{ik} (R)_h \tag{56.32}$$

The invariant of the latter is

$$H = h^{ik} H_{ik} = -\tfrac{1}{2}\, (R)_h \tag{56.33}$$

as a result of which we have

$$(R_{ik})_h = H_{ik} - h_{ik} H \tag{56.34}$$

For the spatial part of the conservative Einstein tensor we obtain the simple
expression

$$G_{ik} = H_{ik} + \frac{2}{V^2}\, V_i V_k - h_{ik}\, \frac{1}{V^2}\, (V_j V^j)_h \tag{56.35}$$

which is remarkable for the fact that it does not contain second derivatives of
the three-dimensional scalar $V$.

The mixed components of the conservative tensor vanish,

$$G_{i0} = 0 \tag{56.36}$$

while the component $G_{00}$ is given by

$$G_{00} = c^2 V^2 \left\{ -V^2 H - 2V (\Delta V)_h + 3 (V_j V^j)_h \right\} \tag{56.37}$$

We go over to the formulation of the gravitational equations. We have just
noted that the quantities $G_{ik}$ do not contain second derivatives of $V$. On the
other hand, (56.28) shows that $R_{00}$ does not involve second derivatives of the
$h_{ik}$. Therefore, if we write down the gravitational equations in a form solved
with respect to $R_{00}$ and $G_{ik}$ the second derivatives of $V$ and of $h_{ik}$ will be
separated from each other. By virtue of the general equations

$$R_{\mu\nu} = -\kappa \left( T_{\mu\nu} - \tfrac{1}{2}\, g_{\mu\nu} T \right) \tag{56.38}$$

which are the covariant form of (53.03), we have

$$R_{00} = -\kappa \left( T_{00} - \tfrac{1}{2}\, g_{00} T \right) \tag{56.39}$$

where $T$ is the invariant

$$T = (T)_g = (T^0_0 + T^1_1 + T^2_2 + T^3_3)_g \tag{56.40}$$

Hence by virtue of (56.05) we get

$$R_{00} = -\tfrac{1}{2}\, \kappa c^2 V^2 (T^0_0 - T^1_1 - T^2_2 - T^3_3)_g \tag{56.41}$$

and, by using the value (56.28) of $R_{00}$:

$$V (\Delta V)_h - (V_j V^j)_h = \tfrac{1}{2}\, \kappa\, (T^0_0 - T^1_1 - T^2_2 - T^3_3)_g \tag{56.42}$$

The equations for the spatial components are

$$H_{ik} + \frac{2}{V^2}\, V_i V_k - h_{ik}\, \frac{1}{V^2}\, (V_j V^j)_h = -\kappa T_{ik} \tag{56.43}$$

As regards the equations for the mixed components,

$$G_{i0} = -\kappa T_{i0} \tag{56.44}$$

they are satisfied identically, because here the left-hand side is zero by virtue
of (56.36) and the right-hand side is also zero, because the mass current is zero.

The equations so obtained acquire a more pictorial form if one introduces
some new symbols.

We put

$$\mu = \frac{1}{V^2}\, (T^0_0 - T^1_1 - T^2_2 - T^3_3)_g \tag{56.45}$$

The quantity $\mu$ can also be written in the form

$$\mu = c^2 T^{00} + h^{ik} T_{ik} \tag{56.45'}$$

As may be seen by comparing this with the approximate expression (55.02)
the quantity $\mu$ represents a certain mass density; we shall see that it can be
interpreted as the mass density in the conformal space.

Further, we replace $V$ by the quantity $\phi$, according to the formula

$$V = e^{-\phi} \tag{56.46}$$

so that the relation between the space-time metric and the metric in the conformal
space takes on the form

$$ds^2 = c^2 e^{-2\phi}\, dt^2 - e^{2\phi}\, d\sigma^2 \tag{56.47}$$

We have

$$\phi_i = \frac{\partial \phi}{\partial x_i} = -\frac{V_i}{V}; \qquad (\Delta\phi)_h = -\frac{(\Delta V)_h}{V} + \frac{(V_j V^j)_h}{V^2} \tag{56.48}$$

Therefore, the gravitational equations (56.42) and (56.43) may be written as

$$(\Delta\phi)_h = -\tfrac{1}{2}\, \kappa \mu \tag{56.49}$$

$$H_{ik} = -2 \phi_i \phi_k + h_{ik} (\phi_j \phi^j)_h - \kappa T_{ik} \tag{56.50}$$

The first of these equations is essentially Poisson’s equation for the Newtonian
potential $U$. Indeed, the symbol $\Delta$ is a generalization of the Laplace
operator, $\mu$ is the mass density and, by (55.15), the constant $\kappa$ is given by

$$\kappa = \frac{8\pi\gamma}{c^2} \tag{56.51}$$

Therefore if we put

$$\phi = \frac{U}{c^2} \tag{56.52}$$

the quantity $U$ will satisfy the equation

$$(\Delta U)_h = -4\pi\gamma\mu \tag{56.53}$$

which differs but slightly from equation (55.13) for the Newtonian potential.
We can also put

$$\phi_i = \frac{g_i}{c^2} \tag{56.54}$$

where $g_i$ is a component of the gravitational acceleration.
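
The passage from the equation for $V$ to Poisson’s equation can be followed step by step. The sketch below assumes the substitution $V = e^{-\phi}$, the equation $V(\Delta V)_h - (V_j V^j)_h = \tfrac{1}{2}\kappa\,(T^0_0 - T^1_1 - T^2_2 - T^3_3)_g$ for the time component, and the mass density $\mu$ defined as $V^{-2}(T^0_0 - T^1_1 - T^2_2 - T^3_3)_g$:

```latex
\begin{aligned}
V &= e^{-\phi}, \qquad V_i = -V \phi_i, \qquad (V_j V^j)_h = V^2 (\phi_j \phi^j)_h, \\
(\Delta V)_h &= V \left[ (\phi_j \phi^j)_h - (\Delta\phi)_h \right], \\
V (\Delta V)_h - (V_j V^j)_h &= -V^2 (\Delta\phi)_h = \tfrac{1}{2}\, \kappa V^2 \mu, \\
(\Delta\phi)_h &= -\tfrac{1}{2}\, \kappa \mu, \qquad
(\Delta U)_h = c^2 (\Delta\phi)_h = -\tfrac{1}{2}\, \kappa c^2 \mu = -4\pi\gamma\mu .
\end{aligned}
```

The terms quadratic in the first derivatives cancel exactly, which is why the exponential substitution leads to an equation of Poisson’s form without any approximation.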

Let us now clarify the physical meaning of (56.50). Apart from a factor the
terms involving the $\phi_i$ can be interpreted as gravitational stresses. If we put

$$2 \phi_i \phi_k - h_{ik} (\phi_j \phi^j)_h = \kappa T^{*}_{ik} \tag{56.55}$$

we can replace (56.50) by

$$H_{ik} = -\kappa \left( T^{*}_{ik} + T_{ik} \right) \tag{56.56}$$

The three-dimensional divergence of the tensor $T^{*}_{ik}$, understood as referring to
the metric $(h_{ik})$, is

$$(\nabla^k T^{*}_{ik})_h = \frac{2}{\kappa}\, \phi_i\, (\Delta\phi)_h \tag{56.57}$$

and by (56.49) we have

$$(\nabla^k T^{*}_{ik})_h = -\mu \phi_i \tag{56.58}$$

On the other hand, since $H_{ik}$ is a conservative tensor in the conformal space,
its divergence is zero. Therefore, apart from its sign, the divergence of the
gravitational stress tensor is equal to the divergence of the tensor of elastic
and other static stresses $T_{ik}$. Thus we have

$$(\nabla^k T_{ik})_h = \mu \phi_i \tag{56.59}$$

These equations represent a generalization of the usual equations in the statics 
of elastic bodies in a gravitational field. 

The equations for the statics in conformal space, written in the form (56.56) 
stand in analogy to Einstein’s equations in space-time. In both sets of equations 
the left-hand side involves a conservative tensor, while on the right there is a 
stress tensor or its four-dimensional generalization. Here the gravitational 
stresses appear in explicit form only after space has been split off from time 
and after passage to the conformal space. 

The conformal space will be almost Euclidean. Indeed, as is seen from (56.54)
and from the estimates (55.02) for the tensor $T_{ik}$, the right-hand side of (56.56)
will be of the order $g^2/c^4$. This leads to the result that the deviation of the $h_{ik}$
from their Euclidean values will be of the order $U^2/c^4$. This result is in
agreement with the approximate formula (55.45), which was just the basis for
introducing the conformal space.

For empty space, when $T_{ik} = 0$ and $\mu = 0$, equation (56.49) is a consequence
of (56.50). It is easy to see this by equating the divergence of $H_{ik}$ to zero and
using (56.57).

If the mass tensor $T_{\alpha\beta}$ is zero in the whole of space, the only static solution
of Einstein’s equations which has no singular points and which satisfies the
boundary conditions will be the solution corresponding to Euclidean space and
pseudo-Euclidean space-time. This can be shown in the following manner.
In the case of empty space, equation (56.49) gives $(\Delta\phi)_h = 0$. This is an equation
of the elliptic type for $\phi$, which represents a generalization of Laplace’s equation.
The function $\phi$ and its derivatives $\phi_i$ must be everywhere finite and continuous
and at spatial infinity they must tend to zero. But the only solution of Laplace’s
equation that satisfies these conditions is the solution $\phi = 0$. But then the
derivatives $\phi_i$ will also vanish and therefore also expression (56.55). Since in
addition $T_{ik} = 0$ we also have $H_{ik} = 0$. Hence it follows that the curvature
tensor of the conformal space is zero, and the space itself Euclidean (see
Appendix G).

57. Rigorous Solution of the Gravitational Equations for a Single 
Concentrated Mass 

In the case of a concentrated mass a rigorous spherically symmetric solution
of the gravitational equations can be found. As we are dealing with a static
case we can use the results of the foregoing section, and write $ds^2$ as

$$ ds^2 = c^2 V^2\,dt^2 - \frac{1}{V^2}\,d\sigma^2 \tag{57.01} $$

$$ d\sigma^2 = h_{ik}\,dx_i\,dx_k \tag{57.02} $$

If $x_1$, $x_2$ and $x_3$ are harmonic coordinates we can introduce spherical coordinates
related to them by putting

$$ x_1 = r^*\sin\vartheta\cos\varphi, \qquad x_2 = r^*\sin\vartheta\sin\varphi, \qquad x_3 = r^*\cos\vartheta \tag{57.03} $$

The assumption of spherical symmetry implies that the expression for $d\sigma^2$ is
of the form

$$ d\sigma^2 = F^2\,dr^{*2} + \rho^2\,(d\vartheta^2 + \sin^2\vartheta\,d\varphi^2) \tag{57.04} $$

where $F$ and $\rho$ are functions of $r^*$ only. The coefficient $V$ must also be taken
to depend only on $r^*$.

We note first of all that if we put

$$ F\,dr^* = dr \tag{57.05} $$

we can reduce the general expression (57.04) to the case $F = 1$, so that

$$ d\sigma^2 = dr^2 + \rho^2\,(d\vartheta^2 + \sin^2\vartheta\,d\varphi^2) \tag{57.06} $$

It is true that in doing this it may happen that the radius-vector $r$ will be
“non-harmonic”, in the sense that it is not related to the harmonic coordinates
$x_1$, $x_2$ and $x_3$ by equations of the form (57.03). But having formed Laplace’s
equation for the quantities (57.03) with $r^*$ replaced by $r$ one can always go over
afterwards to a “harmonic” radius vector $r^*$.

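For the metric (57.06) we get

$$ h_{rr} = 1, \quad h_{\vartheta\vartheta} = \rho^2, \quad h_{\varphi\varphi} = \rho^2\sin^2\vartheta; \qquad h_{r\vartheta} = 0, \quad h_{r\varphi} = 0, \quad h_{\vartheta\varphi} = 0 \tag{57.07} $$

and therefore

$$ h^{rr} = 1, \quad h^{\vartheta\vartheta} = \frac{1}{\rho^2}, \quad h^{\varphi\varphi} = \frac{1}{\rho^2\sin^2\vartheta}; \qquad h^{r\vartheta} = 0, \quad h^{r\varphi} = 0, \quad h^{\vartheta\varphi} = 0 \tag{57.08} $$

Hence

$$ \sqrt{h} = \rho^2\sin\vartheta \tag{57.09} $$

and the Laplace operator in the conformal space may be written as

$$ \Delta\psi = \frac{1}{\rho^2}\frac{\partial}{\partial r}\left(\rho^2\frac{\partial\psi}{\partial r}\right) + \frac{1}{\rho^2}\,\Delta^*\psi \tag{57.10} $$

where $\Delta^*\psi$ is the Laplace operator on a sphere:

$$ \Delta^*\psi = \frac{1}{\sin\vartheta}\frac{\partial}{\partial\vartheta}\left(\sin\vartheta\,\frac{\partial\psi}{\partial\vartheta}\right) + \frac{1}{\sin^2\vartheta}\frac{\partial^2\psi}{\partial\varphi^2} \tag{57.11} $$
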

As a consequence of (56.10) the harmonic coordinates must satisfy Laplace’s
equation in conformal space. For the quantities (57.03) we have

$$ \Delta^* x_i = -2x_i \tag{57.12} $$

Therefore the condition for harmonic coordinates

$$ \Delta x_i = 0 \tag{57.13} $$

reduces to the form

$$ \frac{d}{dr}\left(\rho^2\,\frac{dr^*}{dr}\right) - 2r^* = 0 \tag{57.14} $$

This is the equation to be used in passing from the initial radius vector $r$ to the
harmonic one, $r^*$.
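
The eigenvalue relation (57.12) lends itself to a quick numerical spot check. The
sketch below is only an illustration: it applies the operator (57.11) by central
differences to the angular factors of the three quantities (57.03). The step size
`h`, the sample point and the function names are choices made for the example
and do not come from the text.

```python
import math

def sphere_laplacian(f, theta, phi, h=1e-3):
    """Apply the operator (57.11) to f(theta, phi) by central differences."""
    # (1/sin theta) d/dtheta ( sin theta * df/dtheta )
    def flux(t):
        return math.sin(t) * (f(t + h, phi) - f(t - h, phi)) / (2 * h)
    term_theta = (flux(theta + h) - flux(theta - h)) / (2 * h) / math.sin(theta)
    # (1/sin^2 theta) d^2 f / dphi^2
    d2_phi = (f(theta, phi + h) - 2 * f(theta, phi) + f(theta, phi - h)) / h**2
    return term_theta + d2_phi / math.sin(theta) ** 2

# Angular factors of x1, x2, x3 in (57.03); the radial factor r* is not
# touched by the angular operator, so it is left out.
angular_parts = [
    lambda t, p: math.sin(t) * math.cos(p),
    lambda t, p: math.sin(t) * math.sin(p),
    lambda t, p: math.cos(t),
]

theta0, phi0 = 0.9, 0.4   # arbitrary sample point away from the poles
ratios = [sphere_laplacian(f, theta0, phi0) / f(theta0, phi0) for f in angular_parts]
```

Each entry of `ratios` should come out close to -2, which is the factor that turns
the general condition (57.13) into the radial equation (57.14).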

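By applying the general formulae to the metric tensor (57.07) and (57.08)
the following expressions for the 18 Christoffel symbols can be derived:

$$ \begin{array}{lll}
\Gamma^r_{rr} = 0, & \Gamma^\vartheta_{rr} = 0, & \Gamma^\varphi_{rr} = 0, \\
\Gamma^r_{\vartheta\vartheta} = -\rho\rho', & \Gamma^\vartheta_{\vartheta\vartheta} = 0, & \Gamma^\varphi_{\vartheta\vartheta} = 0, \\
\Gamma^r_{\varphi\varphi} = -\rho\rho'\sin^2\vartheta, & \Gamma^\vartheta_{\varphi\varphi} = -\sin\vartheta\cos\vartheta, & \Gamma^\varphi_{\varphi\varphi} = 0, \\
\Gamma^r_{r\vartheta} = 0, & \Gamma^\vartheta_{r\vartheta} = \dfrac{\rho'}{\rho}, & \Gamma^\varphi_{r\vartheta} = 0, \\
\Gamma^r_{r\varphi} = 0, & \Gamma^\vartheta_{r\varphi} = 0, & \Gamma^\varphi_{r\varphi} = \dfrac{\rho'}{\rho}, \\
\Gamma^r_{\vartheta\varphi} = 0, & \Gamma^\vartheta_{\vartheta\varphi} = 0, & \Gamma^\varphi_{\vartheta\varphi} = \cot\vartheta
\end{array} \tag{57.15} $$

Here the prime denotes differentiation with respect to $r$. The Christoffel symbols
and all tensor quantities used in this section refer only to the conformal space;
therefore there is no necessity to attach a suffix, as was done in Section 56.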

Using the Christoffel symbols tabulated in (57.15) we form the three-
dimensional fourth rank curvature tensor and then, using the equations

$$ \begin{aligned}
R_{rr} &= R^\vartheta_{r,\vartheta r} + R^\varphi_{r,\varphi r}; & R_{r\vartheta} &= R^\varphi_{r,\varphi\vartheta}; \\
R_{\vartheta\vartheta} &= R^r_{\vartheta,r\vartheta} + R^\varphi_{\vartheta,\varphi\vartheta}; & R_{r\varphi} &= R^\vartheta_{r,\vartheta\varphi}; \\
R_{\varphi\varphi} &= R^r_{\varphi,r\varphi} + R^\vartheta_{\varphi,\vartheta\varphi}; & R_{\vartheta\varphi} &= R^r_{\vartheta,r\varphi}
\end{aligned} \tag{57.16} $$

the second rank curvature tensor in the conformal space. In the equations for
the non-diagonal components we have omitted terms in which the first lower
index is equal to the upper index: owing to the symmetry properties of the
fourth rank curvature tensor these terms vanish if the coordinate system is
orthogonal. In the general formula (56.16) we leave only those terms which are
different from zero and so obtain

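$$ R^\vartheta_{r,\vartheta r} = \frac{\partial\Gamma^\vartheta_{r\vartheta}}{\partial r} + \Gamma^\vartheta_{r\vartheta}\,\Gamma^\vartheta_{r\vartheta} \tag{57.17} $$

and after inserting the values of the Christoffel symbols from (57.15)

$$ R^\vartheta_{r,\vartheta r} = \frac{\rho''}{\rho} \tag{57.18} $$

The calculation shows that $R^\varphi_{r,\varphi r}$ has the same value:

$$ R^\varphi_{r,\varphi r} = \frac{\rho''}{\rho} \tag{57.19} $$

Therefore

$$ R_{rr} = \frac{2\rho''}{\rho} \tag{57.20} $$

Similarly

$$ R^r_{\vartheta,r\vartheta} = -\frac{\partial\Gamma^r_{\vartheta\vartheta}}{\partial r} + \Gamma^\vartheta_{r\vartheta}\,\Gamma^r_{\vartheta\vartheta} \tag{57.21} $$

whence

$$ R^r_{\vartheta,r\vartheta} = \rho\rho'' \tag{57.22} $$

Continuing the calculation, we get

$$ R^\varphi_{\vartheta,\varphi\vartheta} = \frac{\partial\Gamma^\varphi_{\vartheta\varphi}}{\partial\vartheta} + \Gamma^\varphi_{\vartheta\varphi}\,\Gamma^\varphi_{\vartheta\varphi} - \Gamma^r_{\vartheta\vartheta}\,\Gamma^\varphi_{r\varphi} \tag{57.23} $$

whence

$$ R^\varphi_{\vartheta,\varphi\vartheta} = -1 + \rho'^2 \tag{57.24} $$

Inserting (57.22) and (57.24) into (57.16) we find for $R_{\vartheta\vartheta}$ the
expression

$$ R_{\vartheta\vartheta} = \rho\rho'' + \rho'^2 - 1 \tag{57.25} $$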

Similar calculations give

$$ R_{\varphi\varphi} = \sin^2\vartheta\; R_{\vartheta\vartheta} \tag{57.26} $$

as was to be expected for reasons of spherical symmetry. The non-diagonal
elements of the second rank curvature tensor prove to be zero:

$$ R_{r\vartheta} = 0, \qquad R_{r\varphi} = 0, \qquad R_{\vartheta\varphi} = 0 \tag{57.27} $$

The invariant of the three-dimensional curvature tensor can be calculated from
the formula

$$ R = R_{rr} + \frac{2}{\rho^2}\,R_{\vartheta\vartheta} \tag{57.28} $$

and will be given by

$$ R = \frac{2}{\rho^2}\,(2\rho\rho'' + \rho'^2 - 1) \tag{57.29} $$

Applying equation (56.32) we get the following simple expression for the
conservative tensor of the conformal space

$$ H_{rr} = \frac{1 - \rho'^2}{\rho^2}, \quad H_{\vartheta\vartheta} = -\rho\rho'', \quad H_{\varphi\varphi} = -\rho\rho''\sin^2\vartheta; \qquad H_{r\vartheta} = 0, \quad H_{r\varphi} = 0, \quad H_{\vartheta\varphi} = 0 \tag{57.30} $$

We could have obtained these expressions by a somewhat simpler method using
the relation which, in three-dimensional space, connects the covariant fourth
rank tensor and the conservative tensor. This relation is discussed in
Appendix G. In the notation of this section equations (G.13) of the Appendix may
be written in the form



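$$ \begin{aligned}
hH^{rr} &= R_{\vartheta\varphi,\vartheta\varphi}; & hH^{\vartheta\varphi} &= R_{\varphi r,r\vartheta}; \\
hH^{\vartheta\vartheta} &= R_{\varphi r,\varphi r}; & hH^{\varphi r} &= R_{r\vartheta,\vartheta\varphi}; \\
hH^{\varphi\varphi} &= R_{r\vartheta,r\vartheta}; & hH^{r\vartheta} &= R_{\vartheta\varphi,\varphi r}
\end{aligned} \tag{57.31} $$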


It is easy to see that these formulae lead to expressions (57.30) as previously 
found. 

The formulae we have derived allow us to write down Einstein’s gravitational
equations in explicit form. In the previous section we saw that if one
writes $ds^2$ in the form

$$ ds^2 = c^2 e^{-2\phi}\,dt^2 - e^{2\phi}\,d\sigma^2 \tag{57.32} $$

where $d\sigma^2$ has the value (57.04), the gravitational equations appear as

$$ \Delta\phi = -\tfrac{1}{2}\,\kappa\mu \tag{57.33} $$

$$ H_{ik} = -2\phi_i\phi_k + h_{ik}(\phi_j\phi^j) - \kappa T_{ik} \tag{57.34} $$

where the “mass density” $\mu$ is given by (56.44) and (56.45). Going over to
the present case of a mass concentrated at a point and using spherical
coordinates, in which the Laplace operator has the form (57.10) while the
quantities $H_{ik}$ are given by (57.30), we obtain

$$ \Delta\phi = \frac{1}{\rho^2}\frac{d}{dr}\left(\rho^2\phi'\right) = 0 \tag{57.35} $$

$$ H_{rr} = \frac{1 - \rho'^2}{\rho^2} = -\phi'^2; \qquad H_{\vartheta\vartheta} = -\rho\rho'' = \rho^2\phi'^2 \tag{57.36} $$

The equation for $H_{\varphi\varphi}$ differs from that for $H_{\vartheta\vartheta}$ only by the factor $\sin^2\vartheta$, while
the remaining equations of (57.34) are satisfied identically.

Integrating (57.35) we get

$$ \rho^2\phi' = -\alpha \tag{57.37} $$

where $\alpha$ is a constant. Since equation (57.35) is a limiting case of (57.33) with
positive $\mu$, the constant $\alpha$ should be taken positive. Indeed, considering first
(57.33) and putting

$$ 4\pi\int\mu\rho^2\,dr = M \tag{57.38} $$

where the integral is extended over the whole region in which $\mu$ differs from
zero, we see that (57.37) holds everywhere outside this region, with $\alpha$ given by

$$ \alpha = \frac{\gamma M}{c^2} \tag{57.39} $$

Here $\gamma$ is the Newtonian gravitational constant, related to $\kappa$ by (56.51).
Evidently $M$ is the mass of the gravitating body; in going over to the case of a
concentrated mass this quantity, and with it $\alpha$, remain finite and positive. The
dimensions of $\alpha$ are those of a length, which is why it is called the gravitational
radius of the mass.

Inserting the value of $\phi'$ from (57.37) into the first equation of (57.36) we
obtain

$$ \rho'^2 = 1 + \frac{\alpha^2}{\rho^2} \tag{57.40} $$

and taking the square root so as to satisfy the requirement that we have $\rho' \to +1$
at infinity, we get

$$ \rho\rho' = \sqrt{\rho^2 + \alpha^2} \tag{57.41} $$

Differentiating this expression with respect to $r$ we obtain

$$ \rho\rho'' + \rho'^2 = 1 \tag{57.42} $$

which shows that the second equation of (57.36) is also satisfied.

The differential equation (57.41) is easy to solve by quadrature; after setting
the additive constant equal to zero, we get

$$ r = \sqrt{\rho^2 + \alpha^2} \tag{57.43} $$

whence

$$ \rho = \sqrt{r^2 - \alpha^2} \tag{57.44} $$

Thus finally

$$ d\sigma^2 = dr^2 + (r^2 - \alpha^2)(d\vartheta^2 + \sin^2\vartheta\,d\varphi^2) \tag{57.45} $$

By its physical nature $\rho$ must be positive and therefore the range of variation
of $r$ is

$$ r \geq \alpha \tag{57.46} $$
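
The chain (57.37) to (57.44) can be checked numerically. The following sketch is
an illustration under stated assumptions: the gravitational radius is set to 1 in
arbitrary units, derivatives are taken by central differences, and the helper
names are invented for the example. It evaluates the residuals of (57.40), (57.42)
and the second equation of (57.36) for the closed form (57.44).

```python
import math

ALPHA = 1.0  # gravitational radius in arbitrary length units (illustrative)

def rho(r):
    return math.sqrt(r * r - ALPHA * ALPHA)          # closed form (57.44)

def d1(f, x, h=1e-3):
    return (f(x + h) - f(x - h)) / (2 * h)           # first derivative

def d2(f, x, h=1e-3):
    return (f(x + h) - 2 * f(x) + f(x - h)) / (h * h)  # second derivative

def residuals(r):
    p, p1, p2 = rho(r), d1(rho, r), d2(rho, r)
    phi1 = -ALPHA / p**2                             # phi' from (57.37)
    return (
        p1**2 - (1 + ALPHA**2 / p**2),               # (57.40)
        p * p2 + p1**2 - 1,                          # (57.42)
        -p * p2 - p**2 * phi1**2,                    # second equation of (57.36)
    )

worst = max(abs(x) for r in (1.5, 3.0, 10.0) for x in residuals(r))
```

With these step sizes `worst` stays at the level of the finite-difference error,
which is the numerical counterpart of the statement that both equations of
(57.36) are satisfied.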

We must now discuss the harmonic condition. Inserting the value of $\rho$ from
(57.44) into (57.14) we see that the harmonic radius vector $r^*$ satisfies the
equation

$$ \frac{d}{dr}\left[(r^2 - \alpha^2)\,\frac{dr^*}{dr}\right] - 2r^* = 0 \tag{57.47} $$

Evidently this equation has the solution

$$ r^* = r \tag{57.48} $$

It is easy to show that this is the unique solution which for finite $r$ remains
finite in the whole region (57.46) and which at infinity differs from $r$ by not
more than a finite quantity. Therefore the variable $r$ which enters our formulae
is itself the harmonic radius vector and in place of (57.03) we can simply write

$$ x_1 = r\sin\vartheta\cos\varphi, \qquad x_2 = r\sin\vartheta\sin\varphi, \qquad x_3 = r\cos\vartheta \tag{57.49} $$


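It remains to find the quantity $\phi$. Integrating (57.37) and taking into account
the boundary conditions we get from (57.44)

$$ \phi = \frac{1}{2}\ln\frac{r + \alpha}{r - \alpha} \tag{57.50} $$

or

$$ e^{2\phi} = \frac{r + \alpha}{r - \alpha} \tag{57.51} $$

Hence

$$ V^2 = \frac{r - \alpha}{r + \alpha} \tag{57.52} $$

The expression (57.01) or (57.32) for $ds^2$ takes on the form

$$ ds^2 = c^2\,\frac{r - \alpha}{r + \alpha}\,dt^2 - \frac{r + \alpha}{r - \alpha}\,d\sigma^2 \tag{57.53} $$

and, after inserting the value of $d\sigma^2$ from equation (57.45) we get

$$ ds^2 = c^2\,\frac{r - \alpha}{r + \alpha}\,dt^2 - \frac{r + \alpha}{r - \alpha}\,dr^2 - (r + \alpha)^2(d\vartheta^2 + \sin^2\vartheta\,d\varphi^2) \tag{57.54} $$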

The rigorous solution so obtained confirms our conclusion that the conformal
space is almost Euclidean and that $\phi$ is approximately equal to $U/c^2$, where $U$
is the Newtonian potential for which we can put

$$ U = \frac{\gamma M}{r} \tag{57.55} $$

Indeed, equation (57.45) shows that the components of the metric tensor of
$d\sigma^2$ have relative deviations from their Euclidean values of the order

$$ \frac{\alpha^2}{r^2} = \frac{U^2}{c^4} \tag{57.56} $$

and that the error of replacing $\phi$ by $U/c^2$ will be of the same order. As the
estimate given in the following section will show, the quantity (57.56) is
extremely small. One should note that such close agreement with Newton’s
theory is obtained only if harmonic coordinates are used.

The solution of the problem of a concentrated mass in a form equivalent to 
(57.54), but in arbitrary non-harmonic coordinates, was first derived by 
Schwarzschild [18] and is often named after him. 
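
As an illustration of how the harmonic form relates to the form in which the
Schwarzschild solution is commonly quoted, the sketch below compares the
coefficients of (57.54) with those of the usual line element written in a radial
coordinate $R$ and with Schwarzschild radius $2\alpha$. The substitution $R = r + \alpha$
used here is an assumption of the example (it is not written out in the text),
and the function names are hypothetical.

```python
def harmonic_coefficients(r, alpha, c=1.0):
    """Coefficients of dt^2, dr^2 and the angular bracket in (57.54)."""
    g_tt = c**2 * (r - alpha) / (r + alpha)
    g_rr = -(r + alpha) / (r - alpha)
    g_ang = -(r + alpha) ** 2
    return g_tt, g_rr, g_ang

def schwarzschild_coefficients(R, alpha, c=1.0):
    """Same coefficients in the commonly quoted form, Schwarzschild radius 2*alpha."""
    f = 1.0 - 2.0 * alpha / R
    return c**2 * f, -1.0 / f, -R**2

alpha = 1.0
max_diff = 0.0
for r in (1.5, 2.0, 5.0, 50.0):
    a = harmonic_coefficients(r, alpha)
    # Assumed substitution R = r + alpha; dR = dr, so the dr^2 coefficient
    # can be compared directly.
    b = schwarzschild_coefficients(r + alpha, alpha)
    max_diff = max(max_diff, *(abs(x - y) for x, y in zip(a, b)))
```

Under this substitution the two sets of coefficients agree to rounding error, so
the two forms describe the same metric in different radial coordinates.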


58. The Motion of the Perihelion of a Planet 

We have found a rigorous solution of the gravitational equations which may 
now be applied to the investigation of the gravitational fields of the Sun and 
the planets. 

We have

$$ ds^2 = c^2\,\frac{r - \alpha}{r + \alpha}\,dt^2 - \frac{r + \alpha}{r - \alpha}\,dr^2 - (r + \alpha)^2(d\vartheta^2 + \sin^2\vartheta\,d\varphi^2) \tag{58.01} $$

where

$$ \alpha = \frac{\gamma M}{c^2} \tag{58.02} $$

is the gravitational radius of the mass $M$. For the Sun, and even more so for
the planets, the gravitational radius $\alpha$ is much smaller than the geometric radius
$L$, which may be defined as the radius of a sphere of volume equal to that of
the body. We can set up the following table:

              Sun                  Earth                 Moon

α             1.48 km              0.443 cm              0.0053 cm

L             696,000 km           6,370 km              1,738 km

α : L         $2 \times 10^{-6}$   $7 \times 10^{-10}$   $3 \times 10^{-11}$
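
The entries of the table follow from (58.02). The short sketch below recomputes
them as an illustration. The products $\gamma M$ and the radii are assumed present-day
reference values, not figures taken from the text, so small differences from the
tabulated numbers are to be expected.

```python
C = 299_792_458.0            # speed of light, m/s

# Assumed inputs (not from the text): gamma*M in m^3/s^2 and mean radius L in m.
BODIES = {
    "Sun":   (1.32712e20, 6.96e8),
    "Earth": (3.98600e14, 6.370e6),
    "Moon":  (4.9049e12, 1.738e6),
}

def gravitational_radius(gm):
    """alpha = gamma*M / c^2, equation (58.02), in metres."""
    return gm / C**2

table = {
    name: (gravitational_radius(gm), gravitational_radius(gm) / radius)
    for name, (gm, radius) in BODIES.items()
}

for name, (alpha, ratio) in table.items():
    print(f"{name:5s}  alpha = {alpha:.4e} m   alpha/L = {ratio:.1e}")
```

The ratio in the last column is the small parameter that controls how far the
metric departs from its Euclidean values near each body.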

For super-dense stars $\alpha$ is of the same order as for the Sun while $L$, though
smaller than for the Sun, is not more than a hundred times smaller. Because
of the smallness of $\alpha : L$ the metric of space-time is not very far from Euclidean
even near and inside the mass. This is all the more true in the conformal space,
where by equation (57.56) the deviations from Euclidean values will be of the
order $(\alpha/L)^2$. In comparing $ds^2$ with its Galilean value one must remember that
the coefficient of $dt^2$ has the large factor $c^2$, while there is no such factor in the
spatial part. Therefore, when the speeds involved are low, as in planetary
motion, the deviation of $g_{00}$ from a constant value shows up much more
sensitively than do comparable relative deviations in the spatial part of the interval.
Even in Newtonian approximation the Galilean value $g_{00} = c^2$ must be replaced
by the more accurate expression $g_{00} = c^2 - 2U$.

We shall now compare our rigorous solution of the gravitational equations 
with the approximate solution discussed in Section 55. To do this we must 
transform (58.01) from spherical to rectangular coordinates which must be 
harmonic. We can write the spatial part of (58.01) 

$$ dl^2 = \frac{r + \alpha}{r - \alpha}\,dr^2 + (r + \alpha)^2(d\vartheta^2 + \sin^2\vartheta\,d\varphi^2) \tag{58.03} $$

in the form

$$ dl^2 = \frac{r + \alpha}{r - \alpha}\cdot\frac{\alpha^2}{r^2}\,dr^2 + \left(1 + \frac{\alpha}{r}\right)^2(dr^2 + r^2\,d\vartheta^2 + r^2\sin^2\vartheta\,d\varphi^2) \tag{58.04} $$

In this last form it is simple to pass to rectangular coordinates. We get

$$ ds^2 = c^2\,\frac{r - \alpha}{r + \alpha}\,dt^2 - \left(1 + \frac{\alpha}{r}\right)^2(dx_1^2 + dx_2^2 + dx_3^2) - \frac{r + \alpha}{r - \alpha}\cdot\frac{\alpha^2}{r^4}\,(x_1\,dx_1 + x_2\,dx_2 + x_3\,dx_3)^2 \tag{58.05} $$

whence

$$ g_{ik} = -\left(1 + \frac{\alpha}{r}\right)^2\delta_{ik} - \frac{r + \alpha}{r - \alpha}\cdot\frac{\alpha^2}{r^4}\,x_i x_k \tag{58.06} $$

and also

$$ g_{00} = c^2\,\frac{r - \alpha}{r + \alpha}; \qquad g_{0i} = 0 \tag{58.07} $$

If in (58.05) we neglect the square of the ratio $\alpha/r$ we obtain the approximate
expression (55.45) with the Newtonian potential

$$ U = c^2\,\frac{\alpha}{r} = \frac{\gamma M}{r} \tag{58.08} $$

Equations (58.06) and (58.07) lead to the following value of the determinant
$g$ in harmonic coordinates

$$ g = -c^2\left(1 + \frac{\alpha}{r}\right)^4 \tag{58.09} $$

We note that the quantity

$$ \sqrt[4]{-\frac{g}{c^2}} = 1 + \frac{\alpha}{r} \tag{58.10} $$

satisfies a Laplace equation with Euclidean coefficients. In the next chapter
(Section 68) we shall see that quite generally the fourth root of $(-g/c^2)$
approximately satisfies a d’Alembert equation with Euclidean coefficients.

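Using (58.06) and (58.07) again, or else transforming directly the equation

$$ g^{\mu\nu}\frac{\partial\psi}{\partial x_\mu}\frac{\partial\psi}{\partial x_\nu} = \frac{r + \alpha}{r - \alpha}\,\frac{1}{c^2}\left(\frac{\partial\psi}{\partial t}\right)^2 - \frac{r - \alpha}{r + \alpha}\left(\frac{\partial\psi}{\partial r}\right)^2 - \frac{1}{(r + \alpha)^2}\left[\left(\frac{\partial\psi}{\partial\vartheta}\right)^2 + \frac{1}{\sin^2\vartheta}\left(\frac{\partial\psi}{\partial\varphi}\right)^2\right] \tag{58.11} $$

to rectangular coordinates, we obtain the values of the contravariant
components of the metric tensor. We give the expressions for these components
multiplied by $\sqrt{-g}$. They are

$$ \mathfrak{g}^{ik} = \sqrt{-g}\;g^{ik} = -c\,\delta_{ik} + c\,\alpha^2\,\frac{x_i x_k}{r^4} \tag{58.12} $$

and

$$ \mathfrak{g}^{00} = \frac{1}{c}\,\frac{(1 + \alpha/r)^3}{1 - \alpha/r}; \qquad \mathfrak{g}^{0i} = 0 \tag{58.13} $$

These relations allow one to verify readily that our coordinates are really
harmonic and that

$$ \frac{\partial\mathfrak{g}^{ik}}{\partial x_k} = 0 \tag{58.14} $$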

Having found the gravitational potentials for the field of a concentrated 
mass we can determine the motion of a particle in the field with the aid of the 
assumption that it will move along a geodesic. We saw in Section 51 that this 
assumption is in agreement with Newtonian mechanics. A fuller justification 
of this law of the motion of a mass point will be given in Section 63, where it 
will be deduced from the gravitational equations. 

As we know, the equations of a geodesic are obtainable from the variational
principle

$$ \delta\int ds = 0 \tag{58.15} $$

which may also be written in the form

$$ \delta\int L\,dt = 0 \tag{58.16} $$

Here $L$ is the Lagrangian and in our case it is equal to the square root of the
expression

$$ L^2 = c^2\,\frac{r - \alpha}{r + \alpha} - \left(1 + \frac{\alpha}{r}\right)^2(\dot{x}_1^2 + \dot{x}_2^2 + \dot{x}_3^2) - \frac{r + \alpha}{r - \alpha}\cdot\frac{\alpha^2}{r^4}\,(x_1\dot{x}_1 + x_2\dot{x}_2 + x_3\dot{x}_3)^2 \tag{58.17} $$

in which the dots denote differentiation with respect to time. We thus have
before us a simple problem in point mechanics.

In order to solve this problem we note first that the Lagrangian is spherically
symmetric. This means that it does not change when the sets of quantities
$(x_1, x_2, x_3)$ and $(\dot{x}_1, \dot{x}_2, \dot{x}_3)$ undergo the same linear orthogonal transformation.
A consequence of this, as always in mechanics (and see also Section 27), is
that there exist the integrals of the motion

$$ x_2\frac{\partial L}{\partial\dot{x}_3} - x_3\frac{\partial L}{\partial\dot{x}_2} = c_1, \qquad x_3\frac{\partial L}{\partial\dot{x}_1} - x_1\frac{\partial L}{\partial\dot{x}_3} = c_2, \qquad x_1\frac{\partial L}{\partial\dot{x}_2} - x_2\frac{\partial L}{\partial\dot{x}_1} = c_3 \tag{58.18} $$

Consequently the trajectory of the particle lies in the plane

$$ c_1 x_1 + c_2 x_2 + c_3 x_3 = 0 \tag{58.19} $$

It does not restrict the generality of the problem to choose this plane as one
of the coordinate planes, so that

$$ x_3 = 0; \qquad \dot{x}_3 = 0 \tag{58.20} $$

We then have to deal only with motions in the plane, and plane polar coordinates
are most convenient for this. Our previous spherical coordinates reduce to
these if we put

$$ \vartheta = \frac{\pi}{2}; \qquad \dot{\vartheta} = 0 \tag{58.21} $$

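Rewriting the square of the Lagrangian in these polar coordinates we get

$$ L^2 = c^2\,\frac{r - \alpha}{r + \alpha} - \frac{r + \alpha}{r - \alpha}\,\dot{r}^2 - (r + \alpha)^2\,\dot{\varphi}^2 \tag{58.22} $$

The Lagrangian is independent of the time $t$ and also independent of the
angle $\varphi$. This immediately gives us two integrals:

$$ \dot{r}\,\frac{\partial L}{\partial\dot{r}} + \dot{\varphi}\,\frac{\partial L}{\partial\dot{\varphi}} - L = \text{const.} \tag{58.23} $$

and

$$ \frac{\partial L}{\partial\dot{\varphi}} = \text{const.} \tag{58.24} $$

which correspond to the usual integrals of energy and of angular momentum.
Remembering that

$$ L\,dt = ds = c\,d\tau \tag{58.25} $$

where $\tau$ is the proper time, we can rewrite the integrals (58.23) and (58.24) as

$$ \frac{r - \alpha}{r + \alpha}\,\frac{dt}{d\tau} = \varepsilon \tag{58.26} $$

and

$$ (r + \alpha)^2\,\frac{d\varphi}{d\tau} = \mu \tag{58.27} $$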


Here $\varepsilon$ and $\mu$ are constants. The quantity $\mu$ can be interpreted as the angular
momentum of a unit mass. If we put

$$ \varepsilon = 1 + \frac{E_0}{c^2} \tag{58.28} $$

where $E_0$ is a new constant, our equations show that in non-relativistic
approximation

$$ E_0 = \tfrac{1}{2}v^2 - \frac{\gamma M}{r} \tag{58.29} $$

so that $E_0$ is the total energy of a particle of unit mass.

An algebraic consequence of (58.26) and (58.27) is the relation

$$ \left(\frac{dr}{d\tau}\right)^2 = c^2\varepsilon^2 - c^2\,\frac{r - \alpha}{r + \alpha} - \frac{\mu^2 (r - \alpha)}{(r + \alpha)^3} \tag{58.30} $$

It is derived by inserting (58.26) and (58.27) into the identity

$$ c^2\,\frac{r - \alpha}{r + \alpha}\left(\frac{dt}{d\tau}\right)^2 - \frac{r + \alpha}{r - \alpha}\left(\frac{dr}{d\tau}\right)^2 - (r + \alpha)^2\left(\frac{d\varphi}{d\tau}\right)^2 = c^2 \tag{58.31} $$

The foregoing equations give us three first order differential equations for the
quantities $r$, $\varphi$ and $t$ as functions of $\tau$. The solution of these equations evidently
reduces to quadratures. We shall not give the integrals explicitly but will
confine ourselves to the discussion of the trajectory of the particle, i.e. of the
dependence of $r$ on $\varphi$.

Eliminating $d\tau$ from (58.27) and (58.30) we obtain

$$ \left(\frac{dr}{d\varphi}\right)^2 = \frac{c^2\varepsilon^2}{\mu^2}\,(r + \alpha)^4 - \frac{c^2}{\mu^2}\,(r + \alpha)^3(r - \alpha) - (r + \alpha)(r - \alpha) \tag{58.32} $$

We have here on the right-hand side a polynomial of the fourth degree in $r$.
Therefore $\varphi$ will be expressible in terms of $r$ as an elliptic integral of the first
kind and conversely $r$ will be an elliptic function of $\varphi$. The real period of this
elliptic function will differ somewhat from $2\pi$ and therefore the orbit will not
be a closed one. The polynomial on the right-hand side of (58.32) obviously
has the one negative root $r = -\alpha$; it also has a small positive root

$$ r_0 \approx \alpha + \frac{8\alpha^3 c^2\varepsilon^2}{\mu^2} \tag{58.33} $$

and two further roots $r_1$ and $r_2$. If $\varepsilon^2 < 1$ both these roots are positive and we
shall always have $r_1 < r < r_2$, i.e. the orbital motion is a finite one. If, on the
other hand, $\varepsilon^2 > 1$, one of the roots, say $r_2$, becomes negative; we then have
$r_1 < r$, and the orbit extends to infinity. If $\varepsilon^2 = 1$ we have $r_2 = \infty$.

If we introduce the variable

$$ u = \frac{1}{r} \tag{58.34} $$

in place of $r$ and write the polynomial in an expanded form, we get

$$ \left(\frac{du}{d\varphi}\right)^2 = \frac{c^2(\varepsilon^2 - 1)}{\mu^2} + \frac{2\alpha c^2}{\mu^2}\,(2\varepsilon^2 - 1)\,u - \left(1 - \frac{6\alpha^2 c^2\varepsilon^2}{\mu^2}\right)u^2 + \frac{2\alpha^3 c^2}{\mu^2}\,(2\varepsilon^2 + 1)\,u^3 + \alpha^2\left(1 + \frac{\alpha^2 c^2(\varepsilon^2 + 1)}{\mu^2}\right)u^4 \tag{58.35} $$


We now estimate the orders of magnitude of the various terms in this expression.
As in Section 55 we introduce a characteristic velocity $q$ and a characteristic
length $l$. Then we have, in orders of magnitude

$$ \varepsilon^2 - 1 \sim \frac{q^2}{c^2}; \qquad \mu \sim lq; \qquad \alpha \sim \frac{q^2}{c^2}\,l; \qquad u \sim \frac{1}{l} $$

From these estimates we can easily see that on the right-hand side of (58.35) the
terms involving the zero, first and second powers of $u$ are of the order of $1/l^2$
whereas the terms containing the third and fourth powers of $u$ are of order
$(q^4/c^4)(1/l^2)$. If, therefore, we merely neglect some very small terms, namely
those of order $q^4/c^4$ (or $\alpha^2/l^2$) compared to unity, we can drop the last two
terms in (58.35) leaving the equation

$$ \left(\frac{du}{d\varphi}\right)^2 = \frac{c^2(\varepsilon^2 - 1)}{\mu^2} + \frac{2\alpha c^2}{\mu^2}\,(2\varepsilon^2 - 1)\,u - \left(1 - \frac{6\alpha^2 c^2\varepsilon^2}{\mu^2}\right)u^2 \tag{58.36} $$


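The roots of this quadratic polynomial will correspond to the two roots referred
to above as $r_1$ and $r_2$. We put

$$ u_1 = \frac{1}{r_1} = \frac{1 + e}{p}; \qquad u_2 = \frac{1}{r_2} = \frac{1 - e}{p} \tag{58.37} $$

where $p$ and $e$ are new constants related to our original constants $\varepsilon$ and $\mu$.
Approximately we have

$$ 1 - \varepsilon^2 = \frac{\alpha}{p}\,(1 - e^2); \qquad \mu^2 = \alpha c^2 p = \gamma M p \tag{58.38} $$

We also put

$$ \nu^2 = 1 - \frac{6\alpha}{p} \tag{58.39} $$

so that, approximately

$$ \nu = 1 - \frac{3\alpha}{p} \tag{58.40} $$

In this notation we can restate equation (58.36) as

$$ \frac{1}{\nu^2}\left(\frac{du}{d\varphi}\right)^2 = \frac{e^2 - 1}{p^2} + \frac{2}{p}\,u - u^2 \tag{58.41} $$

The solution of this equation is

$$ u = \frac{1 + e\cos\nu\varphi}{p} \tag{58.42} $$
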

Here the constant of integration has been so chosen that the largest value of
$u$, or the smallest value of the distance $r$, corresponds to the value $\varphi = 0$. The
expression (58.42) describes well the general nature of the motion. If $\nu$ were
equal to 1 we would have an ellipse, a parabola or a hyperbola of semi-latus
rectum $p$ and eccentricity $e$. We shall discuss the case of an ellipse, $e < 1$.
The radius vector $r$ returns to its original value when the angle $\varphi$ increases
not by $2\pi$ but by the somewhat greater value $2\pi/\nu$. The difference

$$ \Delta\varphi = \frac{2\pi}{\nu} - 2\pi = \frac{6\pi\alpha}{p} \tag{58.43} $$

gives the displacement of perihelion after one period of revolution of the planet.
The orbit of the planet may therefore be described as a precessing ellipse.

One may note that Einstein’s equations of motion of a planet reduce to the 
form of the classical equations of motion of a spherical pendulum; therefore 
the trajectory of the planet has the same form as the trajectory of the bob of 
a pendulum.†
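
To make the passage from the constants $\varepsilon$, $\mu$ of the motion to the orbital
elements $p$, $e$ and the factor $\nu$ concrete, here is a small numerical sketch. It
assumes that the truncated equation has the form
$(du/d\varphi)^2 = a_0 + a_1 u - a_2 u^2$ with
$a_0 = c^2(\varepsilon^2 - 1)/\mu^2$, $a_1 = 2\alpha c^2(2\varepsilon^2 - 1)/\mu^2$,
$a_2 = 1 - 6\alpha^2 c^2\varepsilon^2/\mu^2$, solves the quadratic exactly, and leaves the
comparison with the approximate relations $\mu^2 = \alpha c^2 p$, $\nu = 1 - 3\alpha/p$ and
$\Delta\varphi = 6\pi\alpha/p$ to the reader or to a test. Units with $\alpha = c = 1$ and the
sample values of $\varepsilon$ and $\mu$ are assumptions of the example.

```python
import math

def orbit_constants(eps, mu, alpha=1.0, c=1.0):
    """Return (p, e, nu) from the quadratic a0 + a1*u - a2*u**2."""
    a0 = c**2 * (eps**2 - 1) / mu**2                   # constant term
    a1 = 2 * alpha * c**2 * (2 * eps**2 - 1) / mu**2   # coefficient of u
    a2 = 1 - 6 * alpha**2 * c**2 * eps**2 / mu**2      # minus the coefficient of u^2
    root = math.sqrt(a1**2 + 4 * a2 * a0)
    u1 = (a1 + root) / (2 * a2)     # largest u (perihelion), equals (1 + e)/p
    u2 = (a1 - root) / (2 * a2)     # smallest u (aphelion), equals (1 - e)/p
    p = 2 / (u1 + u2)               # semi-latus rectum
    e = (u1 - u2) / (u1 + u2)       # eccentricity
    return p, e, math.sqrt(a2)

# Illustrative bound orbit (eps^2 < 1); the numbers are chosen for the example.
eps, mu = math.sqrt(1 - 0.5e-6), 1.0e3
p, e, nu = orbit_constants(eps, mu)
advance = 2 * math.pi / nu - 2 * math.pi   # perihelion shift per revolution
```

For this choice the exact roots give a semi-latus rectum close to $10^6\,\alpha$ and an
eccentricity near 0.7, and the shift per revolution agrees with $6\pi\alpha/p$ up to
terms of higher order in $\alpha/p$.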

For all planets the numerical value of $\Delta\varphi$ is exceedingly small. For the
Earth, for instance, we can take $p = 1.5 \times 10^8$ km and, using the value
$\alpha = 1.5$ km, we get

$$ \Delta\varphi = 6\pi \times 10^{-8} = 0.038'' $$

in one revolution, i.e. one year, or in other words 3.8" per century. For Mercury
the advance of perihelion per century is much greater, namely 43", because
in the first place it is considerably closer to the Sun, having an orbit of a radius
0.39 times that of the Earth, and also because its period of revolution is
shorter: 420 revolutions per century.
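
The arithmetic of this paragraph can be reproduced in a few lines. The sketch
below evaluates $\Delta\varphi = 6\pi\alpha/p$ for the Earth with the rounded values quoted
above, and for Mercury with orbital elements that are assumed standard values
(semi-major axis, eccentricity and period are not given in the text).

```python
import math

ARCSEC_PER_RAD = 180 * 3600 / math.pi
ALPHA_SUN_KM = 1.48      # gravitational radius of the Sun quoted in this section

def advance_per_revolution(p_km, alpha_km=ALPHA_SUN_KM):
    """Perihelion displacement 6*pi*alpha/p, in seconds of arc per revolution."""
    return 6 * math.pi * alpha_km / p_km * ARCSEC_PER_RAD

# Earth, with the rounded values used in the text (p = 1.5e8 km, alpha = 1.5 km).
earth = advance_per_revolution(1.5e8, alpha_km=1.5)

# Mercury: assumed orbital elements, not taken from the text.
a_km, ecc, period_days = 5.791e7, 0.2056, 87.97
p_mercury = a_km * (1 - ecc**2)              # semi-latus rectum of the ellipse
revs_per_century = 36525 / period_days       # Julian century in days
mercury_century = advance_per_revolution(p_mercury) * revs_per_century
```

With these inputs `earth` reproduces the figure of about 0.038" per revolution and
`mercury_century` comes out close to 43" per century.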

In comparing this theory with experiment it is important to remember 
that a motion of the perihelion is caused not only by the Einstein effect but also 
by the perturbing influence of other planets, by their deviation from spherical 
shape and so on. Such corrections are many times greater than those arising 
from Einstein’s theory. One must also bear in mind that the observation of 
the position of the perihelion is the more difficult the smaller the eccentricity 
$e$, i.e. the closer the orbit is to circular shape. For $e = 0$ the position of
perihelion is indeterminate. Nevertheless, astronomical methods of observation
are so accurate and computational possibilities in celestial mechanics so great
that in the case of Mercury a residual advance of the perihelion, unexplained
by Newtonian theory, is known to an accuracy of one second per century.
It is 42.6" per century, in excellent agreement with the theory. For the Earth
this residual advance of the perihelion is known with rather less accuracy to
be about 4"; this is also in full agreement with the Einstein value.


59. The Deflection of a Light Ray Passing Near the Sun 

We now consider another observable consequence of Einstein’s theory of 
gravitation, the deflection of a ray of light passing close to the Sun. 

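Before integrating the equation for a light ray let us first form a general
picture of the propagation of light in the gravitational field of the Sun.

We shall write the law for the propagation of the front of a light wave in
the form

$$ \mathfrak{g}^{\mu\nu}\,\frac{\partial\omega}{\partial x_\mu}\frac{\partial\omega}{\partial x_\nu} = 0 \tag{59.01} $$

where we have multiplied our previous equation by $\sqrt{-g}$. Using equations
(58.10) and (58.11) we have, in spherical coordinates

$$ \frac{(1 + \alpha/r)^3}{1 - \alpha/r}\,\frac{1}{c^2}\left(\frac{\partial\omega}{\partial t}\right)^2 - \left(1 - \frac{\alpha^2}{r^2}\right)\left(\frac{\partial\omega}{\partial r}\right)^2 - \frac{1}{r^2}\left[\left(\frac{\partial\omega}{\partial\vartheta}\right)^2 + \frac{1}{\sin^2\vartheta}\left(\frac{\partial\omega}{\partial\varphi}\right)^2\right] = 0 \tag{59.02} $$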

† See the diagram in A. N. Krylov’s book [19].

If we neglect terms of the order $\alpha^2/r^2$ in comparison with unity the equation
for $\omega$ reduces to

$$ \frac{n^2}{c^2}\left(\frac{\partial\omega}{\partial t}\right)^2 - (\operatorname{grad}\omega)^2 = 0 \tag{59.03} $$

where

$$ n^2 = 1 + \frac{4\alpha}{r}; \qquad n = 1 + \frac{2\alpha}{r} \tag{59.04} $$


This equation can be formally interpreted as the law of propagation of light
in Euclidean space, but in a medium of refractive index $n$.

We note that (59.03) could also have been obtained from the approximate
expression

$$ ds^2 = (c^2 - 2U)\,dt^2 - \left(1 + \frac{2U}{c^2}\right)(dx_1^2 + dx_2^2 + dx_3^2) \tag{59.05} $$

(see (55.45)), the effective refractive index then being

$$ n = 1 + \frac{2U}{c^2} \tag{59.06} $$

On the other hand, the approximate form (51.10) for $ds^2$ which was applicable
for slow motions would lead to a form for the effective refractive index in which
the coefficient of $U$ is half the above. As we shall see below, experiment confirms
the expression (59.06) derived from the quadratic form (59.05).

The fictitious medium of refractive index (59.06) is optically more dense 
in the vicinity of the Sun than it is far away from it. Therefore, light waves 
will bend around the Sun and light rays passing near the Sun will not be 
straight. We shall see that they will be describable as branches of hyperbolae 
with the Sun at one focus. The angle between the asymptotes of such a hyper¬ 
bola will determine the observed deflection of the ray. 

As we know from Section 38, a light ray represents a null geodesic and its
Hamilton-Jacobi equation is the equation of wave front propagation. (See
also Appendix F.) Since in the previous section we have already solved the
problem of a geodesic of finite length we can obtain the equation of a ray from
the results of Section 58 by means of a limiting process. Let us recall these
results. We found the integrals of motion

$$ \frac{r - \alpha}{r + \alpha}\,\frac{dt}{d\tau} = \varepsilon \tag{58.26} $$

and

$$ (r + \alpha)^2\,\frac{d\varphi}{d\tau} = \mu \tag{58.27} $$

and also the equation for the trajectory

$$ \left(\frac{dr}{d\varphi}\right)^2 = \frac{c^2\varepsilon^2}{\mu^2}\,(r + \alpha)^4 - \frac{c^2}{\mu^2}\,(r + \alpha)^3(r - \alpha) - (r + \alpha)(r - \alpha) \tag{58.32} $$

Since for a light ray $d\tau = 0$ the constants in (58.26) and (58.27) will be infinite,
but their ratio

$$ \frac{(r + \alpha)^3}{r - \alpha}\,\frac{d\varphi}{dt} = \frac{\mu}{\varepsilon} \tag{59.07} $$

will be finite. Consequently, equation (58.32) will have the form

$$ \left(\frac{dr}{d\varphi}\right)^2 = \frac{c^2\varepsilon^2}{\mu^2}\,(r + \alpha)^4 - (r + \alpha)(r - \alpha) \tag{59.08} $$

In place of the constant $\mu/\varepsilon$ it is convenient to introduce another constant $b$,
of the dimensions of a length, by the relation

$$ \lim\frac{\mu}{\varepsilon} = cb \tag{59.09} $$

Then (59.08) becomes

$$ \left(\frac{dr}{d\varphi}\right)^2 = \frac{1}{b^2}\,(r + \alpha)^4 - (r + \alpha)(r - \alpha) \tag{59.10} $$

and the corresponding relation for $u = 1/r$ is then

$$ \left(\frac{du}{d\varphi}\right)^2 = \frac{1}{b^2}\,(1 + \alpha u)^4 - u^2 + \alpha^2 u^4 \tag{59.11} $$


If $r$ and $\varphi$ are interpreted as polar coordinates in a Euclidean plane the constant
$b$ just introduced will be the “impact parameter”, i.e. the length of the
perpendicular from the origin to the asymptote of the trajectory. Indeed, elementary
relations in plane Euclidean geometry give the following expression in polar
coordinates for the perpendicular distance from the origin to the tangent to a
curve

$$ d = \frac{r}{\sqrt{1 + (dr/d\varphi)^2/r^2}} = \frac{1}{\sqrt{u^2 + (du/d\varphi)^2}} \tag{59.12} $$

The asymptote is the tangent at infinity ($u = 0$) and, therefore, the impact
parameter is the value of $d$ for $u = 0$. By (59.11) and (59.12) this is equal to $b$.

Let us return to the equation for the path of the ray in the form (59.11).
If $u$ is taken to be of order $1/b$ the terms involving $u^3$ and $u^4$ in this equation
will be at least of order $\alpha^2/b^2$ in comparison with the main terms. Dropping these
small terms we obtain an equation which can be solved by elementary means.
The solution has the form

$$ u = \frac{2\alpha}{b^2} + \frac{1}{b}\cos\varphi \tag{59.13} $$

The constant of integration has been chosen so that the greatest value of
$u$, and therefore the least of $r$, corresponds to $\varphi = 0$. We find approximately

$$ r_{\min} = b - 2\alpha \tag{59.14} $$

In the Euclidean $(r, \varphi)$ plane equation (59.13) describes a hyperbola. The
directions of the asymptotes of this hyperbola are determined from the
condition $u = 0$ which gives

$$ \cos\varphi = -\frac{2\alpha}{b} \tag{59.15} $$

Here the right-hand side is a small negative quantity, so that the limiting
values of the angle will be

$$ \varphi = \frac{\pi}{2} + \delta; \qquad \varphi = -\frac{\pi}{2} - \delta \tag{59.16} $$

where the small positive quantity $\delta$ can be put equal to

$$ \delta = \frac{2\alpha}{b} \tag{59.17} $$

The angle by which the ray is deflected is the angle between the asymptotes
of the hyperbola, which is

$$ 2\delta = \frac{4\alpha}{b} \tag{59.18} $$

The displacement of the observed position of a star whose light passes close
to the Sun can be observed during a total eclipse of the Sun. If $b$ is put equal
to the Sun’s radius the angle of deflection $2\delta$ takes on the value

$$ 2\delta = 1.75'' \tag{59.19} $$

which is in good agreement with observed values. An evaluation of the results
of observations during the eclipse of 1952 gives a value of 1.70". This result
allows one to assert quite definitely that the observations agree with the
expression (59.05) for $ds^2$ and not with (51.10) which predicts 0.87", half of the
observed value.
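
The numerical value quoted for the deflection follows directly from $2\delta = 4\alpha/b$.
A minimal sketch, assuming the solar values $\alpha = 1.48$ km and $b = 696{,}000$ km
given in the table of Section 58:

```python
import math

ARCSEC_PER_RAD = 180 * 3600 / math.pi

def deflection_arcsec(alpha, b):
    """Total deflection 2*delta = 4*alpha/b, converted to seconds of arc."""
    return 4 * alpha / b * ARCSEC_PER_RAD

ALPHA_SUN_KM, SUN_RADIUS_KM = 1.48, 696_000.0   # values from the table in Section 58
full = deflection_arcsec(ALPHA_SUN_KM, SUN_RADIUS_KM)
half = full / 2   # result if the coefficient of U in the refractive index were halved
```

Here `full` is the deflection for a ray grazing the solar limb and `half` is the
value that the slow-motion form of the metric would give.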

In conclusion we make a remark concerning the definition of a straight line 
in the theory of gravitation. How should a straight line be defined : as a light 
ray or as a straight line in that Euclidean space in which the harmonic
coordinates $x_1$, $x_2$ and $x_3$ serve as cartesian coordinates? It seems to us that the
only correct definition is the latter. We have, in fact, used it when we said that 
a light ray near the Sun has the form of the hyperbola (59.13). In the cases here 
considered the harmonic coordinates are deeply related to the nature of space 
and time and the definition of a straight line should be based on them. The 
consideration that a straight line defined as a light ray is more immediately 
observable, is of no significance. What is decisive in a definition is not immediate 
observability but a correspondence with Nature, even if this correspondence 
has to be established by indirect reasoning. 

60. A Variational Principle for the Equations of Gravitation 

In the equations of gravitation

$$ R^{\mu\nu} - \tfrac{1}{2}\,g^{\mu\nu}R = -\frac{8\pi\gamma}{c^2}\,T^{\mu\nu} \tag{60.01} $$

we have on the left-hand side the conservative tensor and on the right the
mass tensor. In Section 48 we saw that the expression for the mass tensor
can be obtained by varying the action integral with respect to the components
of the metric tensor. Thus in the case of the equations of hydrodynamics the
action integral could be written in the form

$$ S = \int(\rho^* c^2 + \rho^*\Pi)\,\sqrt{-g}\,(dx) \tag{60.02} $$

Here $\rho^*$ is the invariant density of that part of the rest-mass which does not
change in the motion and which satisfies the equation of continuity (48.29),
and $\Pi$ is the elastic potential energy of the fluid per unit mass as defined by
(48.30). If the integral $S$ is varied with respect to the components of the metric
tensor the result is


$$ \delta_g S = \frac{c^2}{2}\int T^{\mu\nu}\,\delta g_{\mu\nu}\,\sqrt{-g}\,(dx) \tag{60.03} $$


where $T^{\mu\nu}$ is the hydrodynamical mass tensor defined by (48.39). In the case 
of electrodynamics the action integral has the form (47.37); its main term is a 
function of the rest-mass and is the same as in the hydrodynamical case. The 
variation of the electrodynamic action integral with respect to the $g_{\mu\nu}$ again 
has the form (60.03) where now $T^{\mu\nu}$ is the electrodynamic mass tensor as given 
by (46.22) and (46.32). As regards variations of the action integral with respect 
to other quantities entering it, such as displacements and field components, 
we saw that they give the equations of motion of the material system in 
question. 

We shall assume that we have a system such that for it equation (60.03) is 
valid with a suitably chosen S. Then the mass tensor which enters the right-hand 
side of the gravitational equations can be represented as the coefficient of $\delta g_{\mu\nu}$ 
in the expression for the variation of a certain integral. We shall now attempt 
to represent the left-hand side of the gravitational equations, i.e. the conservative 
tensor, in a similar manner. 

To do this we consider the expression 


$$I = \int R\,\sqrt{-g}\,(dx) \tag{60.04}$$

where R is the curvature scalar and we perform the variation of this integral. 

In evaluating the variation of the integral we use the fact that the variations 
of the Christoffel symbols form a tensor, although the Christoffel symbols 
themselves do not. This can be proved as follows. According to (42.04) the 
transformation law for Christoffel symbols has the form 


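$$\frac{\partial^2 x'_\sigma}{\partial x_\mu\,\partial x_\nu} = \Gamma^\rho_{\mu\nu}\,\frac{\partial x'_\sigma}{\partial x_\rho} - \Gamma'^{\sigma}_{\alpha\beta}\,\frac{\partial x'_\alpha}{\partial x_\mu}\,\frac{\partial x'_\beta}{\partial x_\nu} \tag{60.05}$$

This relation is valid for the quantities $\Gamma^\rho_{\mu\nu}$ related to the given metric $(g_{\alpha\beta})$. 
We perform a variation of the metric while retaining the connection between 
old and new coordinates. To the metric $(g_{\alpha\beta} + \delta g_{\alpha\beta})$ there will correspond the 
quantities $\Gamma^\rho_{\mu\nu} + \delta\Gamma^\rho_{\mu\nu}$ where 

$$\delta\Gamma^\rho_{\mu\nu}\,\frac{\partial x'_\sigma}{\partial x_\rho} = (\delta\Gamma^{\sigma}_{\alpha\beta})'\,\frac{\partial x'_\alpha}{\partial x_\mu}\,\frac{\partial x'_\beta}{\partial x_\nu} \tag{60.06}$$

This proves that the variations $\delta\Gamma^\rho_{\mu\nu}$ form a mixed third rank tensor. 

In forming the variation of the scalar $R$ we start from the expressions 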


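$$R^{\rho}_{\mu,\alpha\nu} = \frac{\partial \Gamma^\rho_{\mu\alpha}}{\partial x_\nu} - \frac{\partial \Gamma^\rho_{\mu\nu}}{\partial x_\alpha} + \Gamma^\sigma_{\mu\alpha}\Gamma^\rho_{\sigma\nu} - \Gamma^\sigma_{\mu\nu}\Gamma^\rho_{\sigma\alpha} \tag{60.07}$$

and 

$$R_{\mu\nu} = \frac{\partial \Gamma^\alpha_{\mu\alpha}}{\partial x_\nu} - \frac{\partial \Gamma^\alpha_{\mu\nu}}{\partial x_\alpha} + \Gamma^\beta_{\mu\alpha}\Gamma^\alpha_{\beta\nu} - \Gamma^\beta_{\mu\nu}\Gamma^\alpha_{\beta\alpha} \tag{60.08}$$

which give the mixed components of the fourth rank curvature tensor and 
the second rank curvature tensor. In a coordinate system that is geodesic at a 
given point the variation of $R_{\mu\nu}$ is 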

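$$\delta R_{\mu\nu} = \frac{\partial}{\partial x_\nu}\left(\delta\Gamma^\alpha_{\mu\alpha}\right) - \frac{\partial}{\partial x_\alpha}\left(\delta\Gamma^\alpha_{\mu\nu}\right) \tag{60.09}$$

because at this point all the Christoffel symbols vanish. Hence with a change 
of indices in the first term on the right 

$$g^{\mu\nu}\,\delta R_{\mu\nu} = g^{\mu\alpha}\,\frac{\partial}{\partial x_\alpha}\left(\delta\Gamma^\nu_{\mu\nu}\right) - g^{\mu\nu}\,\frac{\partial}{\partial x_\alpha}\left(\delta\Gamma^\alpha_{\mu\nu}\right) \tag{60.10}$$

We now introduce the vector 

$$w^\alpha = g^{\mu\alpha}\,\delta\Gamma^\nu_{\mu\nu} - g^{\mu\nu}\,\delta\Gamma^\alpha_{\mu\nu} \tag{60.11}$$

It is easy to see that (60.10) is equivalent to the equation 

$$g^{\mu\nu}\,\delta R_{\mu\nu} = \frac{1}{\sqrt{-g}}\,\frac{\partial}{\partial x_\alpha}\left(\sqrt{-g}\;w^\alpha\right) \tag{60.12}$$

since in a geodesic system one can take the quantities $\mathfrak{g}^{\mu\nu} = \sqrt{-g}\cdot g^{\mu\nu}$ outside 
the sign of differentiation; but both sides of the equation (60.12) are 
scalars so that if it holds in a geodesic system it must hold generally. 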

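We assume that not only the $\delta g_{\mu\nu}$ but also their derivatives, and therefore 
the $\delta\Gamma^\alpha_{\mu\nu}$, vanish on the boundaries of the region of integration and that the 
vector $w^\alpha$ also vanishes. Then, writing the integral $I$ as 

$$I = \int g^{\mu\nu}R_{\mu\nu}\,\sqrt{-g}\,(dx) = \int \mathfrak{g}^{\mu\nu}R_{\mu\nu}\,(dx) \tag{60.13}$$

we obtain for its variation the expression 

$$\delta I = \int R_{\mu\nu}\,\delta\mathfrak{g}^{\mu\nu}\,(dx) \tag{60.14}$$

since by (60.12) we have 

$$\int g^{\mu\nu}\,\delta R_{\mu\nu}\,\sqrt{-g}\,(dx) = 0 \tag{60.15}$$

Using the relations 

$$\delta\sqrt{-g} = \tfrac{1}{2}\sqrt{-g}\;g^{\alpha\beta}\,\delta g_{\alpha\beta} \tag{60.16}$$

and 

$$\delta g^{\mu\nu} = -g^{\mu\alpha}g^{\nu\beta}\,\delta g_{\alpha\beta} \tag{60.17}$$

we get 

$$\delta\mathfrak{g}^{\mu\nu} = \delta\{\sqrt{-g}\cdot g^{\mu\nu}\} = \sqrt{-g}\,\left(\tfrac{1}{2}\,g^{\mu\nu}g^{\alpha\beta} - g^{\mu\alpha}g^{\nu\beta}\right)\delta g_{\alpha\beta} \tag{60.18}$$

Inserting this expression into (60.14) and performing the summations over 
$\mu$ and $\nu$ we obtain 

$$\delta I = \int \left(\tfrac{1}{2}\,g^{\alpha\beta}R - R^{\alpha\beta}\right)\delta g_{\alpha\beta}\,\sqrt{-g}\,(dx) \tag{60.19}$$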


Our immediate object has been achieved: we have represented the conservative 
tensor as the coefficient of the variation with respect to the metric tensor. 
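
The algebraic identities (60.16) to (60.18) hold for a metric of any dimension, so they can be spot-checked numerically on a small example. The sketch below is only an illustration: it uses a hypothetical two-dimensional indefinite metric and an arbitrary small symmetric variation, and compares the actual change of $\sqrt{-g}\,g^{\mu\nu}$ with the first-order expression on the right of (60.18).

```python
import math

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def inv2(m):
    d = det2(m)
    return [[m[1][1] / d, -m[0][1] / d],
            [-m[1][0] / d, m[0][0] / d]]

def densitized_inverse(m):
    """sqrt(-g) * g^{mu nu} for a 2x2 metric with negative determinant."""
    s = math.sqrt(-det2(m))
    gi = inv2(m)
    return [[s * gi[i][j] for j in range(2)] for i in range(2)]

# Hypothetical covariant metric g_{mu nu} (det < 0) and a small symmetric variation.
g = [[1.3, 0.2], [0.2, -0.9]]
eps = 1e-6
dg = [[0.7 * eps, -0.4 * eps], [-0.4 * eps, 0.5 * eps]]
g_var = [[g[i][j] + dg[i][j] for j in range(2)] for i in range(2)]

# Actual change of sqrt(-g) g^{mu nu}.
before, after = densitized_inverse(g), densitized_inverse(g_var)
lhs = [[after[i][j] - before[i][j] for j in range(2)] for i in range(2)]

# First-order expression: sqrt(-g) (1/2 g^{mu nu} g^{ab} - g^{mu a} g^{nu b}) dg_{ab}.
gi = inv2(g)
s = math.sqrt(-det2(g))
trace = sum(gi[a][b] * dg[a][b] for a in range(2) for b in range(2))
rhs = [[s * (0.5 * gi[m][n] * trace
             - sum(gi[m][a] * gi[n][b] * dg[a][b]
                   for a in range(2) for b in range(2)))
        for n in range(2)] for m in range(2)]

max_err = max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))
size = max(abs(rhs[i][j]) for i in range(2) for j in range(2))
print(max_err, size)
```

The residual `max_err` is of second order in the variation, while `size` is of first order, which is what the identity requires.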

Combining the equations (60.03) and (60.19) we can conclude that the 
variation of the expression 


$$W = S - \frac{c^4}{16\pi\gamma}\,I \tag{60.20}$$

with respect to the components of the metric tensor is 

$$\delta_g W = \int \left\{ \frac{c^2}{2}\,T^{\mu\nu} + \frac{c^4}{16\pi\gamma}\left(R^{\mu\nu} - \tfrac{1}{2}\,g^{\mu\nu}R\right)\right\}\delta g_{\mu\nu}\,\sqrt{-g}\,(dx) \tag{60.21}$$

This variation vanishes by virtue of the gravitational equations (60.01) and these 
equations in turn may be obtained from the variational principle $\delta W = 0$, if 
the variation is with respect to the components of the metric tensor considered 
as independent. (We recall that we have already discussed the variation of 
the action integral with respect to the $g_{\mu\nu}$ in Section 48, but that there the 
$\delta g_{\mu\nu}$ were not completely arbitrary because they corresponded to an infinitesimal 
change of the coordinates and were expressible in terms of four functions $\eta^\nu$.) 

If $W$ is varied with respect to the other functions that enter the action 
integral S we obtain the equations of motion and field equations for these 
functions. 

Thus the equations (60.01), the field equations of gravitation, are now unified 
with the other field equations (for the velocity field of matter, the electromagnetic 
field, etc.) in one general variational principle. 

The variational principle may be given a somewhat different form by taking, 
in place of the invariant integral 

$$I = \int R\,\sqrt{-g}\,(dx) \tag{60.04}$$

another integral which is not invariant, but which does not contain any second 
derivatives. 

In Appendix D we derive the relation 

$$R = \Box y - \Gamma - L \tag{60.22}$$

where $\Box$ is the d’Alembert operator acting on $y = \log\sqrt{-g}$, see (D.51), $\Gamma$ is 
given by (D.43) and the quantity $L$ may, according to (D.54), be written as 

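$$L = g^{\mu\nu}\left(\Gamma^\beta_{\mu\alpha}\Gamma^\alpha_{\nu\beta} - \Gamma^\alpha_{\mu\nu}\Gamma^\beta_{\alpha\beta}\right) \tag{60.23}$$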

as well as in many other forms given in Appendix D. 

Using the definitions (D.59) and (D.61) one can also replace (60.22) by 

$$R = \frac{1}{\sqrt{-g}}\,\frac{\partial}{\partial x_\alpha}\left\{\sqrt{-g}\,(y^\alpha - \Gamma^\alpha)\right\} - L \tag{60.24}$$

The integral $I$ will differ from the expression 

$$I^* = -\int L\,\sqrt{-g}\,(dx) \tag{60.25}$$

by the quantity 

$$I' = \int \frac{\partial}{\partial x_\alpha}\left\{\sqrt{-g}\,(y^\alpha - \Gamma^\alpha)\right\}(dx) \tag{60.26}$$

which reduces to a surface integral and has a vanishing variation. Therefore 
the variations of $I$ and $I^*$ are equal: 

$$\delta I = \delta\int R\,\sqrt{-g}\,(dx) = -\,\delta\int L\,\sqrt{-g}\,(dx) = \delta I^* \tag{60.27}$$

Since in a variational principle only the variations of the integrals, not the 
integrals themselves, are of importance, we can replace $I$ by $I^*$ in (60.20). 
The quantity $\delta I^*$ (which is equal to $\delta I$) is independent of the coordinate system 
in spite of the fact that $I^*$ itself may depend on the coordinate system. The 
purpose of replacing $I$ by $I^*$ is the removal of second derivatives from the 
integrand. The relation 

$$\delta\int L\,\sqrt{-g}\,(dx) = \int \left(R^{\alpha\beta} - \tfrac{1}{2}\,g^{\alpha\beta}R\right)\delta g_{\alpha\beta}\,\sqrt{-g}\,(dx) \tag{60.28}$$

which follows from (60.19) and (60.27) can naturally also be derived directly, 
though the necessary calculations are somewhat involved. 

61. On the Local Equivalence of Fields of Acceleration and of Gravitation 

As we have seen, the law of equality of inertial and gravitational mass, which 
is a rational dynamic generalization of Galileo’s law (Section 50), forms an 
intuitively satisfactory basis for the theory of gravitation. We therefore used it 
as the foundation of our argument which, after some generalizing assumptions, 
led us to Einstein’s gravitational equations (Sections 51 and 52). When Einstein 
himself derived his gravitational equations, he argued rather differently. In 
contrast to our treatment based on the dynamic formulation of Galileo’s law 
Einstein took as his basis a certain kinematical consequence of that law, which 
he called the “ Principle of Equivalence ”. 

In the theory of gravitation the Principle of Equivalence is understood to be 
the statement that in some sense a field of acceleration is equivalent to a 
gravitational field. The equivalence amounts to the following. By introducing 
a suitable system of coordinates (which is usually interpreted as an accelerated 
frame of reference) one can so transform the equations of motion of a mass 
point in a gravitational field that in this new system they will have the appearance 
of equations of motion of a free mass point. Thus a gravitational field can, 
so to speak, be replaced, or rather imitated, by a field of acceleration. Owing 
to the equality of inertial and gravitational mass such a transformation is the 
same for any value of the mass of the particle. But it will succeed in its purpose 
only in an infinitesimal region of space, i.e. it will be strictly local. 

In the general case the transformation described corresponds mathematically 
to passing to a locally geodesic system of coordinates (see Section 42). As was 
shown by Fermi, it is possible to introduce coordinate systems which are locally 
geodesic not only at one point but also along a time-like world line (see [14]). 

Thus the principle of equivalence is related to the law of equality of inertial 
and gravitational mass, but is not identical with it. The latter is of a general, 
non-local character while the equivalence of a field of acceleration and a field 
of gravitation exists only locally, i.e. it refers only to a single point in space 
(more precisely to a spatial neighbourhood of the points on a world line, which 
is of the nature of a time axis). 

The Principle of Equivalence played an important role during the period 
before Einstein created his theory of gravitation. We shall now describe and 
analyse an argument given by Einstein at that time. 

Einstein illustrated his “ equivalence hypothesis ” with the example of a 
laboratory inside a falling lift. All objects within such a lift appear bereft of 
their weight, they all fall together with the lift, with the same acceleration, 
so that their relative accelerations vanish even when they are not fixed to the 
walls of the lift. We have, according to Einstein, two frames of reference, 
one inertial, or almost inertial, fixed to the Earth and another accelerated, 
fixed to the lift. In the first, inertial frame, there exists a gravitational field— 
in the second, accelerated frame, it is absent. Thus, according to Einstein, an 
acceleration can replace gravitation or at least a uniform field of gravitation. 
Einstein develops this idea further. He proposes to consider both the accelerated 
and the unaccelerated frames to be physically completely equivalent and points 
out that from such a point of view the concepts of inertial frame and absolute 
acceleration cease to have any meaning. 

Let us analyse this view of Einstein’s in more detail. First of all the question 
arises : What is an accelerated frame of reference and how can it be realized 
physically? In the lift example the “ frame of reference ” was, so to speak, 
identified with a certain box, the lift cage. But we have learnt in Section 32 
that even when gravitation is not taken into account the abstraction of an 
absolutely rigid body is not acceptable; when accelerated all bodies will 
experience deformations which will be different for different bodies. Therefore 
a box or a rigid scaffolding of the kind we discussed in Section 11 when dealing 
with inertial frames are of no use as models for an accelerated frame of reference. 
Thus in Einstein’s reasoning the basic concept of a frame of reference 
in accelerated motion remains undefined. This difficulty could be avoided 
only by imposing limitations on the magnitude of the acceleration and on the 
size of the region of space to be considered. For instance, one could demand 
the following: the accelerations allowed are to be so small that in the region 
of space considered the deformations resulting from them may be neglected 
and the notion of a rigid body may be used. In that case the approximate 
nature of Einstein’s argument becomes obvious. 

Further, Einstein himself stresses that not every gravitational field can be 
replaced by acceleration; for this to be possible the gravitational field must be 
uniform. This also imposes limitations on the spatial dimensions of the region 
in which gravitational and accelerated fields may be approximately equivalent. 
It is, for instance, impossible to “remove” the gravitational field around the 
terrestrial globe; to do it one would have to introduce some absurdity such as 
a frame of reference in “ accelerated contraction ”. 

Einstein also used his Principle of Equivalence in a non-local manner but 
his attempt, in a paper published in 1911, to investigate in this way the propagation 
of light near a heavy body gave a deflection of a light ray of only half the 
amount resulting from his theory of gravitation (see Section 59). This is connected 
with the fact that the Principle of Equivalence cannot possibly lead 
to the correct form (51.11) for $ds^2$ but at best only the expression (51.10) which 
is valid for slow motion. Thus, in a non-local interpretation, the approximate 
equivalence of fields of gravitation and of acceleration is also limited. As already 
mentioned this equivalence exists only for weak uniform fields and slow motions. 

Einstein gave to his principle of equivalence a widened interpretation by 
taking it to imply the indistinguishability of fields of gravitation and acceleration 
and asserting that from the point of view of this principle it is as impermissible 
to speak of absolute acceleration as it is to speak of absolute velocity. 
To this Einstein related his “General Principle of Relativity”, which we discussed 
in Section 49*; he used the latter to justify the demand that his equations 
should be generally covariant. However, to us such an extended interpretation 
seems inconsistent. The essence of the principle of equivalence may be seen in 
the fact that it allows the introduction of an appropriate locally geodesic 
(“ freely falling ”) frame of reference, by use of which a uniform Galilean space 
can be defined in the infinitesimal. However this in no way justifies conclusions 
about the equivalence or indistinguishability of fields of acceleration and of gravitation 
in finite regions of space. To illustrate the nature of the error committed 
in drawing such conclusions let us examine a mathematical example, which 
incidentally has a direct bearing on the essence of the present question. All 
functions that have bounded second derivatives behave as linear functions in 
the infinitesimal. However, this by no means allows one to conclude that all 
such functions are indistinguishable in a finite region. But an analogous conclusion, 
namely that fields of acceleration and of gravitation are completely 
indistinguishable, was drawn by Einstein, on the basis of their local equivalence 
alone. 

Such a conclusion even contradicts Einstein’s theory of gravitation itself. 
Indeed, if full equivalence between fields of acceleration and of gravitation did 
exist, a theory built on the idea of equivalence would be purely kinematical, 
which is by no means the case for Einstein’s theory of gravitation. As regards 
the “ General Principle of Relativity ”, we have already pointed out in Section 
49* that such a physical principle is impossible, and also unnecessary as a basis 
for the requirement of general covariance, which is the purely logical requirement 
of consistency for a theory in which the coordinate system is not fixed. 

Thus, although the principle of equivalence holds in a narrow sense (approximately 
and locally) it does not hold in a wider sense. Although the effects of 
acceleration and of gravitation may be indistinguishable “in the small”, i.e. 
locally, they are undoubtedly distinguishable “in the large”, i.e. when the 
boundary conditions to be imposed on gravitational fields are taken into account. 
The gravitational potential that is obtained if a uniformly accelerated frame 
of reference is introduced is a linear function of the coordinates and therefore 
does not satisfy the conditions at infinity, where it should tend to zero. In a 
rotating frame of reference the potential of the centrifugal force increases with 
the square of the distance from the axis of rotation, and in addition there are 
Coriolis forces. By these characteristics it is possible to detect immediately 
that the “ gravitational field ” in such frames of reference is fictitious. 

We shall now discuss the example of a uniformly accelerated frame of 
reference in somewhat greater detail, taking the theory of relativity into 
account. In doing this we set aside the question of how an accelerated frame 
might be realized and interpret the term “ frame of reference ” more formally 
in the sense of “coordinate system”. In this sense passing to a frame moving 
with acceleration will mean subjecting the coordinates to a transformation 
which contains time non-linearly. 

We assume that a true gravitational field is not present so that the square 
of the infinitesimal interval has the form 

$$ds^2 = c^2\,dt'^2 - (dx'^2 + dy'^2 + dz'^2) \tag{61.01}$$

where $x'$, $y'$ and $z'$ are cartesian coordinates and $t'$ the time in some inertial 
frame of reference. We perform the coordinate transformation† 


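$$\begin{aligned}
x' &= x\cosh\frac{gt}{c} + \frac{c^2}{g}\left(\cosh\frac{gt}{c} - 1\right) \\
y' &= y; \qquad z' = z \\
t' &= \frac{c}{g}\sinh\frac{gt}{c} + \frac{x}{c}\sinh\frac{gt}{c}
\end{aligned} \tag{61.02}$$

where $g$ is a constant of the dimensions of acceleration. Under the condition 

$$\frac{gt}{c} \ll 1 \tag{61.03}$$

the previous equations may be written as 

$$x' = x + \tfrac{1}{2}gt^2; \qquad y' = y, \quad z' = z; \qquad t' = t \tag{61.04}$$

Inserting (61.02) into (61.01) we obtain 

$$ds^2 = c^2\left(1 + \frac{gx}{c^2}\right)^2 dt^2 - (dx^2 + dy^2 + dz^2) \tag{61.05}$$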


The question arises : can this expression be interpreted as the square of the 
interval in some inertial frame of reference in which there is a gravitational 
field? The answer to this question is also an answer to the question whether, 
and to what extent, the Principle of Equivalence is correct. 
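
The passage from (61.01) to (61.05) can be checked numerically. The sketch below is an illustration under stated assumptions: it works in units with $c = 1$, picks a hypothetical value of $g$, restricts attention to the $(t, x)$ plane, and obtains $dt'$ and $dx'$ by central differences of the transformation (61.02). It also compares the coefficient of $dt^2$ with the weak-field value $c^2 - 2U$ for $U = -gx$.

```python
import math

C = 1.0       # units with c = 1, chosen for numerical convenience (assumption)
G_ACC = 0.5   # hypothetical value of the constant g in these units

def to_inertial(t, x):
    """Transformation (61.02) restricted to the (t, x) plane: returns (t', x')."""
    a = G_ACC * t / C
    x_prime = x * math.cosh(a) + (C**2 / G_ACC) * (math.cosh(a) - 1.0)
    t_prime = (C / G_ACC) * math.sinh(a) + (x / C) * math.sinh(a)
    return t_prime, x_prime

def interval_inertial(t, x, dt, dx):
    """c^2 dt'^2 - dx'^2 from (61.01), with dt', dx' taken as central differences."""
    t_lo, x_lo = to_inertial(t - dt / 2, x - dx / 2)
    t_hi, x_hi = to_inertial(t + dt / 2, x + dx / 2)
    return C**2 * (t_hi - t_lo) ** 2 - (x_hi - x_lo) ** 2

def interval_accelerated(x, dt, dx):
    """Right-hand side of (61.05) with dy = dz = 0."""
    return C**2 * (1.0 + G_ACC * x / C**2) ** 2 * dt**2 - dx**2

t, x, dt, dx = 0.8, 1.2, 1e-4, 1e-4
lhs = interval_inertial(t, x, dt, dx)
rhs = interval_accelerated(x, dt, dx)
rel_err = abs(lhs - rhs) / abs(rhs)

# Weak-field comparison: coefficient of dt^2 in (61.05) versus c^2 - 2U with U = -g x.
x_small = 1e-3 * C**2 / G_ACC   # chosen so that g x / c^2 = 1e-3
exact_coeff = C**2 * (1.0 + G_ACC * x_small / C**2) ** 2
newton_coeff = C**2 - 2.0 * (-G_ACC * x_small)
coeff_gap = abs(exact_coeff - newton_coeff) / C**2

print(rel_err, coeff_gap)
```

The first number should be small (limited by the finite-difference step), and the second is of order $(gx/c^2)^2$, which is the term neglected when the two coefficients of $dt^2$ are identified.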

To find the answer we compare (61.05) with the approximate expression 
given by the theory of gravitation. 

$$ds^2 = (c^2 - 2U)\,dt^2 - \left(1 + \frac{2U}{c^2}\right)(dx^2 + dy^2 + dz^2) \tag{61.06}$$

where $U$ is the Newtonian potential of a true gravitational field. 

Under the condition 

$$|gx| \ll c^2 \tag{61.07}$$

the coefficients of $dt^2$ are approximately equal if we take a gravitational potential 
given by 

$$U = -gx \tag{61.08}$$

As for the coefficient of the spatial part of $ds^2$, it will not differ significantly 
from unity for intervals for which the quantity 

$$v^2 = \left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2 + \left(\frac{dz}{dt}\right)^2 \tag{61.09}$$

satisfies the inequality 

$$v^2 \ll c^2 \tag{61.10}$$


The value (61.08) for the gravitational potential does indeed lead to uniformly 
accelerated motion in Newtonian mechanics. For vanishing initial velocity 
we have constant values of y' and z' and approximately 

$$x + \tfrac{1}{2}gt^2 = \text{const.} \tag{61.11}$$

which describes uniformly accelerated motion in the coordinates $(x, t)$. 

† This transformation was given by Møller [20]. 

We have made a comparison between two expressions for the square of the 
interval which has shown that a frame of reference in accelerated motion in 
the absence of gravitation does indeed show an analogy with an inertial frame 
in the presence of gravitation. However, the same comparison indicates that the 
analogy is far from complete, so that there can be no question of full equivalence 
or indistinguishability of inertial and gravitational fields. This becomes particularly 
clear if one considers the expression (61.05) “in the large”, i.e. 
throughout the whole of space. In the first place the coefficient of $dt^2$ does not 
satisfy the boundary conditions, since it tends to infinity with $x$; in the second 
place that coefficient, and with it the speed of light, become zero on the surface 
$x = -c^2/g$; this is inadmissible. 

An even more obvious violation of the boundary conditions for the metric 
tensor occurs if the transformation (35.47) is used. In Newtonian mechanics it 
has the significance of introducing a rotating coordinate system. This transformation 
leads to the expression (35.48) for $ds^2$. Here the metric tensor not 
only fails to satisfy the boundary conditions but, at large distances from the 
axis of rotation, also violates the inequalities established in Section 35. The 
impossibility of interpreting the metric tensor in (35.48) as some gravitational 
field (i.e. in the spirit of the “ equivalence hypothesis ”) is clear even from a 
local point of view, owing to the presence of Coriolis forces. 

The example just discussed confirms completely the conclusion stated above 
that the “ equivalence ” between acceleration and gravitation exists only in a 
limited region of space, and only for weak and uniform fields and slow motions 
(equation (61.08) together with the inequalities (61.07) and (61.10)). But if 
one considers the whole of space, true gravitational fields can be distinguished 
from fictitious ones caused by acceleration. In Newtonian theory this can be 
done by using the boundary condition for the Newtonian potential. In Einstein’s 
theory the question of distinguishing true from fictitious gravitational 
fields is most simply solved if harmonic coordinates are used. Then the components 
of the metric tensor must satisfy both the harmonic conditions (53.13) 
and the boundary conditions discussed in Section 54. As will be shown in Section 
93, harmonic coordinates can be defined uniquely apart from a Lorentz transformation. 
Arbitrary coordinate transformations by which fictitious gravitational 
fields are introduced violate the harmonic conditions and the boundary 
conditions. Therefore one can take it that the introduction of harmonic coordinates 
excludes all fictitious gravitational fields. Thus, if one assumes the 
quadratic form (61.05) to be given, the passage to harmonic coordinates will 
consist in the transformation (61.02), accompanied possibly by a Lorentz 
transformation. As the result of such a transition we come back to the quadratic 
form (61.01), the form of which indicates the absence of true gravitational fields. 

In the discussion of this section we did not use general tensor analysis. 
Its application to (61.05) would have shown that the fourth rank curvature 
tensor vanishes and that, therefore, true gravitational fields are indeed absent. 
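
A sketch of that calculation, restricted to the $(t, x)$ part of (61.05), is given below. The abbreviation $N(x)$ and the particular sign convention used for the curvature component are assumptions made here for brevity; the overall sign depends on convention, but the vanishing does not. The indices 0 and 1 refer to $t$ and $x$; the coordinates $y$ and $z$ enter the metric trivially and contribute no further components.

```latex
% Sketch only: N(x) is notation introduced for this example.
\begin{aligned}
ds^2 &= N(x)^2\,dt^2 - dx^2 - dy^2 - dz^2, \qquad N(x) = c\left(1 + \frac{gx}{c^2}\right),\\
\Gamma^{0}_{01} &= \Gamma^{0}_{10} = \frac{N'}{N}, \qquad \Gamma^{1}_{00} = N N'
   \quad \text{(all other Christoffel symbols vanish)},\\
R^{1}{}_{010} &= \frac{\partial \Gamma^{1}_{00}}{\partial x} - \Gamma^{1}_{00}\,\Gamma^{0}_{01}
   = \left(N'^2 + N N''\right) - N'^2 = N N'',\\
N'' &= 0 \;\Longrightarrow\; R^{1}{}_{010} = 0 .
\end{aligned}
```

Because $N$ is linear in $x$, the only candidate for a non-zero curvature component vanishes identically, in agreement with the statement above.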

Let us return to the question of utilizing the principle of equivalence to derive 
the gravitational equations. We have made it clear that it is inconsistent to 
interpret this principle in a wider sense as a “General Principle of Relativity”. 
But this does not exclude the use of the principle of equivalence in a more restricted 
sense, within the limits in which it is valid approximately. In particular 
the analogy we have discussed between an accelerated frame of reference in the 
absence of a gravitational field and an inertial frame in the presence of such a 
field may prove helpful, because the possibility of transforming the expression 
(61.01) into the form (61.05) gives us an indication of the fact that the Newtonian 
potential must enter the theory precisely as the coefficient of $dt^2$. However, an 
approach based on this idea to the formulation of a gravitational theory seems 
to us to be unsatisfactory because of its inherent limitations (viz. the local 
nature of the principle of equivalence and the assumption that the field is 
uniform). Another disadvantage of this approach is the necessity of using the 
ill-defined concept of a frame of reference in accelerated motion. Our approach 
is free from these disadvantages, being based on the direct application of the 
law of equality of inertial and gravitational masses. It is well to remember 
that in the derivation of Einstein’s gravitational equations we made no use at all 
of any frame of reference in accelerated motion and therefore also no use of the 
principle of equivalence. As for this latter principle, to the extent that it is 
valid it may be obtained subsequently as a consequence of the other assumptions. 
Thus it is implied by the hypothesis that space-time has Riemannian 
character, its mathematical expression being the possibility of introducing a 
locally geodesic coordinate system along a time-like world line. 

We stressed the approximate character of the principle of equivalence. But 
from the point of view of Einstein’s theory of gravitation the law of equality 
of inertial and gravitational mass also is of approximate character, since the very 
concepts of inertial and gravitational mass are approximate. These concepts 
are applicable to the extent to which Newton’s laws of motion and law of 
gravitation are valid and to the extent that it is possible to define any mass as 
a quantity characterizing a particular body independently of its position and 
of the motion of other bodies. In Einstein’s theory of gravitation this is possible 
only approximately, because there the law of motion of material bodies is of a 
more complicated nature. Nevertheless, one can affirm that the law of equality 
of inertial and gravitational mass agrees fully with Einstein’s theory of gravitation, 
because this law follows from the theory with as much precision as can 
in general be given to its formulation. 

On the other hand Einstein’s theory of gravitation does not reduce to a formulation 
of the law of equality of the masses; it embraces essential new physical 
principles. There are two such principles. The first is already contained in the 
ordinary theory of relativity: the unification of space and time into a single 
four-dimensional manifold with an indefinite metric. This principle is related 
to the limiting nature of the velocity of light and, closely connected with this, 
to the more precise specification of what is meant by a sequence of events in 
time and also by cause and effect (Section 12). The second principle establishes 
the unity of metric and gravitation; it is the very essence of Einstein’s gravitational 
theory. 

It is just these two principles, and not any widening of the concept of relativity, 
supposedly possible as a result of the local equivalence of acceleration 
and gravitation, which form the basis of Einstein’s theory of gravitation. 

62. On the Clock Paradox 

To conclude this chapter we consider the so-called clock paradox. We dwell 
on it not because it is an especially important or abstruse question but because 
the paradox has been very widely discussed in the literature and some not 
wholly satisfactory resolutions have been given. 

The paradox arises from using incorrectly the concept of relative motion and 
from ignoring the distinction between inertial and non-inertial frames of 
reference. It consists of the following. 

We imagine a clock A to be at rest in some inertial frame of reference. Let 
a clock B move past A with a constant velocity v and then, after it has covered 
a certain distance, let it be subjected to a negative acceleration which reverses 
its speed so that it again passes A, this time with velocity — v. At the two 
instants at which B is travelling past A the readings of the two clocks can be 
compared directly without the intermediary of light signals. Such a comparison 
will show that clock B is running slow compared to A, or at least this is 
the result of applying the expression for proper time given in Section 14. 

Now, motion is relative. Therefore clock B could also be thought of as at 
rest. Then the other clock, A, must be looked upon as first receding from B 
with uniform speed and then approaching again so that the same equations of 
Section 14 appear to predict also that A should be slow compared to B, in 
contradiction with the previous result. 

The difference in the readings of two clocks situated at the same point in 
space is an absolute and objective fact. It depends neither on the frame of 
reference nor on the method of description. Therefore, as long as they are 
correct, all methods of description must lead to the same result. The contradiction 
obtained shows that an error was made somewhere in the discussion. 

It is not difficult to see that the error consists in failing to take into consideration 
the fact that in the imagined experiment the clocks A and B were 
subjected to different physical conditions. Clock A did not receive acceleration 
and experienced no impulses whereas clock B was accelerated; it experienced 
an impulse which reversed the direction of its velocity. In other words the 
error consisted in assuming that the two frames of reference connected with 
clock A and clock B respectively were to be treated as equivalent, which they 
are not; only the frame of A is inertial. 

This is the qualitative resolution of the paradox. It has the significance that 
the equation for proper time given in Section 14 

$$\tau = \int_0^t \sqrt{1 - \frac{v^2}{c^2}}\;dt \tag{62.01}$$

is valid only in an inertial frame of reference, but not in an accelerated one. 
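
As a minimal numerical sketch, (62.01) can be evaluated in the inertial frame of A for both clocks. The example assumes an idealized instantaneous reversal of B's velocity and a clock whose rate is unaffected by the impulse itself; the speed, the duration and the units ($c = 1$) are hypothetical values chosen for illustration.

```python
import math

def proper_time(velocity, t_end, c=1.0, steps=100_000):
    """Midpoint-rule evaluation of (62.01): integral of sqrt(1 - v(t)^2 / c^2) dt."""
    h = t_end / steps
    return math.fsum(
        math.sqrt(1.0 - (velocity((k + 0.5) * h) / c) ** 2) * h
        for k in range(steps)
    )

T = 10.0   # time between the two meetings, measured in A's inertial frame
V0 = 0.6   # hypothetical speed of B in units of c

# Clock A stays at rest; clock B moves out with +V0 and back with -V0.
tau_a = proper_time(lambda t: 0.0, T)
tau_b = proper_time(lambda t: V0 if t < T / 2 else -V0, T)

print(tau_a, tau_b)
```

Both integrals are taken in the same inertial frame, that of A, so no appeal to the frame of B is needed; B's reading comes out smaller by the factor $\sqrt{1 - v^2/c^2}$.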

By what should this equation be replaced in the case of an arbitrary frame 
of reference? The possibility of such paradoxical results will in any case be 
excluded if we take as our formula for proper time the invariant expression 

$$\tau = \frac{1}{c}\int_0^t \sqrt{g_{00} + 2g_{0i}\,\dot{x}_i + g_{ik}\,\dot{x}_i\dot{x}_k}\;dt \tag{62.02}$$

Indeed, now the reading of clock A is obtained by evaluating the integral in 
(62.02) along the path of clock A just as the reading for B is given by the corresponding 
evaluation along its path. Both integrals so obtained are invariant 
with respect to any change of variables of integration, i.e. for any transformation 
of space and time coordinates. The use of different frames of reference, 
inertial or non-inertial, is equivalent to evaluating one and the same integral 
with different variables. This fact obviously excludes the possibility of any 
discrepancy in the results. 

There remains, however, the question whether expression (62.01) or its 
generalization to a generally covariant form (62.02) is or is not valid for arbitrarily 
accelerated motion of the clock. We have already remarked in Section 
14 that the physical interpretation of (62.01) as the proper time follows from 
the general principles of the theory of relativity only in the case of a constant 
velocity $v$. In that case it amounts to measuring a localized process connected 
with a moving point in its “proper” frame of reference (the indication of a 
moving clock). If however the velocity $v$ is variable such an interpretation of 
$\tau$ represents a hypothesis, which, though not in contradiction with the theory 
of relativity, certainly does not follow from it. Indeed, we pointed out at the 
end of Section 14 that without entering into the details of the clock’s construction 
no theory is capable of predicting how a clock will behave in conditions 
in which it suffers impacts and acceleration. Therefore one should not expect 
that a formula exists for the reading of any clock in arbitrary motion, in the 
way (62.01) gives this reading for uniform motion in a straight line. 

In such a situation one can adopt either of the following two hypotheses. 
One can either introduce the concept of a clock which is ideally strong and insensitive 
to impact (Section 14) and assume that the physical meaning 
of the proper time determined by equation (62.01) or (62.02) is just that it is 
the reading of such an idealized clock. Or else one can abandon the idea of a 
universal formula for the reading of a clock moving with an arbitrary acceleration 
and introduce instead the weaker hypothesis that whenever the acceleration 
is caused by a gravitational field the reading of a clock in free motion in this field 
is expressed by (62.02), i.e. by an integral which becomes a maximum just when 
the equations of free motion are satisfied. This hypothesis has in its favour the 
fact that the gravitational field, and only it, is capable of penetrating into any 
material body and of acting on all its parts in proportion to their masses. One 
should also bear in mind that in the free motion of a body in the gravitational 
field its four-dimensional acceleration is zero (this quantity will be defined more 
precisely in Section 63). By virtue of this the second hypothesis amounts to 
saying that the condition for proper time to be given by (62.02) is vanishing of 
the four-dimensional acceleration. It is thus formally the same condition as in 
the absence of a gravitational field. 

We now calculate approximately the reading of the accelerated clock B on
the basis of the above-mentioned special hypothesis. We use the Newtonian
approximation to the expression for $ds^2$ and perform the calculation in the
inertial frame in which clock A is at rest. According to (51.07) we have

$$\tau = \int \left\{ 1 - \frac{1}{c^2}\left(\tfrac12 v^2 + U\right) \right\} dt \tag{62.03}$$




Let $U_0$ be the constant value of $U$ at the position of A. If B passes A for the
first time at $t = 0$ and then, when returning, again at $t = T$ the difference in
the readings of A between the passages is

$$\tau_A = \int_0^T \left(1 - \frac{U_0}{c^2}\right) dt = \left(1 - \frac{U_0}{c^2}\right) T \tag{62.04}$$

Here the influence of the gravitational potential on the rate of A is taken into
account (see equation (51.14)). For the difference of the two readings of B
over the same length of time we get

$$\tau_B = \int_0^T \left\{ 1 - \frac{1}{c^2}\left(\tfrac12 v^2 + U\right) \right\} dt \tag{62.05}$$

Therefore B will be slow compared to A by the amount

$$\tau_A - \tau_B = \frac{1}{c^2} \int_0^T \left(\tfrac12 v^2 + U - U_0\right) dt \tag{62.06}$$

where the integral must be taken along the trajectory of clock B. By the law
of conservation of energy we have

$$\tfrac12 v^2 - U = \tfrac12 v_0^2 - U_0 \tag{62.07}$$

where $v_0$ is the value of the speed of B at the point where $U = U_0$. The relation
(62.07) enables us to write (62.06) in various different ways, for example, as

$$\tau_A - \tau_B = \frac{1}{c^2} \int_0^T \left(\tfrac12 v_0^2 + 2U - 2U_0\right) dt \tag{62.08}$$


We now assume for the potential $U$ that in the region considered it has the form

$$U = U_0 \quad (\text{for } x < x_1); \qquad U = U_0 + g(x_1 - x) \quad (\text{for } x > x_1) \tag{62.09}$$


Let clock A always be at the origin of coordinates and let B move along the
$x$-axis. The $x$ coordinate of B will then be

$$x = v_0 t \quad (\text{for } t < t_1) \tag{62.10}$$

$$x = x_1 + v_0 (t - t_1) - \tfrac12 g (t - t_1)^2 \quad (\text{for } t_1 < t < t_2) \tag{62.11}$$

and

$$x = x_1 - v_0 (t - t_2) \quad (\text{for } t > t_2) \tag{62.12}$$

Here $t_1$ and $t_2$ are the times at which B passes through the point $x = x_1$ on its
motion to and fro. These times are

$$t_1 = \frac{x_1}{v_0}; \qquad t_2 = t_1 + \frac{2 v_0}{g} = t_1 + t^* \tag{62.13}$$

where

$$t^* = \frac{2 v_0}{g} \tag{62.14}$$

is the duration of the uniformly accelerated motion. The time of return of
B to $x = 0$ is, by (62.12)

$$T = t_2 + \frac{x_1}{v_0} = t_2 + t_1 \tag{62.15}$$

In calculating the integral it is most convenient to use (62.08) because in it the
quantity $v_0^2$ is constant and the difference $2U - 2U_0$ differs from zero only
in the region $t_1 < t < t_2$ where (62.11) is valid. An elementary calculation
using (62.14) gives

$$\tau_A - \tau_B = \frac{v_0^2}{c^2} \left(\tfrac12 T - \tfrac23 t^*\right) \tag{62.16}$$

On the other hand, if we had unjustifiably used equation (62.01) for the
accelerated motion defined by (62.10) to (62.12) we would have obtained
(62.16) with a factor $\tfrac13$ (instead of $\tfrac23$) in the $t^*$ term in brackets. Thus in the
factor multiplying $v_0^2/c^2$ there is only the term $\tfrac12 T$ if acceleration is neglected;
the terms $\tfrac12 T - \tfrac13 t^*$ give the kinematical effect including acceleration, and the
full physical effect (kinematical and gravitational) is given by the terms
$\tfrac12 T - \tfrac23 t^*$, as in formula (62.16).

It follows from (62.16) that if the duration of the accelerated motion is $\tfrac34 T$, the
clock B is not observed to be slow at all, and for $t^* = T$ it is even fast. It should
incidentally not be forgotten that (62.16) is not general but only derived under
rather special assumptions concerning the nature of the motion.
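
The result (62.16) can be checked numerically. The following is a minimal sketch, not part of the original derivation: it integrates (62.08) along the trajectory (62.10) to (62.12) with a simple midpoint rule. The function name `clock_lag`, the units with $c = 1$, and the sample values of $v_0$, $g$ and $x_1$ are assumptions made only for illustration.

```python
def clock_lag(v0, g, x1, c=1.0, n=100_000):
    """Midpoint-rule estimate of tau_A - tau_B for the motion (62.10)-(62.12).

    Returns (full, kinematic, T, t_star): 'full' integrates
    (v^2/2 + U - U0)/c^2 as in (62.06); 'kinematic' keeps only the v^2/2 term.
    """
    t1 = x1 / v0             # first passage through x = x1, eq. (62.13)
    t_star = 2.0 * v0 / g    # duration of the accelerated motion, eq. (62.14)
    t2 = t1 + t_star
    T = t2 + t1              # time of return to x = 0, eq. (62.15)
    dt = T / n
    full = 0.0
    kinematic = 0.0
    for k in range(n):
        t = (k + 0.5) * dt
        if t1 < t < t2:
            s = t - t1
            v = v0 - g * s
            x = x1 + v0 * s - 0.5 * g * s * s   # eq. (62.11)
            dU = g * (x1 - x)                   # U - U0 from eq. (62.09)
        else:
            v = v0 if t <= t1 else -v0
            dU = 0.0
        full += (0.5 * v * v + dU) * dt
        kinematic += 0.5 * v * v * dt
    return full / c**2, kinematic / c**2, T, t_star


if __name__ == "__main__":
    v0, g, x1 = 0.1, 0.05, 1.0   # illustrative values only
    full, kin, T, ts = clock_lag(v0, g, x1)
    print("numerical (62.16):", full, " closed form:", v0**2 * (T / 2 - 2 * ts / 3))
    print("kinematic only   :", kin, " closed form:", v0**2 * (T / 2 - ts / 3))
```

The two printed pairs compare the numerical integrals with the closed forms containing the factors $\tfrac23$ and $\tfrac13$ respectively.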

We have performed the calculation in the inertial frame of reference. To
repeat it in a frame connected with clock B would have no sense for we should
only be evaluating the same integrals in terms of different variables.

In conclusion let us note that in clarifying the clock paradox we deliberately
avoided using the principle of equivalence. We did so because the approximate
nature of that principle might give reason to doubt whether any resolution of
the paradox based on it is exhaustive. It was our intention to avoid all such
doubts.

THE LAW OF GRAVITATION AND
THE LAWS OF MOTION

63. The Equations of Free Motion for a Mass Point and their Connection 
with the Gravitational Equations 

In the preceding chapter we have already made use of the assumption that
in a given gravitational field a mass point moves along a geodesic. However,
this assumption does not represent an independent hypothesis but may be
considered to be a consequence of the gravitational equations together with
an assumption concerning the form of the mass tensor. To obtain this result
the gravitational equations are required only in so far as they lead to the
relation

$$\nabla_\mu T^{\mu\nu} = 0 \tag{63.01}$$

which expresses the divergence-free character of the mass tensor. We can
obtain the equations of motion for a mass point from those of a continuous
medium by going to the limit of a concentrated mass. We can take the mass
tensor

$$T^{\mu\nu} = \frac{1}{c^2}\, \rho^* u^\mu u^\nu \tag{63.02}$$

where, as in Section 48, $\rho^*$ is the invariant mass density and $u^\nu$ the four-dimensional
velocity. These quantities are related by the equation of continuity

$$\nabla_\nu (\rho^* u^\nu) = 0 \tag{63.03}$$

We shall now give two derivations for the equations of motion of a mass
point: the first will be based on a variational principle, the second on a direct
application of (63.01).

In Section 47 we established a relation according to which the variation of the
action integral

$$S = \int c^2 \rho^* \sqrt{-g}\,(dx) \tag{63.04}$$

was equal to

$$\delta \int c^2 \rho^* \sqrt{-g}\,(dx) = - \int \rho^* \left(u^\nu \nabla_\nu u_\sigma\right) \xi^\sigma \sqrt{-g}\,(dx) \tag{63.05}$$

where $\xi^\sigma$ was an infinitesimal displacement vector. (Equation (63.04) differs
from (47.46) only in its use of the symbol $\rho^*$ for the invariant density.) On the
other hand, if the same action integral is varied with respect to the $g_{\mu\nu}$ then,
by (48.03), the result is

$$\delta_g \int c^2 \rho^* \sqrt{-g}\,(dx) = \tfrac12 \int \rho^* u^\mu u^\nu\, \delta g_{\mu\nu} \sqrt{-g}\,(dx) \tag{63.06}$$

According to the general relation (48.22) this shows that the action integral
(63.04) really corresponds to the mass tensor (63.02).

In the action integral (63.04) we can go to the limit in which the invariant
mass density $\rho^*$ differs from zero only in the neighbourhood of a single point
in space, the volume integral of the density over a volume surrounding that
point being finite.

The equation of continuity (63.03) may be written in the form

$$\frac{\partial}{\partial x_\nu} \left\{ \sqrt{-g}\, \rho^* u^\nu \right\} = 0 \tag{63.07}$$

Multiplying this expression by $dx_1\,dx_2\,dx_3$, integrating over the volume described
and using the fact that $\rho^*$ vanishes at the boundaries of the volume,
we obtain

$$\frac{d}{dt} \int \rho^* u^0 \sqrt{-g}\; dx_1\,dx_2\,dx_3 = 0 \tag{63.08}$$

(we have written $t$ for $x_0$ to emphasize the meaning of this relation). Consequently
the value of the integral

$$\int \rho^* u^0 \sqrt{-g}\; dx_1\,dx_2\,dx_3 = mc \tag{63.09}$$

is a constant, independent of time. But the quantity $\rho^*$ differs from zero only
around one point. Therefore the factor $u^0$ may be given its value at that point
and taken outside the integral. Its value is

$$u^0 = \frac{dt}{d\tau} = \frac{c}{\sqrt{g_{00} + 2 g_{0i} \dot{x}_i + g_{ik} \dot{x}_i \dot{x}_k}} \tag{63.10}$$

where the $x_i$ are the spatial coordinates of the mass point and the $\dot{x}_i$ their
derivatives with respect to the time. Using this value of $u^0$ we get

$$\int \rho^* \sqrt{-g}\; dx_1\,dx_2\,dx_3 = m \sqrt{g_{00} + 2 g_{0i} \dot{x}_i + g_{ik} \dot{x}_i \dot{x}_k} \tag{63.11}$$

The action integral can be obtained from this expression by multiplying by
$c^2\,dt$ and integrating with respect to time. Therefore

$$S = mc^2 \int_{t^{(0)}}^{t} \sqrt{g_{00} + 2 g_{0i} \dot{x}_i + g_{ik} \dot{x}_i \dot{x}_k}\; dt = mc^3 \int d\tau \tag{63.12}$$

This expression differs only by a constant factor from the integral considered
in Section 38, the variation of which gives the equation of the geodesic. Thus
the variational principle leads to equations of motion for a mass point

$$\delta S = 0 \tag{63.13}$$

which coincide with the equations of a geodesic.

We now derive these equations in another way, starting with the fact that
the divergence of the mass tensor vanishes.

Equation (63.01) may be stated in more detail as

$$\frac{\partial}{\partial t} \left\{ \sqrt{-g}\, T^{0\nu} \right\} + \frac{\partial}{\partial x_i} \left\{ \sqrt{-g}\, T^{i\nu} \right\} + \sqrt{-g}\; \Gamma^\nu_{\alpha\beta} T^{\alpha\beta} = 0 \tag{63.14}$$

Multiplying it by $dx_1\,dx_2\,dx_3$ and integrating over a volume at the boundaries
of which the tensor $T^{\mu\nu}$ vanishes, we get

$$\frac{d}{dt} \int \sqrt{-g}\, T^{0\nu}\, dx_1\,dx_2\,dx_3 + \int \sqrt{-g}\; \Gamma^\nu_{\alpha\beta} T^{\alpha\beta}\, dx_1\,dx_2\,dx_3 = 0 \tag{63.15}$$

Here we can insert the expressions (63.02) for the components of the mass
tensor. If the previous equation is multiplied by $c^2$ it then assumes the form

$$\frac{d}{dt} \int \rho^* u^0 u^\nu \sqrt{-g}\; dx_1\,dx_2\,dx_3 + \int \Gamma^\nu_{\alpha\beta}\, \rho^* u^\alpha u^\beta \sqrt{-g}\; dx_1\,dx_2\,dx_3 = 0 \tag{63.16}$$

The integrals involved can be calculated by the same means as the integral in
(63.11). Using (63.09) we obtain

$$\int \rho^* u^0 u^\nu \sqrt{-g}\; dx_1\,dx_2\,dx_3 = mc\, u^\nu \tag{63.17}$$

and

$$\int \Gamma^\nu_{\alpha\beta}\, \rho^* u^\alpha u^\beta \sqrt{-g}\; dx_1\,dx_2\,dx_3 = \frac{mc}{u^0}\, \Gamma^\nu_{\alpha\beta} u^\alpha u^\beta \tag{63.18}$$

where all quantities are to be taken at the position of the mass point. Replacing
$dt$ by $u^0\,d\tau$ and dividing by the common factor $mc/u^0$ we obtain from (63.16)

$$\frac{du^\nu}{d\tau} + \Gamma^\nu_{\alpha\beta} u^\alpha u^\beta = 0 \tag{63.19}$$

or

$$\frac{d^2 x_\nu}{d\tau^2} + \Gamma^\nu_{\alpha\beta} \frac{dx_\alpha}{d\tau} \frac{dx_\beta}{d\tau} = 0; \qquad \frac{dx_\nu}{d\tau} = u^\nu \tag{63.20}$$

These are the equations of a geodesic in explicit form. We have thus again
shown that the equations describing the free motion of a mass point are identical
with those of a geodesic. Our derivation shows that these equations can be
obtained directly from the equation $\nabla_\mu T^{\mu\nu} = 0$ by integrating over volume
and subsequently passing to the limit of a concentrated mass.

Here it is important to note the following. The operation of averaging over 
the volume is generally speaking not unique: it depends on a weight function. 
If before integrating over the volume we had introduced a weight function X 
we could have obtained a result dependent on X. However in the present case 
of a mass tensor of the form (63.02) the factor X cancels out in the limit of a 
concentrated mass, and unique equations of motion are obtained. 

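We note that if the ordinary time $t = x_0$ is taken as the independent variable
instead of the proper time $\tau$ and if the previous equations are multiplied by
$(d\tau/dt)^2$ they appear as

$$\frac{d^2 x_\nu}{dt^2} - \frac{dx_\nu}{dt}\, \Gamma^0_{\alpha\beta} \frac{dx_\alpha}{dt} \frac{dx_\beta}{dt} + \Gamma^\nu_{\alpha\beta} \frac{dx_\alpha}{dt} \frac{dx_\beta}{dt} = 0 \tag{63.21}$$

The equation for $\nu = 0$ reduces to an identity.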

The quantity

$$w^\nu = \frac{du^\nu}{d\tau} + \Gamma^\nu_{\alpha\beta} u^\alpha u^\beta \tag{63.22}$$

which appears on the left-hand side of the equations of motion is the contravariant
acceleration vector of the particle, as it was given by (46.26).

Let us now clarify the precise meaning of this acceleration. If no gravitational
field is present the quantity $w^\nu$ is the ordinary acceleration; in Galilean coordinates
the spatial components of this vector tend in non-relativistic approximation
to the second derivatives, with respect to time, of the cartesian coordinates
of the particle. When a gravitational field is present this will not be
so. To determine the non-relativistic limit of the $w^\nu$ in this case one can compute
the approximate expressions for the Christoffel symbols which follow from
the expressions for the metric tensor derived in Section 55. It is convenient
to postpone these calculations until Section 65, and here we only quote their
result. Taking first the $\Gamma^\nu_{\alpha\beta}$ with zero superfix we find that

$$\Gamma^0_{00} = -\frac{1}{c^2} \frac{\partial U}{\partial t}; \qquad \Gamma^0_{0i} = -\frac{1}{c^2} \frac{\partial U}{\partial x_i} \tag{63.23}$$

while the others are small. Among the $\Gamma^\nu_{\alpha\beta}$ with a spatial superfix the most
important are

$$\Gamma^i_{00} = -\frac{\partial U}{\partial x_i} \tag{63.24}$$

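Further, we require approximate expressions for the four-dimensional velocity.
Denoting the ordinary velocity by

$$v_i = \frac{dx_i}{dt} \tag{63.25}$$

we can put, approximately

$$u^i = \frac{dx_i}{d\tau} = v_i \tag{63.26}$$

and

$$u^0 = \frac{dt}{d\tau} = 1 + \frac{1}{c^2} \left(\tfrac12 v^2 + U\right) \tag{63.27}$$

Insertion of these expressions into (63.22) gives

$$w^i = \frac{dv_i}{dt} - \frac{\partial U}{\partial x_i} \tag{63.28}$$

and

$$w^0 = \frac{1}{c^2}\, v_k \left( \frac{dv_k}{dt} - \frac{\partial U}{\partial x_k} \right) \tag{63.29}$$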


(Here summation over k from 1 to 3 is implied.) 

These equations show that the spatial components of the vector $w^\nu$ represent
the acceleration of the particle apart from the acceleration due to gravity, while
the zero component is proportional to the work done on the particle in unit
time by all forces except gravity.
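
As an illustration of how the approximate expression (63.27) arises from the exact formula (63.10), the following sketch compares the two numerically. It assumes the first-approximation metric $g_{00} = c^2 - 2U$, $g_{ik} = -(1 + 2U/c^2)\delta_{ik}$ with the vector potential set to zero; the function names and sample values are hypothetical and serve only as a check of orders of magnitude.

```python
import math


def u0_exact(U, v2, c):
    """u^0 = dt/dtau from (63.10), assuming g_00 = c^2 - 2U,
    g_ik = -(1 + 2U/c^2) delta_ik and g_0i = 0 (vector potential neglected)."""
    g00 = c * c - 2.0 * U
    gkk = -(1.0 + 2.0 * U / (c * c))   # coefficient of delta_ik in g_ik
    return c / math.sqrt(g00 + gkk * v2)


def u0_approx(U, v2, c):
    """The approximation (63.27): u^0 = 1 + (v^2/2 + U)/c^2."""
    return 1.0 + (0.5 * v2 + U) / (c * c)


if __name__ == "__main__":
    U, v2 = 1.0, 1.0                   # illustrative values, arbitrary units
    for c in (10.0, 20.0, 40.0):
        print(c, u0_exact(U, v2, c) - u0_approx(U, v2, c))
```

The printed residual should fall by roughly a factor of 16 each time $c$ is doubled, as expected of a neglected term of order $1/c^4$.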


64. General Statement of the Problem of the Motion of a System of 
Masses 

The problem that will now concern us has already been generally characterized 
in Section 54. We consider a problem of an astronomical type, relating to the 
motion of celestial bodies in empty space. It is known from astronomical 
observations that the distribution of mass over the universe is far from uniform: 
an overwhelmingly large part is concentrated in the form of separate celestial 
bodies at great distances from each other. We shall assume accordingly that 
the components of the mass tensor vanish throughout space, except in some 
separated regions whose dimensions are small compared to their mutual 
distances : each such region will correspond to a celestial body. 

Within each body the mass-tensor must in the first place conform to whatever
physical model we accept for the body (that of a gas, a liquid or an elastic solid,
etc.), and it must further satisfy the requirement that its divergence vanishes

$$\nabla_\nu T^{\mu\nu} = 0 \tag{64.01}$$

To achieve conformity with some physical model one must express the components
of the mass tensor in a definite manner in terms of functions of the
state of the physical system that makes up the body. (Such functions of state
are density, velocity, pressure, etc., see for instance (55.02).)

The form of the mass tensor will depend not only on the physical properties 
of the body in question but also on the metric. Therefore its components will 
involve not only the state functions in the narrower sense but also the metric 
tensor. The metric tensor and its derivatives will also enter into the divergence 
expression (64.01). 

Thus, if one wishes to write down the mass tensor one must know the metric. 
But the metric has to be determined from Einstein’s equations which have the 
mass tensor on their right-hand sides. It is therefore evident that only a joint 
determination of the mass tensor and the metric as functions of the coordinates 
can be possible. 

Let us compare the problem we are considering with those encountered in 
Newton’s theory of gravitation. There the most common problem is to determine
the gravitational potential from a mass density that is assumed to be
known. But there are other more complicated problems in Newtonian theory 
in which the potential must be determined simultaneously with the density. 
A famous problem of this kind is that of the shape of a mass of liquid in 
rotation. This problem was studied by Liapunov in Russia and by Jeans and 
Poincare elsewhere. Its rigorous solution was given in Liapunov’s posthumous 
work [21, 22]. The problem we are concerned with is reminiscent in its general 
character of Liapunov’s problem, but in our case one has to find two tensors, 
the metric tensor and the mass tensor, instead of two scalars, the Newtonian 
potential and the density. In addition, we must deal not with equilibrium but 
with motion. It may be noted that in Section 73 we shall also encounter the 
basic equations of Liapunov’s problem. 

Our problem is simplified in the first place by the fact that the metric nowhere 
deviates greatly from the Euclidean ; the table given in Section 58 gives an 
idea of how small the deviation is. A further simplifying circumstance is 
that at all significant distances from the bodies, the metric does not depend 
on the detailed internal structure of the latter, but only on certain overall 
characteristics. Such characteristics are the total mass of the body, its moments 
of inertia, the position and velocity of its mass centre and so on. The Newtonian 
potential of a body depends on these same quantities. 

To solve Einstein’s equations we shall use a method of approximation which 
is a development of the method of calculation employed in Section 55. It is 
based on an expansion of all required functions in inverse powers of the speed
of light. An expansion that can formally be so described will, in fact, be an
expansion in powers of certain dimensionless quantities, such as $U/c^2$ and
$v^2/c^2$ where $U$ is the Newtonian potential and $v^2$ the square of some velocity,
say the velocity of one of the bodies. For systems to which the virial theorem
applies the two quantities $U$ and $v^2$ will be of the same order, of order $q^2$ say,
where $q$ is a parameter of the dimensions of a speed. (We have already used this
parameter in Sections 55 and 58.) Then the quantity $q^2/c^2$ can be used as the
dimensionless parameter in terms of which the expansion proceeds.
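
To give an idea of the size of the expansion parameter, the following sketch evaluates $U/c^2$, $v^2/c^2$ and $R\omega/c$ for a circular orbit of one astronomical unit around the Sun. The circular-orbit idealization (for which $v^2 = \gamma M / R$ exactly) and the function name are assumptions made for illustration; the constants are the standard values of the solar gravitational parameter, the astronomical unit and the speed of light.

```python
import math

GM_SUN = 1.32712440018e20   # solar gravitational parameter, m^3 / s^2
R_ORBIT = 1.495978707e11    # one astronomical unit, m
C = 299_792_458.0           # speed of light, m / s


def expansion_parameters(gm, r, c=C):
    """Return (U/c^2, v^2/c^2, R*omega/c) for an assumed circular orbit of radius r."""
    U = gm / r                      # Newtonian potential at the orbit
    v2 = gm / r                     # circular orbit: v^2 = gamma*M/r
    omega = math.sqrt(gm / r**3)    # angular velocity of revolution
    return U / c**2, v2 / c**2, r * omega / c


if __name__ == "__main__":
    u_over_c2, v2_over_c2, r_omega_over_c = expansion_parameters(GM_SUN, R_ORBIT)
    print("U/c^2       =", u_over_c2)
    print("v^2/c^2     =", v2_over_c2)
    print("R*omega / c =", r_omega_over_c)
```

Both $U/c^2$ and $v^2/c^2$ come out near $10^{-8}$, and $R\omega/c$ is simply $v/c$, the square root of the same parameter.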

If we solve wave equations by introducing corrections for retardation we
imply that the dimensions of the system are small compared to the wavelength
of the waves emitted, which in this case are gravitational waves. This assumption
is not independent of those made above. For if the angular velocity of the
revolution of a planet is denoted by $\omega$ and the radius of its orbit by $R$ we can
write the condition for the smallness of the system as $R \ll c/\omega$ since $c/\omega$ is the
wavelength of the gravitational wave divided by $2\pi$. On the other hand, the
condition that the speed $v = R\omega$ of the planet shall be small compared to the
speed of light has the form $R\omega \ll c$, which is the same as the previous inequality.

In astronomical problems we are interested in “quasistationary” states of
the gravitational field, i.e. states which are established after many planetary
revolutions. The solution of the wave equation obtained by introducing corrections
for retardation satisfies the conditions for a quasistationary state.

We have already remarked that in most astronomical problems the distances
separating celestial bodies are extremely large compared to their linear dimensions.
If $R$ is a length characterizing the order of the separations and $L$ a
length characterizing the linear dimensions of the bodies we have the inequality

$$L \ll R \tag{64.02}$$

The use of this inequality introduces considerable simplifications into the
calculations of the gravitational potential and of the metric tensor outside the
masses: as was mentioned, these quantities become independent of the detailed
internal structure of the bodies. Therefore, we shall also use the inequality
(64.02) though it is less essential than the inequalities $v^2 \ll c^2$ and $U \ll c^2$ on
which our method of solving Einstein’s equations is based.

We note that on the surfaces of the bodies and in their interiors, where the
gravitational potential is greatest, we shall have, in orders of magnitude,

$$U/c^2 = \alpha/L \tag{64.03}$$

where $\alpha$ is the gravitational radius of the body (see the table in Section 58).
Therefore the two inequalities to be used can be written as

$$\alpha \ll L \ll R \tag{64.04}$$

Our problem consists in determining the metric tensor and the mass tensor. 
If we know the components of the mass tensor as functions of the space and 
time coordinates we also know the motion of the masses, for the masses are 
just the regions in which the mass tensor differs from zero. It is essential to 
note that the motion of these regions cannot be prescribed in advance; the 
laws of motion are a consequence of the laws of gravitation. 

The solution to our problem gives us, firstly, some approximate expressions 
for the mass tensor and the metric tensor which involve certain unknown 
functions ; secondly, it gives us equations for these unknown functions from 
which they can be determined, given certain initial conditions. These are the 
equations of motion. 

In the course of solving the problem it becomes apparent how to choose 
these unknown functions. The procedure of solution leads in a natural way to 
just the quantities that are used in Newtonian mechanics, such as the mass 
centre coordinates of each body, its total mass, its angular momentum, its 
moments of inertia and all the other quantities characterizing it. 

For all these quantities equations of motion are obtained which, to first
approximation, coincide with those of Newtonian theory and to second approximation
deviate from them by small corrections. We shall be interested both
in obtaining these corrections and in expressing the metric tensor and other
quantities of Einstein’s theory in terms of Newtonian quantities.

65. The Divergence of the Mass Tensor in Second Approximation 

We shall attack the problem of determining the mass tensor and the metric
tensor in a series of steps, starting from the discussion of Section 55. We recall
that discussion. In the initial approximation the metric is taken to be Euclidean,
which corresponds to complete neglect of the forces of gravity. In that approximation
it was possible to prescribe only the components $T^{00}$ and $T^{0i}$ of the mass
tensor. According to equation (55.03) these components appear in Galilean
coordinates as

$$T^{00} = \frac{1}{c^2}\, \rho; \qquad T^{0i} = \frac{1}{c^2}\, \rho v_i \tag{65.01}$$

where $\rho$ is the density and $v_i$ the velocity of the ponderable matter at any point,
with

$$\frac{\partial \rho}{\partial t} + \sum_{i=1}^{3} \frac{\partial (\rho v_i)}{\partial x_i} = 0 \tag{65.02}$$

The spatial components $T^{ik}$ were not determined in this approximation;
it was only assumed that their order of magnitude is the same as in the absence
of gravity.

These assumptions concerning the mass tensor proved to be sufficient to
determine the first approximation for the metric. According to (55.31) we have

$$g_{00} = c^2 - 2U; \qquad g_{0i} = \frac{4}{c^2}\, U_i; \qquad g_{ik} = -\left(1 + \frac{2U}{c^2}\right) \delta_{ik} \tag{65.03}$$

where $U$ is the Newtonian potential of gravitation satisfying the equation

$$\Delta U = -4\pi\gamma\rho \tag{65.04}$$

while $U_i$ is the vector potential of gravitation satisfying

$$\Delta U_i = -4\pi\gamma\rho v_i \tag{65.05}$$

It must be remembered that the potentials $U$ and $U_i$ are related to the mass
density $\rho$ and the mass current $\rho v_i$ in a non-local manner; the value of the
potentials within a given body depends on the distribution of density throughout
space and not only within the body itself.

In Section 55 we also gave equations for the contravariant components of
the metric tensor, namely

$$g^{00} = \frac{1}{c^2} \left(1 + \frac{2U}{c^2}\right); \qquad g^{0i} = \frac{4}{c^4}\, U_i; \qquad g^{ik} = -\left(1 - \frac{2U}{c^2}\right) \delta_{ik} \tag{65.06}$$

We had also, according to (55.37),

$$\sqrt{-g} = c + \frac{2U}{c} \tag{65.07}$$

Knowing the metric to the approximation of these equations we can take the
next step in the construction of the mass tensor. To this end we must first
write down more precisely an expression for its divergence. This is the aim of
the present section.

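According to (41.24) the general expression for the divergence of a symmetric
tensor has the form

$$\nabla_\nu T^{\mu\nu} = \frac{\partial T^{\mu\nu}}{\partial x_\nu} + \Gamma^\mu_{\alpha\beta} T^{\alpha\beta} + y_\nu T^{\mu\nu} \tag{65.08}$$

where for brevity we have put

$$y_\nu = \frac{\partial}{\partial x_\nu} \left[ \log \sqrt{-g} \right] = \Gamma^\alpha_{\nu\alpha} \tag{65.09}$$

In any case the order of magnitude of the components of the mass tensor is
given correctly by the equations (55.02). We have

$$T^{00} = O\!\left(\frac{\rho}{c^2}\right); \qquad T^{0i} = O\!\left(\frac{\rho}{c^2}\, q\right); \qquad T^{ik} = O\!\left(\frac{\rho}{c^2}\, q^2\right) \tag{65.10}$$

where $q$ is the parameter used before which characterizes the order of magnitude
of the velocity.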

Let us consider the zero component of the divergence of the mass tensor.
It is given by (65.08) with $\mu = 0$. In our initial approximation we neglected
all but the derivative term in this equation. To take the next step we must
know the quantities $\Gamma^0_{\alpha\beta}$ and $y_\nu$ up to terms of the order of $1/c^2$ inclusive. As
for the spatial components of the divergence, they were neglected completely
in the initial approximation. To take them into account in first approximation
we must now find the leading term of $\Gamma^i_{\alpha\beta}$, which does not contain a factor $1/c^2$.
For the determination of $y_\nu$, $\Gamma^0_{\alpha\beta}$ and $\Gamma^i_{\alpha\beta}$ to the required accuracy the approximation
represented by (65.03) to (65.07) is sufficient.

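From (65.07) and (65.09) we readily obtain

$$y_0 = \frac{2}{c^2} \frac{\partial U}{\partial t}; \qquad y_i = \frac{2}{c^2} \frac{\partial U}{\partial x_i} \tag{65.11}$$

If we then calculate the Christoffel symbols from the well known relations

$$\Gamma^\mu_{\alpha\beta} = g^{\mu\nu}\, \Gamma_{\nu,\alpha\beta} \tag{65.12}$$

where

$$\Gamma_{\nu,\alpha\beta} = \frac12 \left( \frac{\partial g_{\nu\alpha}}{\partial x_\beta} + \frac{\partial g_{\nu\beta}}{\partial x_\alpha} - \frac{\partial g_{\alpha\beta}}{\partial x_\nu} \right) \tag{65.13}$$

it is easy to see that in the last equation our approximation allows us to neglect
all terms containing the factor $1/c^2$. There then remains

$$\Gamma_{0,00} = -\frac{\partial U}{\partial t}; \qquad \Gamma_{0,0i} = -\frac{\partial U}{\partial x_i} \tag{65.14}$$

and also

$$\Gamma_{i,00} = \frac{\partial U}{\partial x_i} \tag{65.15}$$

Hence we get for $\Gamma^0_{\alpha\beta}$:

$$\Gamma^0_{00} = -\frac{1}{c^2} \frac{\partial U}{\partial t}; \qquad \Gamma^0_{0i} = -\frac{1}{c^2} \frac{\partial U}{\partial x_i} \tag{65.16}$$

whereas the $\Gamma^0_{ik}$ will be of higher order of smallness, namely

$$\Gamma^0_{ik} = O\!\left(\frac{1}{c^4}\right) \tag{65.17}$$

Among the Christoffel symbols with a spatial superfix there occurs

$$\Gamma^i_{00} = -\frac{\partial U}{\partial x_i} \tag{65.18}$$

while all the others will be small:

$$\Gamma^i_{0k} = O\!\left(\frac{1}{c^2}\right); \qquad \Gamma^i_{kl} = O\!\left(\frac{1}{c^2}\right) \tag{65.19}$$

Having found the values of the Christoffel symbols we can now write down the
required expression for the divergence of the mass tensor.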
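
The leading Christoffel symbols quoted above can be checked by direct numerical differentiation of the metric. The sketch below is only an illustration under stated assumptions: it uses the first-approximation metric $g_{00} = c^2 - 2U$, $g_{ik} = -(1 + 2U/c^2)\delta_{ik}$ with the vector potential $U_i$ set to zero (so that the metric is diagonal), a hypothetical test potential, and an arbitrary value `C = 100` for the speed of light in arbitrary units.

```python
import math

C = 100.0   # hypothetical speed of light in arbitrary units


def potential(x):
    """Hypothetical test potential U(t, x1, x2, x3) with a slow time dependence."""
    t, x1, x2, x3 = x
    r = math.sqrt(x1 * x1 + x2 * x2 + x3 * x3)
    return (1.0 + 0.1 * math.sin(t)) / r


def metric(x):
    """Covariant metric with g_0i = 0 assumed, so that it is diagonal."""
    U = potential(x)
    g = [[0.0] * 4 for _ in range(4)]
    g[0][0] = C * C - 2.0 * U
    for i in (1, 2, 3):
        g[i][i] = -(1.0 + 2.0 * U / (C * C))
    return g


def d_metric(x, rho, h=1e-3):
    """Central-difference derivative of g_ab with respect to x_rho."""
    xp, xm = list(x), list(x)
    xp[rho] += h
    xm[rho] -= h
    gp, gm = metric(xp), metric(xm)
    return [[(gp[a][b] - gm[a][b]) / (2.0 * h) for b in range(4)] for a in range(4)]


def christoffel(x):
    """Gamma^mu_{ab} = g^{mu mu} * Gamma_{mu,ab}, valid for a diagonal metric."""
    g = metric(x)
    dg = [d_metric(x, rho) for rho in range(4)]   # dg[rho][a][b] = d g_ab / d x_rho
    G = [[[0.0] * 4 for _ in range(4)] for _ in range(4)]
    for mu in range(4):
        inv = 1.0 / g[mu][mu]
        for a in range(4):
            for b in range(4):
                lower = 0.5 * (dg[a][mu][b] + dg[b][mu][a] - dg[mu][a][b])
                G[mu][a][b] = inv * lower
    return G


if __name__ == "__main__":
    G = christoffel([0.3, 1.0, 2.0, -0.5])
    print("Gamma^1_00 =", G[1][0][0])
    print("Gamma^0_00 =", G[0][0][0])
    print("Gamma^0_11 =", G[0][1][1])
```

With these assumptions `G[1][0][0]` should agree with $-\partial U/\partial x_1$ and `G[0][0][0]` with $-(1/c^2)\,\partial U/\partial t$ up to relative corrections of order $U/c^2$, while `G[0][1][1]` is of order $1/c^4$.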

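Equation (65.08) with $\mu = 0$ may be stated in more detail as

$$\nabla_\nu T^{0\nu} = \frac{\partial T^{00}}{\partial t} + \frac{\partial T^{0i}}{\partial x_i} + \left(\Gamma^0_{00} + y_0\right) T^{00} + \left(2\Gamma^0_{0i} + y_i\right) T^{0i} + \Gamma^0_{ik} T^{ik} \tag{65.20}$$

By (65.11) and (65.16) the coefficient of $T^{00}$ in the above is

$$\Gamma^0_{00} + y_0 = \frac{1}{c^2} \frac{\partial U}{\partial t} \tag{65.21}$$

As for the coefficient of $T^{0i}$, it proves to be negligibly small, namely

$$2\Gamma^0_{0i} + y_i = O\!\left(\frac{1}{c^4}\right) \tag{65.22}$$

By (65.17) the coefficient of $T^{ik}$ is of the same order.

Thus in our approximation equation (65.20) assumes the form

$$\nabla_\nu T^{0\nu} = \frac{\partial T^{00}}{\partial t} + \frac{\partial T^{0i}}{\partial x_i} + \frac{1}{c^2} \frac{\partial U}{\partial t}\, T^{00} \tag{65.23}$$

For the spatial components of the divergence, equation (65.08) with $\mu = i$
gives

$$\nabla_\nu T^{i\nu} = \frac{\partial T^{i0}}{\partial t} + \frac{\partial T^{ik}}{\partial x_k} + \Gamma^i_{00} T^{00} + 2\Gamma^i_{0k} T^{0k} + y_0 T^{i0} + \Gamma^i_{kl} T^{kl} + y_k T^{ik} \tag{65.24}$$

As may be seen from the estimates discussed above, the coefficients of the last
four terms on the right-hand side will all have a factor of $1/c^2$ so that these terms
may be discarded. Expressing $\Gamma^i_{00}$ by (65.18) we therefore get

$$\nabla_\nu T^{i\nu} = \frac{\partial T^{i0}}{\partial t} + \frac{\partial T^{ik}}{\partial x_k} - \frac{\partial U}{\partial x_i}\, T^{00} \tag{65.25}$$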


Thus, if the deviation of the metric from the Euclidean, or, in other words the
force of gravity, is taken into account to first order, we must express the
condition of vanishing divergence by the equations

$$\frac{\partial T^{00}}{\partial t} + \frac{\partial T^{0k}}{\partial x_k} + \frac{1}{c^2} \frac{\partial U}{\partial t}\, T^{00} = 0 \tag{65.26}$$

and

$$\frac{\partial T^{i0}}{\partial t} + \frac{\partial T^{ik}}{\partial x_k} - \frac{\partial U}{\partial x_i}\, T^{00} = 0 \tag{65.27}$$


66. The Approximate Form of the Mass Tensor for an Elastic Solid 
with Inclusion of the Gravitational Field 

In Section 32 we obtained approximate expressions for the mass tensor of an
elastic solid or fluid body, neglecting gravitational forces, by studying Umov’s
scalar and vector of energy density and current. These expressions were then
written down in Section 55 in the notation we are now using ($x_0 = t$). According
to (55.02) they have the form

$$\begin{aligned}
c^2 T^{00} &= \rho \left\{ 1 + \frac{1}{c^2} \left(\tfrac12 v^2 + \Pi\right) \right\} \\
c^2 T^{0i} &= \rho v_i \left\{ 1 + \frac{1}{c^2} \left(\tfrac12 v^2 + \Pi\right) \right\} - \frac{1}{c^2}\, p_{ik} v_k \\
c^2 T^{ik} &= \rho v_i v_k - p_{ik}
\end{aligned} \tag{66.01}$$

We note that here $p_{ik}$ denotes the three-dimensional elastic stress tensor and
$\Pi$ the elastic energy per unit mass of the body. According to (30.08) these
quantities satisfy the relation

$$\rho \frac{d\Pi}{dt} = \tfrac12\, p_{ik} \left( \frac{\partial v_i}{\partial x_k} + \frac{\partial v_k}{\partial x_i} \right) \tag{66.02}$$


We now have to generalize the expression for the mass tensor so as to take
the gravitational field into account. According to Newton’s theory the negative
of the Newtonian potential is also the potential energy of a particle of unit
mass placed in a given gravitational field. Therefore one would expect that the
required expressions for the energy density and energy current may be obtained
by adding the terms $(-\rho U)$ and $(-\rho v_i U)$ respectively to the expressions for
Umov’s scalar and vector in equations (30.17) and (30.18). This gives

$$S = \tfrac12 \rho v^2 + \rho(\Pi - U) \tag{66.03}$$

and

$$S_i = v_i \left\{ \tfrac12 \rho v^2 + \rho(\Pi - U) \right\} - p_{ik} v_k \tag{66.04}$$

The components of the mass tensor can then be obtained from the relations

$$c^2 T^{00} = \rho + \frac{1}{c^2}\, S \tag{66.05}$$

and

$$c^2 T^{0i} = \rho v_i + \frac{1}{c^2}\, S_i \tag{66.06}$$

We shall thus finally have

$$\begin{aligned}
c^2 T^{00} &= \rho \left\{ 1 + \frac{1}{c^2} \left(\tfrac12 v^2 + \Pi - U\right) \right\} \\
c^2 T^{0i} &= \rho v_i \left\{ 1 + \frac{1}{c^2} \left(\tfrac12 v^2 + \Pi - U\right) \right\} - \frac{1}{c^2}\, p_{ik} v_k \\
c^2 T^{ik} &= \rho v_i v_k - p_{ik}
\end{aligned} \tag{66.07}$$

If our argument is correct, then to the approximation required these expressions
must satisfy the equations derived at the end of the preceding section,
namely

$$\begin{aligned}
\frac{\partial T^{00}}{\partial t} + \frac{\partial T^{0k}}{\partial x_k} + \frac{1}{c^2} \frac{\partial U}{\partial t}\, T^{00} &= 0 \\
\frac{\partial T^{i0}}{\partial t} + \frac{\partial T^{ik}}{\partial x_k} - \frac{\partial U}{\partial x_i}\, T^{00} &= 0
\end{aligned} \tag{66.08}$$


These equations must be satisfied by virtue of the equations of motion for the
quantities involved in the mass tensor. We have, firstly, the equation of continuity

$$\frac{\partial \rho}{\partial t} + \frac{\partial (\rho v_i)}{\partial x_i} = 0 \tag{66.09}$$

and, secondly, the equations of motion of an elastic body in a gravitational
field of acceleration $\partial U/\partial x_i$:

$$\rho \frac{dv_i}{dt} - \rho \frac{\partial U}{\partial x_i} = \frac{\partial p_{ik}}{\partial x_k} \tag{66.10}$$


In addition, the relation (66.02) between the stress tensor and the elastic
potential energy is valid. Using (66.09), (66.10) and (66.02) we find that Umov’s
scalar and vector are related by

$$\frac{\partial S}{\partial t} + \frac{\partial S_i}{\partial x_i} = -\rho \frac{\partial U}{\partial t} \tag{66.11}$$

whence with the use of (66.09) we obtain

$$\frac{\partial T^{00}}{\partial t} + \frac{\partial T^{0k}}{\partial x_k} = -\frac{1}{c^4}\, \rho \frac{\partial U}{\partial t} \tag{66.12}$$

Since $\rho = c^2 T^{00}$ approximately, equation (66.12) and the first equation of
(66.08) coincide in the approximation in question. As for the remaining
equations of (66.08), after multiplication by $c^2$ they coincide approximately
with the equation of motion (66.10) written in the form

$$\frac{\partial (\rho v_i)}{\partial t} + \frac{\partial (\rho v_i v_k)}{\partial x_k} - \frac{\partial p_{ik}}{\partial x_k} - \rho \frac{\partial U}{\partial x_i} = 0 \tag{66.13}$$


(here it is sufficient to take the leading terms in the expressions for $T^{i0}$ and $T^{00}$).

Thus the equations (66.08), which express the fact that the mass tensor
has vanishing (non-Euclidean) divergence, are indeed approximately satisfied
if the quantities (66.07) are taken to represent the mass tensor.

In the equations (66.07) the density $\rho$ satisfies the equation of continuity
(66.09). The corresponding generally covariant equation has the form

$$\frac{\partial}{\partial x_\alpha} \left\{ \sqrt{-g}\, \rho^* u^\alpha \right\} = 0 \tag{66.14}$$

where $\rho^*$ is the invariant density and $u^\alpha$ the four-dimensional velocity. To
make (66.09) and (66.14) coincide we can put

$$\rho = \frac{1}{c} \sqrt{-g}\, \rho^* u^0; \qquad \rho v_i = \frac{1}{c} \sqrt{-g}\, \rho^* u^i \tag{66.15}$$

(The factor $1/c$ has been included so as to make $\rho^*$ approximately equal to $\rho$.)
Noting that by (65.07) and (63.27)

$$\frac{1}{c} \sqrt{-g} = 1 + \frac{2U}{c^2}; \qquad u^0 = 1 + \frac{1}{c^2} \left(\tfrac12 v^2 + U\right) \tag{66.16}$$

we get, approximately

$$\rho^* = \rho \left\{ 1 - \frac{1}{c^2} \left(\tfrac12 v^2 + 3U\right) \right\} \tag{66.17}$$

The equations (66.07) allow one to write down an expression for the invariant
of the mass tensor. Introducing the pressure by the equation

$$p = -\tfrac13 \left(p_{11} + p_{22} + p_{33}\right) \tag{66.18}$$

we get

$$T = \rho \left\{ 1 + \frac{1}{c^2} \left(-\tfrac12 v^2 + \Pi - 3U\right) \right\} - \frac{3}{c^2}\, p \tag{66.19}$$

and using the equation (66.17) for $\rho^*$

$$T = \rho^* \left(1 + \frac{\Pi}{c^2}\right) - \frac{3}{c^2}\, p \tag{66.20}$$

This last expression agrees with (32.30).
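
The agreement of the two forms of the invariant to the order retained can be illustrated numerically. The sketch below is an assumption-laden check rather than part of the text: it contracts the mass tensor (66.07) with the first-approximation metric (65.03), taking the vector potential $U_i$ as zero, and compares the result with (66.19) and (66.20). All function names and sample values are hypothetical.

```python
def invariant_by_contraction(rho, v2, Pi, U, p, c):
    """T = g_{mu nu} T^{mu nu}, using (65.03) with U_i = 0 assumed and (66.07)."""
    T00 = rho * (1.0 + (0.5 * v2 + Pi - U) / c**2) / c**2
    Tkk = (rho * v2 + 3.0 * p) / c**2       # sum of T^{kk}; p_kk = -3p by (66.18)
    return (c**2 - 2.0 * U) * T00 - (1.0 + 2.0 * U / c**2) * Tkk


def invariant_6619(rho, v2, Pi, U, p, c):
    """The approximate invariant (66.19)."""
    return rho * (1.0 + (-0.5 * v2 + Pi - 3.0 * U) / c**2) - 3.0 * p / c**2


def rho_star(rho, v2, U, c):
    """The invariant density (66.17)."""
    return rho * (1.0 - (0.5 * v2 + 3.0 * U) / c**2)


def invariant_6620(rho, v2, Pi, U, p, c):
    """The invariant written in terms of rho*, eq. (66.20)."""
    return rho_star(rho, v2, U, c) * (1.0 + Pi / c**2) - 3.0 * p / c**2


if __name__ == "__main__":
    args = (1.0, 0.5, 0.3, 0.8, 0.2)         # rho, v^2, Pi, U, p (illustrative)
    for c in (10.0, 20.0):
        print(c,
              invariant_by_contraction(*args, c) - invariant_6619(*args, c),
              invariant_6620(*args, c) - invariant_6619(*args, c))
```

Both differences should scale as $1/c^4$, that is, fall by a factor of 16 when $c$ is doubled, which is beyond the accuracy claimed for (66.19) and (66.20).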


67. Approximate Expressions for the Christoffel Symbols and Some
Other Quantities

In order to take the next step in the determination of the metric tensor
we must extend the calculations of Section 55. As all our calculations will be
done in harmonic coordinates it is convenient to use as unknown functions the
contravariant components of the metric tensor multiplied by $\sqrt{-g}$, i.e. the
quantities

$$\mathfrak{g}^{\mu\nu} = \sqrt{-g}\; g^{\mu\nu} \tag{67.01}$$

The advantage of using these quantities comes from the fact that they permit
the harmonic condition (55.79) to be stated in the simple form

$$\frac{\partial \mathfrak{g}^{\mu\nu}}{\partial x_\mu} = 0 \tag{67.02}$$

so that it is linear in the unknown functions. A further advantage of this
choice of unknown functions is that the spatial components differ very little
from constants. Finally it is also most convenient that the left-hand side of
every one of the gravitational equations involves the d’Alembert operator
applied to the appropriate component $\mathfrak{g}^{\mu\nu}$, so that approximately each component
is connected only with the single component of the mass tensor having
the same indices.

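In Section 55 (equations (55.38)) we found approximate expressions for
the $\mathfrak{g}^{\mu\nu}$:

$$\mathfrak{g}^{00} = \frac{1}{c} + \frac{4U}{c^3}; \qquad \mathfrak{g}^{0i} = \frac{4}{c^3}\,U_i; \qquad \mathfrak{g}^{ik} = -c\,\delta_{ik} \tag{67.03}$$

Here $U$ is the Newtonian potential and $U_i$ the gravitational vector potential.
We now use more accurate expressions obtained by adding the next terms in
the expansion in inverse powers of the speed of light. We have then:

$$\mathfrak{g}^{00} = \frac{1}{c} + \frac{4U}{c^3} + \frac{4S}{c^5}; \qquad \mathfrak{g}^{0i} = \frac{4U_i}{c^3} + \frac{4S_i}{c^5}; \qquad \mathfrak{g}^{ik} = -c\,\delta_{ik} + \frac{4S_{ik}}{c^3} \tag{67.04}$$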


Using these expressions we can calculate the Christoffel symbols and the 
other quantities that enter the gravitational equations. 

We begin by calculating the determinant $g$. As is easily verified (see equation
(B.67)) it is equal to the determinant of the $\mathfrak{g}^{\mu\nu}$. The equations (67.04) give

$$g = -c^2\left\{1 + \frac{4U}{c^2} + \frac{4S}{c^4} - \frac{4S_{kk}}{c^4}\right\} \tag{67.05}$$

Here we have used the summation convention for spatial indices and have put

$$S_{kk} = \sum_{k=1}^{3} S_{kk} = S_{11} + S_{22} + S_{33} \tag{67.06}$$

Extracting the square root we obtain further

$$\sqrt{-g} = c\left\{1 + \frac{2U}{c^2} + \frac{2S - 2S_{kk} - 2U^2}{c^4}\right\} \tag{67.07}$$

We introduce a special symbol for the fourth root

$$f = \sqrt[4]{-\frac{g}{c^2}} \tag{67.08}$$

By (67.07) this quantity is

$$f = 1 + \frac{U}{c^2} + \frac{1}{c^4}\left(S - S_{kk} - \tfrac32 U^2\right) \tag{67.09}$$

The quantity $f$ will approximately satisfy a linear differential equation which
will be derived in the next section.
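The two root extractions can be verified numerically. The following sketch assumes that $g = -c^2\{1 + 4U/c^2 + 4(S - S_{kk})/c^4\}$, as in (67.05), and compares the exact square and fourth roots with the truncated series; the function names and the sample values of $U$, $S$ and $S_{kk}$ are hypothetical. If the series are correct to order $1/c^4$, the residual should fall roughly as $1/c^6$.

```python
import math

def g_det(U, S, Skk, c):
    # determinant g to order 1/c**4, assumed form of (67.05)
    return -c**2 * (1 + 4*U/c**2 + 4*(S - Skk)/c**4)

def sqrt_minus_g_series(U, S, Skk, c):
    # truncated expansion of sqrt(-g)
    return c * (1 + 2*U/c**2 + (2*S - 2*Skk - 2*U**2)/c**4)

def f_series(U, S, Skk, c):
    # truncated expansion of the fourth root f = (-g/c**2)**(1/4)
    return 1 + U/c**2 + (S - Skk - 1.5*U**2)/c**4

def errors(c, U=0.8, S=0.5, Skk=0.3):
    # residuals of the two truncated series against the exact roots
    g = g_det(U, S, Skk, c)
    e_sqrt = abs(math.sqrt(-g)/c - sqrt_minus_g_series(U, S, Skk, c)/c)
    e_f = abs((-g/c**2)**0.25 - f_series(U, S, Skk, c))
    return e_sqrt, e_f
```

Doubling c should shrink both residuals by a factor close to 64, consistent with the first neglected terms being of sixth order in $1/c$.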

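From (67.04) and (67.07) we obtain for $c^2 g^{00}$ the expression

$$c^2 g^{00} = 1 + \frac{2U}{c^2} + \frac{2S + 2S_{kk} - 2U^2}{c^4} \tag{67.10}$$

Putting

$$U^{*} = U + \frac{1}{c^2}\left(S + S_{kk} - 2U^2\right) \tag{67.11}$$

we have with the same accuracy

$$c^2 g^{00} = \frac{c^2 + U^{*}}{c^2 - U^{*}} \tag{67.12}$$

and also

$$\frac{1}{c^2}\,g_{00} = \frac{c^2 - U^{*}}{c^2 + U^{*}} \tag{67.13}$$

We shall encounter the quantity $U^{*}$ in the following.
We now pass on to the calculation of the quantities

$$\Pi^{\mu,\alpha\beta} = \frac{1}{2g}\left(\mathfrak{g}^{\alpha\rho}\frac{\partial\mathfrak{g}^{\mu\beta}}{\partial x_\rho} + \mathfrak{g}^{\beta\rho}\frac{\partial\mathfrak{g}^{\mu\alpha}}{\partial x_\rho} - \mathfrak{g}^{\mu\rho}\frac{\partial\mathfrak{g}^{\alpha\beta}}{\partial x_\rho}\right) \tag{67.14}$$

These are introduced in Appendix B and are related to the Christoffel symbols.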
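Using (67.04) and (67.05) we obtain the following set of equations

$$\Pi^{0,00} = -\frac{2}{c^6}\frac{\partial U}{\partial t}; \qquad \Pi^{0,0i} = \frac{2}{c^4}\frac{\partial U}{\partial x_i}; \qquad \Pi^{0,kl} = \frac{2}{c^4}\left(\frac{\partial U_k}{\partial x_l} + \frac{\partial U_l}{\partial x_k}\right) \tag{67.15}$$

and

$$\begin{aligned}
\Pi^{i,00} &= -\frac{2}{c^4}\left[\frac{\partial U}{\partial x_i} + \frac{1}{c^2}\left(\frac{\partial S}{\partial x_i} - 4U\frac{\partial U}{\partial x_i} + 2\frac{\partial U_i}{\partial t}\right)\right] \\
\Pi^{i,0k} &= \frac{2}{c^4}\left(\frac{\partial U_i}{\partial x_k} - \frac{\partial U_k}{\partial x_i}\right) \\
\Pi^{i,kl} &= O\!\left(\frac{1}{c^4}\right)
\end{aligned} \tag{67.16}$$

If we lower the second and third superfix we obtain the $\Pi^{\nu}_{\alpha\beta}$. We find

$$\Pi^{0}_{00} = -\frac{2}{c^2}\frac{\partial U}{\partial t}; \qquad \Pi^{0}_{0i} = -\frac{2}{c^2}\frac{\partial U}{\partial x_i}; \qquad \Pi^{0}_{ik} = O\!\left(\frac{1}{c^4}\right) \tag{67.17}$$

and

$$\begin{aligned}
\Pi^{i}_{00} &= -2\left[\frac{\partial U}{\partial x_i} + \frac{1}{c^2}\left(\frac{\partial S}{\partial x_i} - 8U\frac{\partial U}{\partial x_i} + 2\frac{\partial U_i}{\partial t}\right)\right] \\
\Pi^{i}_{k0} &= -\frac{2}{c^2}\left(\frac{\partial U_i}{\partial x_k} - \frac{\partial U_k}{\partial x_i}\right) \\
\Pi^{i}_{kl} &= O\!\left(\frac{1}{c^4}\right)
\end{aligned} \tag{67.18}$$

In calculating $\Pi^{i}_{00}$, the value $g_{00} = c^2 - 2U$ is needed, but for the calculation
of the other quantities it is sufficient to use the Galilean values of the metric
tensor components. In the equations for the $\Pi^{\nu}_{\alpha\beta}$ we have restricted ourselves
to terms of the order $1/c^2$.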

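In order to evaluate the Christoffel symbols we require not only the $\Pi^{\nu}_{\alpha\beta}$
but also the quantities

$$y_\alpha = \frac{\partial\log\sqrt{-g}}{\partial x_\alpha} \tag{67.19}$$

$$y^\alpha = g^{\alpha\beta}y_\beta \tag{67.20}$$

(see Appendix D). We state the approximate values of the $y_\alpha$. Differentiating
(67.05) we obtain

$$y_0 = \frac{2}{c^2}\frac{\partial U}{\partial t}; \qquad y_i = \frac{2}{c^2}\left[\frac{\partial U}{\partial x_i} + \frac{1}{c^2}\left(\frac{\partial S}{\partial x_i} - 4U\frac{\partial U}{\partial x_i} - \frac{\partial S_{kk}}{\partial x_i}\right)\right] \tag{67.21}$$

Retaining terms up to the order of $1/c^4$ we find the following values for the $y^\alpha$
with upper indices

$$y^i = -\frac{2}{c^2}\left[\frac{\partial U}{\partial x_i} + \frac{1}{c^2}\left(\frac{\partial S}{\partial x_i} - 6U\frac{\partial U}{\partial x_i} - \frac{\partial S_{kk}}{\partial x_i}\right)\right]; \qquad y^0 = \frac{2}{c^4}\frac{\partial U}{\partial t} \tag{67.22}$$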


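The Christoffel symbols are expressible in terms of the quantities just
derived as

$$\Gamma^{\nu}_{\alpha\beta} = \Pi^{\nu}_{\alpha\beta} + \tfrac12\left(y_\alpha\delta^{\nu}_{\beta} + y_\beta\delta^{\nu}_{\alpha} - y^{\nu}g_{\alpha\beta}\right) \tag{67.23}$$

If we confine ourselves to terms of order not higher than $1/c^2$ we find for $\nu = 0$
just as in the approximation (65.16):

$$\Gamma^{0}_{00} = -\frac{1}{c^2}\frac{\partial U}{\partial t}; \qquad \Gamma^{0}_{0i} = -\frac{1}{c^2}\frac{\partial U}{\partial x_i}; \qquad \Gamma^{0}_{ik} = O\!\left(\frac{1}{c^4}\right) \tag{67.24}$$

whereas for $\nu = i$ we obtain expressions that are more accurate than (65.18)
and (65.19):

$$\begin{aligned}
\Gamma^{i}_{00} &= -\frac{\partial U}{\partial x_i} - \frac{1}{c^2}\left(\frac{\partial S}{\partial x_i} + \frac{\partial S_{kk}}{\partial x_i} - 8U\frac{\partial U}{\partial x_i} + 4\frac{\partial U_i}{\partial t}\right) \\
\Gamma^{i}_{0k} &= \frac{1}{c^2}\frac{\partial U}{\partial t}\,\delta_{ik} - \frac{2}{c^2}\left(\frac{\partial U_i}{\partial x_k} - \frac{\partial U_k}{\partial x_i}\right) \\
\Gamma^{i}_{kl} &= \frac{1}{c^2}\left(\frac{\partial U}{\partial x_k}\,\delta_{il} + \frac{\partial U}{\partial x_l}\,\delta_{ik} - \frac{\partial U}{\partial x_i}\,\delta_{kl}\right)
\end{aligned} \tag{67.25}$$

In approximate calculations it is convenient to use equations arranged in
such a form that the $\Pi^{\nu}_{\alpha\beta}$ are involved rather than the $\Gamma^{\nu}_{\alpha\beta}$. The advantage
of the former quantities over the latter may be seen by comparing (67.18)
with (67.25). These equations show that the $\Pi^{i}_{kl}$ may be neglected, but the
$\Gamma^{i}_{kl}$ may not.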
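As a consistency check on the lowest-order terms, the Newtonian value of $\Gamma^{i}_{00}$ can be recovered in two lines. The sketch assumes the decomposition $\Gamma^{\nu}_{\alpha\beta} = \Pi^{\nu}_{\alpha\beta} + \tfrac12(y_\alpha\delta^{\nu}_{\beta} + y_\beta\delta^{\nu}_{\alpha} - y^{\nu}g_{\alpha\beta})$ for (67.23) and keeps only the leading order in $1/c$; it is an illustration of the bookkeeping, not a substitute for the full expressions (67.25).

```latex
\begin{align*}
\Pi^{i}_{00} &\approx -2\,\frac{\partial U}{\partial x_i}, \qquad
y^{i} \approx -\frac{2}{c^{2}}\frac{\partial U}{\partial x_i}, \qquad
g_{00} \approx c^{2}, \\
\Gamma^{i}_{00} &= \Pi^{i}_{00} - \tfrac12\, y^{i} g_{00}
 \approx -2\,\frac{\partial U}{\partial x_i} + \frac{\partial U}{\partial x_i}
 = -\frac{\partial U}{\partial x_i}.
\end{align*}
```

The same mechanism explains the closing remark of the paragraph above: the spatial symbols $\Gamma^{i}_{kl}$ receive their terms of order $1/c^2$ entirely from the $y$ quantities, while the $\Pi^{i}_{kl}$ themselves are of higher order.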

In Appendix D we obtain an equation for the Einstein tensor which has
been brought to the form described: this is equation (D.87). It involves the
Lagrangian, which by (D.95) is equal to

$$L = -\frac{1}{2\sqrt{-g}}\,\Pi^{\nu}_{\alpha\beta}\frac{\partial\mathfrak{g}^{\alpha\beta}}{\partial x_\nu} + \tfrac12\,y_\nu y^\nu \tag{67.26}$$

We derive the approximate value of this expression. From (67.21) and (67.22)
8U\(8U , 1 8(S - S kk )Y


„ 4 isuy 4 / 8V\(8U 1 

9)-yy-yy + ? 


(67.27) 

On the other hand, if we use (67.17) and (67.18) in conjunction with the initial
expressions (67.04) for the $\mathfrak{g}^{\mu\nu}$ we find

1 np S 8 a0 __ 4 18U\ 2 16 8U 8Ui 

2 a13 dx„ c 5 \ dtj c 5 8xi 8t 


1 np S 8 a0 __ 4 /0E7\ 2 16 0£7 at7 4 

2 a13 8a; p c 5 \ 8f / c 5 8xi 8t 

+ ‘( 1 _!E)ff + ia-_u 

c 3 \ c 2 / c- c/ify/ c J \ 


au<_aE /*\ 2 

C/Zfc SXj / 


(67.28) 


Adding this to one half of the preceding equation we obtain for $\sqrt{-g}$ times the
Lagrangian


6 (8Vy 16 8V eVi 4 (dUi sv k y 
+ c 5 \ 8t j + c 6 8x { 8t c s \8x k 8xi / 

Introducing U* from (67.11) this expression may be written as 

2 (8U*y 6 / 8U*\ 2 16 8t7*8l7| 4/8^ 8V k \ 2 

9 c 3 \ 8xi j c 5 \ 8t ./ ^ c 6 dxt 8t c 5 \8x k 8xt ) 

(67.30) 

Here we have replaced $U$ by $U^{*}$ in the correction terms. The last equation
is remarkable in having on its right-hand side a homogeneous quadratic form
in the first derivatives of the four quantities $U^{*}$ and $U_i$ with constant coefficients.
By comparing (67.27) and (67.29) we see that the sum

V(-9)-( l + &y«y “) 

= 8 (8U dS^c _ 1 /8Ui _ 8UA 2 f8U \ 2 ^ 3EM 1} 

e 5 1 8xi 8xi 2 \ 8x k 8xt / \ 8t ) dxt 8t ) 

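will be of a higher order of smallness than (67.27) and (67.29) separately. This
remark enables one to derive a very simple approximate expression for the
curvature scalar $R$. According to the relation (D.49) we have in harmonic
coordinates

$$R = g^{\alpha\beta}\frac{\partial^2\log\sqrt{-g}}{\partial x_\alpha\,\partial x_\beta} - L \tag{67.32}$$

Using the definition (67.08) we may write

$$\sqrt{-g} = c f^2 \tag{67.33}$$

whence we get

$$R = \frac{2}{f}\,g^{\alpha\beta}\frac{\partial^2 f}{\partial x_\alpha\,\partial x_\beta} - \frac{2}{f^2}\,g^{\alpha\beta}\frac{\partial f}{\partial x_\alpha}\frac{\partial f}{\partial x_\beta} - L \tag{67.34}$$

But we have

$$\frac{2}{f^2}\,g^{\alpha\beta}\frac{\partial f}{\partial x_\alpha}\frac{\partial f}{\partial x_\beta} = \tfrac12\,y_\alpha y^\alpha \tag{67.35}$$

Therefore

$$R = \frac{2}{c f^3}\,\mathfrak{g}^{\alpha\beta}\frac{\partial^2 f}{\partial x_\alpha\,\partial x_\beta} - \left(\tfrac12\,y_\alpha y^\alpha + L\right) \tag{67.36}$$

We now pass to the approximate equations. According to (67.31) the quantity
$(\tfrac12 y_\alpha y^\alpha + L)$ will be of sixth order in $1/c$. Discarding quantities of sixth order
we can replace the $\mathfrak{g}^{\alpha\beta}$ by their Galilean values. The curvature scalar $R$ then
assumes the very simple approximate form

$$R = -\frac{2}{f^3}\left(\Delta f - \frac{1}{c^2}\frac{\partial^2 f}{\partial t^2}\right) \tag{67.37}$$

where $\Delta$ is the Laplace operator with Euclidean coefficients.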


68. Approximate Form of the Gravitational Equations 

The calculations of the preceding section permit us to write the left-hand
side of Einstein’s gravitational equations

$$R^{\mu\nu} - \tfrac12\,g^{\mu\nu}R = -\frac{8\pi\gamma}{c^2}\,T^{\mu\nu} \tag{68.01}$$

in the approximation corresponding to the required accuracy. Let us first
write down the value of the left-hand side without any approximations.
According to (D.87) we have

$$R^{\mu\nu} - \tfrac12\,g^{\mu\nu}R = \frac{1}{2g}\,\mathfrak{g}^{\alpha\beta}\frac{\partial^2\mathfrak{g}^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} + \Pi^{\mu,\alpha\beta}\Pi^{\nu}_{\alpha\beta} - \tfrac12\,y^\mu y^\nu + \tfrac12\,g^{\mu\nu}L + \tfrac12\,g^{\mu\nu}B - B^{\mu\nu} \tag{68.02}$$

Here $L$ is the Lagrangian

$$L = -\frac{1}{2\sqrt{-g}}\,\Pi^{\nu}_{\alpha\beta}\frac{\partial\mathfrak{g}^{\alpha\beta}}{\partial x_\nu} + \tfrac12\,y_\nu y^\nu \tag{68.03}$$

which was already encountered in the preceding section. According to (D.85)
the quantities $B^{\mu\nu}$ are given by the equations

$$B^{\mu\nu} = \Gamma^{\mu\nu} + \tfrac12\left(y^\mu\Gamma^\nu + y^\nu\Gamma^\mu\right) = \tfrac12\left(\nabla^\mu + y^\mu\right)\Gamma^\nu + \tfrac12\left(\nabla^\nu + y^\nu\right)\Gamma^\mu \tag{68.04}$$

where the $\Gamma^\nu$ are the quantities introduced in Sections 41 and 53:

$$\Gamma^\nu = -\frac{1}{\sqrt{-g}}\frac{\partial\mathfrak{g}^{\mu\nu}}{\partial x_\mu} \tag{68.05}$$

The quantity $B$ is the “quasi-invariant” formed from the $B^{\mu\nu}$

$$B = g_{\mu\nu}B^{\mu\nu} = \left(\nabla_\nu + y_\nu\right)\Gamma^\nu \tag{68.06}$$

It is not a true invariant because the $B^{\mu\nu}$ do not form a tensor.

In a harmonic system of coordinates the $\Gamma^\nu$ vanish and, therefore, also the
$\Gamma^{\mu\nu}$ and so do the $B^{\mu\nu}$ and $B$.

We note that as long as the left-hand side of Einstein’s equations (68.01)
contains the complete expression (68.02) for the conservative tensor, the
equation

$$\nabla_\mu T^{\mu\nu} = 0 \tag{68.07}$$

is a consequence of the equations themselves. However, if from the start
we drop the terms in $B^{\mu\nu}$ and $B$ from the conservative tensor the equation
(68.07) will only be true if the conditions $\Gamma^\nu = 0$ are satisfied.

Passing to the approximate form of Einstein’s equations we shall first consider
the terms involving second derivatives. Using (67.04) we get

$$\mathfrak{g}^{\alpha\beta}\frac{\partial^2\varphi}{\partial x_\alpha\,\partial x_\beta} = -c\,\Delta\varphi + \frac{1}{c}\frac{\partial^2\varphi}{\partial t^2} + \frac{4}{c^3}\left(S_{ik}\frac{\partial^2\varphi}{\partial x_i\,\partial x_k} + 2U_i\frac{\partial^2\varphi}{\partial x_i\,\partial t} + U\frac{\partial^2\varphi}{\partial t^2}\right) \tag{68.08}$$

Here the leading terms combine to give $c$ times the Euclidean d’Alembertian
while the terms with variable coefficients represent corrections which may be
neglected in the approximation considered. We consider in more detail the
order of magnitude of these corrections. The expression (68.02) for the Einstein
tensor involves the quantity (68.08) divided by $2g$, if the corresponding
component of $\mathfrak{g}^{\mu\nu}$ is inserted for $\varphi$. The derivatives of $\varphi$ will be of third order and since
$g = -c^2$ approximately, the division by $2g$ makes the correction terms in (68.08)
of eighth order in $1/c$. Since we intend to neglect terms of this order we must
evaluate all other quantities involved in (68.02) to a corresponding accuracy.

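To avoid denominators involving $g$ we shall evaluate not the Einstein tensor
itself but, instead, that tensor multiplied by a factor which differs little from
unity, namely $(-g/c^2)$. Then to the required accuracy the terms involving
second derivatives will be

$$-\frac{1}{2c^2}\,\mathfrak{g}^{\alpha\beta}\frac{\partial^2\mathfrak{g}^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} = \frac{1}{2c}\,\Delta\mathfrak{g}^{\mu\nu} - \frac{1}{2c^3}\frac{\partial^2\mathfrak{g}^{\mu\nu}}{\partial t^2} \tag{68.09}$$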


In evaluating the terms involving first derivatives we shall first use the relations
(67.15) to (67.18) to obtain the quantities
8xi) 8x s
c 2 8t J\8x k c 2 8xic
8/8U t SUs^dUt 
c 6 \ 8x s 8xi 


8>S 2 8U]c\ 

+ c 2 ~8f) 


\ iMh __ 8Us\ 

) \ 8x s 8x k j 


( 68 . 10 ) 

( 68 . 11 ) 


( 68 . 12 ) 


In the equations (68.10) and (68.11) we need not have included the factor
$(-g/c^2)$ because in the present approximation it may be replaced by unity.
Further, using the expressions (67.22) for the $y^\alpha$ and introducing the notation

$$N^{\mu\nu} = \left(-\frac{g}{c^2}\right)\left\{\Pi^{\mu,\alpha\beta}\Pi^{\nu}_{\alpha\beta} - \tfrac12\,y^\mu y^\nu + \tfrac12\,g^{\mu\nu}L\right\} \tag{68.13}$$

we obtain


N 00 


N<* 


7 (8Uy 
e 8 \8x s ) 

6 8V 8U 8/dUi 8U S \8U 

c 8 8t dxi c e \ 8x s 8x{) 8x s 


n«.U a Il + l - d Ti\( a Il + t s !h\
where
1 „ 2 /0*7*\* 6 /a?7\ 2 16


(68.14) 


(68.15) 


(68.16) 


(68.17) 


2 c 

16 877 811, 

8x s 8t 

4 /8U„ su r y 

C 6 \ 8x r 8xi) 

and $U^{*}$ is the quantity given by (67.11). Since $U$ enters these equations through
terms of sixth order it can be replaced by $U^{*}$ as was done in (67.30). Then the
only first derivatives contained in the $N^{\mu\nu}$ are those of the four quantities $U^{*}$
and $U_s$ and all the $N^{\mu\nu}$ are homogeneous quadratic functions of these first
derivatives, with constant coefficients. This represents a very great simplification
of the exact equations (68.13).

With the use of the expressions found we can immediately write down the
approximate form of Einstein’s equations; we have

$$\frac{1}{2c}\,\Delta\mathfrak{g}^{\mu\nu} - \frac{1}{2c^3}\frac{\partial^2\mathfrak{g}^{\mu\nu}}{\partial t^2} + N^{\mu\nu} = \frac{8\pi\gamma g}{c^4}\,T^{\mu\nu} \tag{68.18}$$

where the $N^{\mu\nu}$ have the values (68.14) to (68.16).

In order to use equation (68.18) for the determination of the $\mathfrak{g}^{\mu\nu}$ the quantities
$U^{*}$ and $U_i$ appearing in the $N^{\mu\nu}$ must first of all be evaluated to the accuracy
required.

As regards the $U_i$ they enter into the expressions (68.14) to (68.16) only
through terms of sixth order; it is therefore sufficient to know them to the
accuracy reached in Section 55. The relevant equations were also given in
Section 65. (These equations can be obtained from the equation (68.18) with
$\mu = 0$, $\nu = i$ by neglecting $N^{0i}$ and the second time derivative, replacing $g$
by $-c^2$, retaining only the leading term in $T^{0i}$ and expressing $\mathfrak{g}^{0i}$ in terms of
$U_i$.) We have by (65.05)

$$\Delta U_i = -4\pi\gamma\rho v_i \tag{68.19}$$

On the other hand $U^{*}$ appears in terms not of sixth, but of fourth, order and
it must be known with greater accuracy. By the definition (67.11) we have

$$U^{*} = U + \frac{1}{c^2}\left(S + S_{kk} - 2U^2\right) \tag{68.20}$$

According to (67.04) the derivatives of $U + S/c^2$ are equal to the derivatives
of $c^3\mathfrak{g}^{00}/4$. Therefore equation (68.18) for $\mu = \nu = 0$ gives:

$$\left(\Delta - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\left(U + \frac{1}{c^2}S\right) = -\frac{c^4}{2}\,N^{00} + 4\pi\gamma g\,T^{00} \tag{68.21}$$

Further, the derivatives of $S_{kk}/c^2$ and of $c\,\mathfrak{g}^{kk}/4$ are equal. Putting $\mu = \nu = k$
in (68.18) and summing over $k$ we get

$$\left(\Delta - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\frac{S_{kk}}{c^2} = -\frac{c^2}{2}\,N^{kk} - 4\pi\gamma\,T^{kk} \tag{68.22}$$

(In the second term on the right we have replaced $(-g/c^2)$ by unity.) Finally
we can write

$$\left(\Delta - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\left(\frac{-2U^2}{c^2}\right) = -\frac{4}{c^2}\,(\operatorname{grad} U)^2 + 16\pi\gamma\,U\,T^{00} \tag{68.23}$$

Here we have neglected the second time derivative in the d’Alembertian and
have used the identity

$$\Delta(U^2) = 2(\operatorname{grad} U)^2 + 2U\,\Delta U \tag{68.24}$$

and also the equation

$$\Delta U = -4\pi\gamma\rho = -4\pi\gamma c^2\,T^{00} \tag{68.25}$$
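The identity (68.24), $\Delta(U^2) = 2(\operatorname{grad} U)^2 + 2U\,\Delta U$, is easy to confirm with finite differences. In the sketch below the test function `U`, the step size and the sample point are arbitrary illustrative choices; central differences are used, so the residual is expected to be of order $h^2$ and not exactly zero.

```python
import math

def laplacian(f, p, h=1e-3):
    # second-order central-difference Laplacian of f at point p = (x, y, z)
    x, y, z = p
    centre = f(x, y, z)
    total = (f(x + h, y, z) + f(x - h, y, z)
             + f(x, y + h, z) + f(x, y - h, z)
             + f(x, y, z + h) + f(x, y, z - h))
    return (total - 6.0 * centre) / h**2

def grad_sq(f, p, h=1e-3):
    # squared magnitude of the central-difference gradient of f at p
    x, y, z = p
    gx = (f(x + h, y, z) - f(x - h, y, z)) / (2 * h)
    gy = (f(x, y + h, z) - f(x, y - h, z)) / (2 * h)
    gz = (f(x, y, z + h) - f(x, y, z - h)) / (2 * h)
    return gx * gx + gy * gy + gz * gz

def U(x, y, z):
    # smooth stand-in for a potential (hypothetical, chosen only for the test)
    return 1.0 / math.sqrt(1.0 + x * x + 2 * y * y + 3 * z * z)

def identity_residual(p):
    # |Laplacian(U^2) - 2 (grad U)^2 - 2 U Laplacian(U)| at the point p
    U2 = lambda x, y, z: U(x, y, z) ** 2
    lhs = laplacian(U2, p)
    rhs = 2.0 * grad_sq(U, p) + 2.0 * U(*p) * laplacian(U, p)
    return abs(lhs - rhs)
```

Combined with $\Delta U = -4\pi\gamma c^2 T^{00}$, this identity is what converts $\Delta(-2U^2/c^2)$ into the right-hand side of (68.23).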


The required equation for $U^{*}$ may be obtained by adding the three equations
(68.21) to (68.23). We calculate the sum of the right-hand sides. By (68.14)
we have

$$-\frac{c^4}{2}\,N^{00} = \frac{7}{2c^2}\,(\operatorname{grad} U)^2 \tag{68.26}$$

Calculating to the same accuracy we obtain from (68.16) and (68.17)

$$-\frac{c^2}{2}\,N^{kk} = \frac{1}{2c^2}\,(\operatorname{grad} U)^2 \tag{68.27}$$

and therefore

$$\frac{c^4}{2}\,N^{00} + \frac{c^2}{2}\,N^{kk} + \frac{4}{c^2}\,(\operatorname{grad} U)^2 = 0 \tag{68.28}$$

Using this relation as well as the equation

$$4\pi\gamma g + 16\pi\gamma U = -4\pi\gamma c^2 \tag{68.29}$$

we get for $U^{*}$ the simple equation

$$\Delta U^{*} - \frac{1}{c^2}\frac{\partial^2 U^{*}}{\partial t^2} = -4\pi\gamma\left(c^2\,T^{00} + T^{kk}\right) \tag{68.30}$$

Outside the masses $U^{*}$ satisfies d’Alembert’s equation with Euclidean
coefficients. This could be expected in view of the form (68.17) of the Lagrangian.
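The cancellation that leads to (68.30) is pure coefficient bookkeeping and can be tallied with exact rational arithmetic. The sketch assumes the leading values $-\tfrac{c^4}{2}N^{00} = \tfrac{7}{2c^2}(\operatorname{grad} U)^2$ and $-\tfrac{c^2}{2}N^{kk} = \tfrac{1}{2c^2}(\operatorname{grad} U)^2$ for (68.26) and (68.27), together with $g = -c^2 - 4U$; the dictionary layout is just one hypothetical way of organising the terms.

```python
from fractions import Fraction as F

# Coefficients of (grad U)^2 / c^2 on the right-hand sides of (68.21) to (68.23)
grad_terms = {
    "68.21": F(7, 2),   # from -(c^4/2) N^00
    "68.22": F(1, 2),   # from -(c^2/2) N^kk
    "68.23": F(-4),     # explicit term in (68.23)
}

# Coefficients multiplying 4*pi*gamma*T^00, stored as
# (coefficient of c^2, coefficient of U), using g = -c^2 - 4U
t00_terms = {
    "68.21": (F(-1), F(-4)),  # 4*pi*gamma*g*T^00
    "68.23": (F(0), F(4)),    # 16*pi*gamma*U*T^00
}

grad_total = sum(grad_terms.values())            # should vanish, cf. (68.28)
t00_c2 = sum(a for a, _ in t00_terms.values())   # should be -1, cf. (68.29)
t00_U = sum(b for _, b in t00_terms.values())    # should vanish
```

With the quadratic terms gone and the $U\,T^{00}$ terms cancelling, only $-4\pi\gamma(c^2T^{00} + T^{kk})$ survives, the $T^{kk}$ part coming directly from (68.22).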

At the end of Section 67 we found an approximate expression for the scalar
of curvature $R$ in terms of the function

$$f = \sqrt[4]{-\frac{g}{c^2}} \tag{68.31}$$

which is proportional to the fourth root of the absolute value of the
determinant $g$. We have, approximately

$$R = -\frac{2}{f^3}\left(\Delta f - \frac{1}{c^2}\frac{\partial^2 f}{\partial t^2}\right) \tag{68.32}$$

If this is combined with the relation

$$R = \frac{8\pi\gamma}{c^2}\,T \tag{68.33}$$

which follows from Einstein’s equations, and in which $T$ is the invariant of
the mass tensor, the preceding equation gives

$$\Delta f - \frac{1}{c^2}\frac{\partial^2 f}{\partial t^2} = -\frac{4\pi\gamma}{c^2}\,f^3\,T \tag{68.34}$$

Let us compare this relation with the expression (68.30) for $U^{*}$. We put

$$f = 1 + \frac{U^{**}}{c^2} \tag{68.35}$$

where

$$U^{**} = U + \frac{1}{c^2}\left(S - S_{kk} - \tfrac32 U^2\right) \tag{68.36}$$

as a comparison with (67.09) shows. Like $U^{*}$ the quantity $U^{**}$ is equal in
first approximation to the Newtonian potential $U$. Equation (68.34) now
assumes the form

$$\Delta U^{**} - \frac{1}{c^2}\frac{\partial^2 U^{**}}{\partial t^2} = -4\pi\gamma\left(1 + \frac{3U}{c^2}\right)T \tag{68.37}$$

To the same degree of accuracy we can write

$$\Delta U^{**} - \frac{1}{c^2}\frac{\partial^2 U^{**}}{\partial t^2} = -4\pi\gamma\left\{(c^2 + U)\,T^{00} - T^{kk}\right\} \tag{68.38}$$

This last equation can also be derived directly from (68.21) to (68.23) and from
the definition (68.36) for $U^{**}$.

If the invariant $T$ has the form (66.19) we have

$$\left(1 + \frac{3U}{c^2}\right)T = \rho\left\{1 + \frac{1}{c^2}\left(-\tfrac12 v^2 + \Pi\right)\right\} - \frac{3}{c^2}\,p \tag{68.39}$$

By use of this equation together with (68.37) it can be shown that in the
approximation considered the quantity $U^{**}$ is an additive function of the masses.

It is easy to see that the mean of (68.20) and (68.36)

$$\bar U = \tfrac12\,(U^{*} + U^{**}) \tag{68.40}$$

satisfies the equation

$$\Delta\bar U - \frac{1}{c^2}\frac{\partial^2\bar U}{\partial t^2} = -4\pi\gamma\left(c^2 + \tfrac12 U\right)T^{00} \tag{68.41}$$

This equation has only one component of the mass tensor on its right-hand
side; it is related to the Einstein equation involving the same component,
while the quantity $\bar U$ itself is related to $\mathfrak{g}^{00}$. Indeed, by multiplying together
the expressions

$$g^{00} = \frac{1}{c^2}\,\frac{c^2 + U^{*}}{c^2 - U^{*}}; \qquad \sqrt{-g} = c\left(1 + \frac{U^{**}}{c^2}\right)^2 \tag{68.42}$$

and remembering that $U^{*}$ and $U^{**}$ differ from one another and from $\bar U$ by
terms of order $1/c^2$ we get

$$\sqrt{-g}\,g^{00} = \mathfrak{g}^{00} = \frac{1}{c} + \frac{4\bar U}{c^3} + \frac{7\bar U^2}{c^5} \tag{68.43}$$

where $\bar U$ is a solution of (68.41).

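It is not difficult to verify equation (68.43) for $\mathfrak{g}^{00}$. For if $\mathfrak{g}^{00}$ has this value,
then

$$\frac{1}{2c}\,\Delta\mathfrak{g}^{00} - \frac{1}{2c^3}\frac{\partial^2\mathfrak{g}^{00}}{\partial t^2} = \frac{2}{c^4}\left(1 + \frac{7\bar U}{2c^2}\right)\left(\Delta\bar U - \frac{1}{c^2}\frac{\partial^2\bar U}{\partial t^2}\right) + \frac{7}{c^6}\left\{(\operatorname{grad}\bar U)^2 - \frac{1}{c^2}\left(\frac{\partial\bar U}{\partial t}\right)^2\right\} \tag{68.44}$$

Here we can replace $\bar U$ by $U$ in the correction terms and we can discard
$(\partial U/\partial t)^2$. Using the value of $N^{00}$ given by (68.14) we obtain

$$\frac{1}{2c}\,\Delta\mathfrak{g}^{00} - \frac{1}{2c^3}\frac{\partial^2\mathfrak{g}^{00}}{\partial t^2} + N^{00} = \frac{2}{c^4}\left(1 + \frac{7U}{2c^2}\right)\left(\Delta\bar U - \frac{1}{c^2}\frac{\partial^2\bar U}{\partial t^2}\right) \tag{68.45}$$

If we compare this relation with Einstein’s equation (68.18) for $\mu = \nu = 0$ we
arrive at (68.41). We may note that to the same order of approximation to
which (68.43) holds we can put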




(68.46) 


For the case of a static field of a concentrated mass $M$ we obtained a rigorous
solution in Section 57. A comparison with that rigorous solution shows that if
one inserts

$$U^{*} = U^{**} = \bar U = \frac{\gamma M}{r} \tag{68.47}$$

into equations (68.42) and (68.46) they reduce to the exact expressions (see
(58.10) and (58.13)). The expressions (68.47) agree with d’Alembert’s equation,
which indeed they must satisfy outside the masses, in the approximate theory
considered in this section.
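The reduction to the exact static solution can be illustrated numerically. The sketch below assumes, as the harmonic-coordinate form of the field of a point mass, $g^{00} = (r + \alpha)/\{c^2(r - \alpha)\}$ and $\sqrt{-g} = c(1 + \alpha/r)^2$ with $\alpha = \gamma M/c^2$; these closed forms are an assumption of this example, since (58.10) and (58.13) are not reproduced in the present section. It compares them with the two expressions in (68.42) evaluated at $U^{*} = U^{**} = \gamma M/r$. The function names and sample values are hypothetical.

```python
import math

def g00_upper_exact(r, alpha, c):
    # assumed exact g^{00} of the static point-mass field in harmonic coordinates
    return (r + alpha) / (r - alpha) / c**2

def sqrt_minus_g_exact(r, alpha, c):
    # assumed exact sqrt(-g) in Cartesian harmonic coordinates
    return c * (1.0 + alpha / r) ** 2

def g00_upper_68_42(U, c):
    # first expression in (68.42), with U* = U
    return (c**2 + U) / (c**2 - U) / c**2

def sqrt_minus_g_68_42(U, c):
    # second expression in (68.42), with U** = U
    return c * (1.0 + U / c**2) ** 2

gamma, M, c = 1.0, 2.0, 5.0           # illustrative units
alpha = gamma * M / c**2
checks = []
for r in (0.5, 1.0, 10.0):            # all radii exceed alpha
    U = gamma * M / r
    checks.append(math.isclose(g00_upper_exact(r, alpha, c),
                               g00_upper_68_42(U, c), rel_tol=1e-12))
    checks.append(math.isclose(sqrt_minus_g_exact(r, alpha, c),
                               sqrt_minus_g_68_42(U, c), rel_tol=1e-12))
```

Under these assumptions the agreement is algebraic and holds at every radius, not only at large $r$, which is the content of the remark that the approximate formulae reduce to the exact ones.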


69. The Connection Between the Divergence of the Mass Tensor and the
Quantities $\Gamma^\nu$

We have already mentioned at the beginning of the preceding section that
if in Einstein’s tensor the terms containing the $\Gamma^\nu$ are omitted at the outset,
the divergence of the mass tensor will vanish, i.e.

$$\nabla_\mu T^{\mu\nu} = 0 \tag{69.01}$$

only provided the harmonic condition

$$\frac{\partial\mathfrak{g}^{\mu\nu}}{\partial x_\mu} = 0 \tag{69.02}$$

is satisfied. In order to study the equations of motion it is necessary to examine
the connection between the conditions (69.01) and (69.02) or, more precisely,
between the left-hand sides of these sets of equations.

If no terms are neglected the left-hand side of (69.01) may be written as a 
rather complicated differential expression involving the left-hand sides of 
(69.02). However, we are interested not in these rigorous relations but in 
approximate ones that are valid to the accuracy to which Einstein’s equations 
were given in the preceding section. We now pass on to the derivation of these 
approximate relations. 

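The expression for the divergence of a symmetric tensor was developed in
detail in Section 65. According to equation (65.24) the spatial components of
the divergence are

$$\nabla_\mu T^{i\mu} = \frac{\partial T^{i0}}{\partial t} + \frac{\partial T^{ik}}{\partial x_k} + \Gamma^{i}_{00}T^{00} + 2\Gamma^{i}_{0k}T^{0k} + \Gamma^{i}_{kl}T^{kl} + y_0T^{0i} + y_kT^{ik} \tag{69.03}$$

The Christoffel symbols and the quantities $y_\alpha$ appearing here have been given
in Section 67 (equations (67.25) and (67.21)). We now transform those equations
a little. We introduce the quantity $U^{*}$ following (67.11) and can then replace
the first equation of (67.25) by

$$\Gamma^{i}_{00} = -\left(1 - \frac{4U}{c^2}\right)\left(\frac{\partial U^{*}}{\partial x_i} + \frac{4}{c^2}\frac{\partial U_i}{\partial t}\right) \tag{69.04}$$

The remaining equations of (67.25) can be written as

$$\Gamma^{i}_{0k} = \tfrac12\,y_0\,\delta_{ik} - \frac{2}{c^2}\left(\frac{\partial U_i}{\partial x_k} - \frac{\partial U_k}{\partial x_i}\right) \tag{69.05}$$

and

$$\Gamma^{i}_{kl} = \tfrac12\left(y_k\,\delta_{il} + y_l\,\delta_{ik}\right) - \frac{1}{c^2}\frac{\partial U}{\partial x_i}\,\delta_{kl} \tag{69.06}$$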


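Inserting these expressions into (69.03) we obtain

$$\nabla_\mu T^{i\mu} = \frac{\partial T^{i0}}{\partial t} + \frac{\partial T^{ik}}{\partial x_k} + 2y_0T^{i0} + 2y_kT^{ik} - \left(1 - \frac{4U}{c^2}\right)\left(\frac{\partial U^{*}}{\partial x_i} + \frac{4}{c^2}\frac{\partial U_i}{\partial t}\right)T^{00} - \frac{4}{c^2}\left(\frac{\partial U_i}{\partial x_k} - \frac{\partial U_k}{\partial x_i}\right)T^{0k} - \frac{1}{c^2}\frac{\partial U}{\partial x_i}\,T^{kk} \tag{69.07}$$

We multiply this expression by the determinant $g$ which is approximately
given by

$$g = -c^2\left(1 + \frac{4U}{c^2}\right) \tag{69.08}$$

and we use the fact that the coefficients of $T^{00}$ and of $T^{kk}$ are nearly proportional
to each other. We get

$$g\,\nabla_\mu T^{i\mu} = \frac{\partial(gT^{i0})}{\partial t} + \frac{\partial(gT^{ik})}{\partial x_k} + \left(\frac{\partial U^{*}}{\partial x_i} + \frac{4}{c^2}\frac{\partial U_i}{\partial t}\right)\left(c^2T^{00} + T^{kk}\right) + 4\left(\frac{\partial U_i}{\partial x_k} - \frac{\partial U_k}{\partial x_i}\right)T^{0k} \tag{69.09}$$

This is an expression for $g$ times the divergence of any symmetric tensor.
However, if $T^{\mu\nu}$ is the mass tensor the following equations hold:

$$\Delta U^{*} - \frac{1}{c^2}\frac{\partial^2 U^{*}}{\partial t^2} = -4\pi\gamma\left(c^2T^{00} + T^{kk}\right) \tag{69.10}$$

and

$$\Delta U_i = -4\pi\gamma c^2\,T^{0i} \tag{69.11}$$

They were established above (equations (68.19) and (68.30)). With their use
the right-hand side of (69.09) can be transformed further. We consider to this
end the quantities $N^{\mu\nu}$ introduced by (68.14) to (68.16) and note that in the
gravitational equations these quantities combine with the tensor $T^{\mu\nu}$ in the form

$$g\,T^{\mu\nu} - \frac{c^4}{8\pi\gamma}\,N^{\mu\nu}$$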


We form the sum of derivatives 


A* = 


87ry 

8A U 


Putting for brevity 


8x v 

8t 8x s 


(69.12) 


(69.13) 


(69.14) 


we get 

8N i0 8N ik 2/8U* 4 8U f \ 

^ 8x k e 4 \ 8xi 


dt 


4 mu, 

c 2 8t )\~ 


1 8 2 U* 4 8V\ 

A U* -— d- 

‘ c 2 dt J 
8x s j 


8 / 8Ui 8U S \ / rT 0T\ 

+ (6916) 


c 2 dt 2 
8V t 


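Since to first approximation the relation

$$\frac{\partial U}{\partial t} + \frac{\partial U_s}{\partial x_s} = 0 \tag{69.16}$$

is valid (see (55.42)) the quantity introduced in (69.14) will be at least of order $1/c^2$ so that in
(69.15) terms containing derivatives of that quantity may be neglected. Discarding such
terms and using (69.10) and (69.11) we obtain from (69.15)

$$-\frac{c^4}{8\pi\gamma}\left(\frac{\partial N^{i0}}{\partial t} + \frac{\partial N^{ik}}{\partial x_k}\right) = \left(\frac{\partial U^{*}}{\partial x_i} + \frac{4}{c^2}\frac{\partial U_i}{\partial t}\right)\left(c^2T^{00} + T^{kk}\right) + 4\left(\frac{\partial U_i}{\partial x_k} - \frac{\partial U_k}{\partial x_i}\right)T^{0k} \tag{69.17}$$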

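Now, the right-hand side of this equation coincides with the additional terms
on the right of (69.09). Thus the latter are expressible as sums of derivatives.
Therefore, if $T^{\mu\nu}$ is the mass tensor we have

$$g\,\nabla_\mu T^{i\mu} = \frac{\partial}{\partial t}\left(gT^{i0} - \frac{c^4}{8\pi\gamma}N^{i0}\right) + \frac{\partial}{\partial x_k}\left(gT^{ik} - \frac{c^4}{8\pi\gamma}N^{ik}\right) \tag{69.18}$$

On the other hand differentiation of the gravitational equations written in the
form (68.18) gives

$$\frac{1}{2c}\left(\Delta - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\left(\frac{\partial\mathfrak{g}^{i0}}{\partial t} + \frac{\partial\mathfrak{g}^{ik}}{\partial x_k}\right) = \frac{\partial}{\partial t}\left(\frac{8\pi\gamma g}{c^4}T^{i0} - N^{i0}\right) + \frac{\partial}{\partial x_k}\left(\frac{8\pi\gamma g}{c^4}T^{ik} - N^{ik}\right) \tag{69.19}$$

Comparing the last two equations we may write

$$\left(\Delta - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\left(\frac{\partial\mathfrak{g}^{i0}}{\partial t} + \frac{\partial\mathfrak{g}^{ik}}{\partial x_k}\right) = \frac{16\pi\gamma g}{c^3}\,\nabla_\mu T^{i\mu} \tag{69.20}$$

This approximate relation plays an important role in the derivation of equations
of motion.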

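A similar relation may also be derived for the zero component of the
divergence of the mass tensor. Differentiating the approximate Einstein equations
(68.18) we find

$$\frac{1}{2c}\left(\Delta - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\frac{\partial\mathfrak{g}^{0\mu}}{\partial x_\mu} + \frac{\partial N^{0\mu}}{\partial x_\mu} = \frac{8\pi\gamma}{c^4}\frac{\partial}{\partial x_\mu}\left(gT^{0\mu}\right) \tag{69.21}$$

To evaluate the second term of the left-hand side of (69.21) we differentiate
the expressions (68.14) and (68.15) for $N^{00}$ and $N^{0i}$. Using (69.16) we get

$$\frac{\partial N^{00}}{\partial t} + \frac{\partial N^{0i}}{\partial x_i} = \frac{6}{c^6}\frac{\partial U}{\partial t}\,\Delta U + \frac{8}{c^6}\frac{\partial U}{\partial x_i}\,\Delta U_i \tag{69.22}$$

Here we may express the quantities $\Delta U$ and $\Delta U_i$ in terms of $T^{00}$ and $T^{0i}$
by use of (68.25) and (69.11). Then we have

$$\frac{\partial N^{0\nu}}{\partial x_\nu} = -\frac{24\pi\gamma}{c^4}\frac{\partial U}{\partial t}\,T^{00} - \frac{32\pi\gamma}{c^4}\frac{\partial U}{\partial x_i}\,T^{0i} \tag{69.23}$$

Inserting this into (69.21) and using the expression (69.08) which gives the
determinant $g$ in terms of $U$, we obtain after multiplication with $2c$

$$\left(\Delta - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\frac{\partial\mathfrak{g}^{0\mu}}{\partial x_\mu} = \frac{16\pi\gamma}{c^3}\,g\left(\frac{\partial T^{0\mu}}{\partial x_\mu} + \frac{1}{c^2}\frac{\partial U}{\partial t}\,T^{00}\right) \tag{69.24}$$

Instead we may put

$$\left(\Delta - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\frac{\partial\mathfrak{g}^{0\mu}}{\partial x_\mu} = \frac{16\pi\gamma}{c^3}\,g\,\nabla_\mu T^{0\mu} \tag{69.25}$$

for, by (65.23), the zero component of the divergence of the mass tensor is
equal to the bracketed expression on the right-hand side of (69.24). Equation
(69.25) has a structure completely analogous to that of equation (69.20) for the
spatial components of the divergence of the mass tensor.

Both equations may be combined as

$$\left(\Delta - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\frac{\partial\mathfrak{g}^{\mu\nu}}{\partial x_\mu} = \frac{16\pi\gamma}{c^3}\,g\,\nabla_\mu T^{\mu\nu} \tag{69.26}$$

As was stressed at the beginning of this section these equations are approximate.
They are remarkable because the differential operator on the left-hand side has
constant coefficients; consequently the equations can be very conveniently
investigated.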
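The step from (69.21) to (69.24) is mostly a matter of collecting the terms in which a derivative of $U$ multiplies a component of the mass tensor. The following sketch assumes $g = -c^2 - 4U$ and the value $\partial N^{0\nu}/\partial x_\nu = -(24\pi\gamma/c^4)(\partial U/\partial t)\,T^{00} - (32\pi\gamma/c^4)(\partial U/\partial x_i)\,T^{0i}$; it is offered as a check of the coefficients, not as the derivation used in the text.

```latex
\begin{align*}
\frac{\partial}{\partial x_\mu}\left(gT^{0\mu}\right)
 &= g\,\frac{\partial T^{0\mu}}{\partial x_\mu}
  - 4\,\frac{\partial U}{\partial t}\,T^{00}
  - 4\,\frac{\partial U}{\partial x_i}\,T^{0i}, \\
-\frac{c^{4}}{8\pi\gamma}\,\frac{\partial N^{0\nu}}{\partial x_\nu}
 &= 3\,\frac{\partial U}{\partial t}\,T^{00}
  + 4\,\frac{\partial U}{\partial x_i}\,T^{0i}, \\
\text{sum}
 &= g\,\frac{\partial T^{0\mu}}{\partial x_\mu} - \frac{\partial U}{\partial t}\,T^{00}
 \approx g\left(\frac{\partial T^{0\mu}}{\partial x_\mu}
  + \frac{1}{c^{2}}\frac{\partial U}{\partial t}\,T^{00}\right).
\end{align*}
```

In the last step $g$ has been replaced by $-c^2$ in the correction term. Multiplying the sum by $16\pi\gamma/c^3$ reproduces the right-hand side of (69.24); the terms in $T^{0i}$ cancel exactly.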

70. The Equations of Motion and the Harmonic Conditions 

The problem of the motion of a system of masses has already been formulated 
generally in Section 64. Now we must investigate the form of the equations of 
motion. 

We know that if each moving body is regarded as a continuous medium then
within each body the equations

$$\nabla_\mu T^{\mu\nu} = 0 \tag{70.01}$$

must be satisfied. On the other hand one can consider the motion of each body 
as a whole and characterize it by a finite number of parameters such as the 
coordinates of its mass centre, the values of its mass and its moments of inertia 
and so on. 


The problem then arises to find the equations of motion of bodies as a whole, 
i.e. to find differential equations for the parameters characterizing each body. 

We have just enumerated some of the parameters characterizing a body. 
What is it that determines their choice? In the first place it must be required 
that the parameters chosen are sufficient to find the forces which are exerted 
on other bodies and which determine their motions. In our problem the forces 
in question are those of gravitation. Therefore, the parameters characterizing 
a body as a whole must be so chosen as to permit a sufficiently accurate
determination of the gravitational field produced in the region containing the
remaining bodies. Since the bodies are supposed to be at great distances from 
each other the parameters characterizing the body must then satisfactorily 
determine the gravitational field at large distances from it. The parameters 
listed above have this property. 

What has just been said applies to Newtonian mechanics just as much as to 
Einstein’s mechanics. Since in Einstein’s theory, gravitation is related to 
the metric and it is the metric that has immediate influence on the motion of 
bodies, the problem of finding equations of motion for a system of bodies must 
be solved in conjunction with the study of the metric at large distances from the
bodies. Both the choice of parameters and the form of the equations of motion 
must be subject to the requirement that at large distances from each body the 
metric should be given correctly. 

At the end of Section 69 we derived the approximate relations

$$\left(\Delta - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}\right)\frac{\partial\mathfrak{g}^{\mu\nu}}{\partial x_\mu} = \frac{16\pi\gamma}{c^3}\,g\,\nabla_\mu T^{\mu\nu} \tag{70.02}$$

They follow from Einstein’s equations written in the approximate form (68.18).
In the rigorous solution both the right-hand side of (70.02) and the expression
acted on by the d’Alembert operator on the left-hand side vanish by virtue of
the equations (70.01) and the harmonic conditions

$$\frac{\partial\mathfrak{g}^{\mu\nu}}{\partial x_\mu} = 0 \tag{70.03}$$

If, however, the solution considered is approximate, the conditions (70.02)
establish a relation between the order of approximation in which the equations
(70.01) are valid and that in which the harmonic conditions (70.03) hold.

We have seen that in seeking the laws of motion of bodies as a whole a basic
requirement is that the metric at large distances from each body is given
correctly. This requirement means in particular that at large distances from
all bodies the harmonic condition (70.03) must be fulfilled with greatest possible
accuracy. Let us consider what consequence this has for the right-hand side
of (70.02).

These equations have the form of the usual equation for a retarded potential

$$\Delta\psi - \frac{1}{c^2}\frac{\partial^2\psi}{\partial t^2} = -4\pi\sigma \tag{70.04}$$

but it must be remembered that in the very process of establishing the equations
approximations were made that are equivalent to an expansion of the functions
involved in inverse powers of the speed of light. Therefore it would be sufficient
in this case to solve (70.04) approximately.

Nevertheless, we begin by examining the exact solution. We assume that
the function $\sigma$ on the right-hand side of the equation, which we shall term the
density, differs from zero only in a limited region of space in the neighbourhood
of a point having the coordinates

$$x_i = a_i(t) \qquad (i = 1, 2, 3) \tag{70.05}$$

(This is the region occupied by one of the masses.) We are interested in that
solution of the equation which corresponds physically to the potential
produced by the moving mass of density $\sigma$, i.e. the retarded potential. This solution
has the form

$$\psi = \int\frac{[\sigma]\,dV'}{|\mathbf{r} - \mathbf{r}'|} \tag{70.06}$$

where $[\sigma]$ is the retarded value of the density, i.e.:

$$[\sigma] = \sigma(t', \mathbf{r}'); \qquad t' = t - \frac{1}{c}\,|\mathbf{r} - \mathbf{r}'| \tag{70.07}$$

The integration is over the coordinates $(x', y', z')$ and extends over the region
occupied by the mass.

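We now pass to the approximate equations. Expanding $[\sigma]$ in inverse powers
of the speed of light $c$ and retaining only the first few terms, we get

$$\psi = \int\frac{\sigma\,dV'}{|\mathbf{r} - \mathbf{r}'|} - \frac{1}{c}\frac{d}{dt}\int\sigma\,dV' + \frac{1}{2c^2}\frac{d^2}{dt^2}\int\sigma\,|\mathbf{r} - \mathbf{r}'|\,dV' + \ldots \tag{70.08}$$

where now

$$\sigma = \sigma(t, \mathbf{r}') \tag{70.09}$$

This expression can be further expanded in inverse powers of the distance
from the mass in question.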

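Using the expansions

$$\frac{1}{|\mathbf{r} - \mathbf{r}'|} = \frac{1}{|\mathbf{r} - \mathbf{a}|} + \frac{(x_i - a_i)(x_i' - a_i)}{|\mathbf{r} - \mathbf{a}|^3} + \ldots; \qquad |\mathbf{r} - \mathbf{r}'| = |\mathbf{r} - \mathbf{a}| - \frac{(x_i - a_i)(x_i' - a_i)}{|\mathbf{r} - \mathbf{a}|} + \ldots \tag{70.10}$$

and putting

$$\int\sigma(t, \mathbf{r}')\,dV' = \mu; \qquad \int\sigma(t, \mathbf{r}')\,(x_i' - a_i)\,dV' = \mu_i \tag{70.11}$$

we obtain

$$\psi = \frac{\mu}{|\mathbf{r} - \mathbf{a}|} + \frac{\mu_i(x_i - a_i)}{|\mathbf{r} - \mathbf{a}|^3} + \ldots - \frac{1}{c}\frac{d\mu}{dt} + \frac{1}{2c^2}\frac{d^2}{dt^2}\left\{\mu\,|\mathbf{r} - \mathbf{a}| - \frac{\mu_i(x_i - a_i)}{|\mathbf{r} - \mathbf{a}|}\right\} + \ldots \tag{70.12}$$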


The quantities $\mu$ and $\mu_i$ may be called the “moments” of zero and first
order. It is easy to see that if one puts $\mu = 0$ the terms not containing $c$ will
decrease at least as $1/|\mathbf{r} - \mathbf{a}|^2$, the terms inversely proportional to $c$ will vanish
and the terms proportional to $1/c^2$ will remain bounded. If, in addition, it is
postulated that the first moments vanish, i.e. $\mu_i = 0$, the terms next in order of
importance in the expansion (70.12) will also vanish. If the second moments

$$\int\sigma\,(x_i' - a_i)(x_k' - a_k)\,dV' = \mu_{ik} \tag{70.13}$$

are also zero, further terms vanish, etc. In general not all these conditions are
independent. But we can impose as many independent conditions as we have
parameters at our disposal.
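The expansion of $1/|\mathbf{r} - \mathbf{r}'|$ about the point $\mathbf{a}$ that underlies the moments of zero and first order can be tested directly. The sketch below assumes the two-term form $1/|\mathbf{r} - \mathbf{a}| + (x_i - a_i)(x_i' - a_i)/|\mathbf{r} - \mathbf{a}|^3$ for (70.10); the coordinates chosen for the centre, the source point and the field point are arbitrary illustrative values.

```python
import math

def dist(p, q):
    # Euclidean distance between two points given as 3-tuples
    return math.sqrt(sum((pi - qi) ** 2 for pi, qi in zip(p, q)))

def inv_dist_two_terms(r, a, rp):
    # monopole plus dipole terms of 1/|r - r'| expanded about r' = a
    R = dist(r, a)
    dot = sum((ri - ai) * (pi - ai) for ri, ai, pi in zip(r, a, rp))
    return 1.0 / R + dot / R**3

def relative_error(scale):
    # field point at distance 3*scale from a; source point close to a
    a = (0.1, -0.2, 0.3)
    rp = (0.1 + 0.05, -0.2 + 0.02, 0.3 - 0.04)
    r = tuple(ai + scale * d for ai, d in zip(a, (1.0, 2.0, 2.0)))
    exact = 1.0 / dist(r, rp)
    return abs(inv_dist_two_terms(r, a, rp) - exact) / exact
```

The relative error is governed by the first neglected (second-moment) term, so doubling the distance of the field point should reduce it by a factor of about four.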

We assume now that there are several masses so that the density $\sigma$ differs
from zero in the vicinity of several points, say

$$x_i = a_i(t); \qquad x_i = b_i(t); \qquad \ldots \tag{70.14}$$

Then the same reasoning will apply to each point. Using the notation

$$\mu^{(a)} = \int_{(a)}\sigma(t, \mathbf{r})\,dV; \qquad \mu^{(b)} = \int_{(b)}\sigma(t, \mathbf{r})\,dV; \qquad \ldots \tag{70.15}$$

and

$$\mu_i^{(a)} = \int_{(a)}\sigma(t, \mathbf{r})\,(x_i - a_i)\,dV; \qquad \mu_i^{(b)} = \int_{(b)}\sigma(t, \mathbf{r})\,(x_i - b_i)\,dV; \qquad \ldots \tag{70.16}$$

for the integrals (70.11) extended over the regions occupied by each of the
masses separately we find for $\psi$ the expression

$$\psi = \sum_a\frac{\mu^{(a)}}{|\mathbf{r} - \mathbf{a}|} + \sum_a\frac{\mu_i^{(a)}(x_i - a_i)}{|\mathbf{r} - \mathbf{a}|^3} - \frac{1}{c}\frac{d}{dt}\sum_a\mu^{(a)} + \frac{1}{2c^2}\frac{d^2}{dt^2}\sum_a\left\{\mu^{(a)}\,|\mathbf{r} - \mathbf{a}| - \frac{\mu_i^{(a)}(x_i - a_i)}{|\mathbf{r} - \mathbf{a}|}\right\} + \ldots \tag{70.17}$$

In order that $\psi$ should be small at any point between masses and also at a
large distance from all masses we may now require that a series of equations

$$\mu^{(a)} = 0, \qquad \mu_i^{(a)} = 0, \qquad \ldots \tag{70.18}$$

should hold, each of which refers to a single mass.

After this general discussion we return to the study of equation (70.02).
We shall try to satisfy the condition that at any point between masses, and also
at a large distance from all masses, the harmonic condition (70.03) be satisfied
with the greatest possible accuracy.

When equations of motion are stated in four-dimensional form the equation 
expressing energy balance usually appears as a consequence of the remaining 
equations of motion. Therefore we shall first deal with the spatial components 
in the relation (70.02), corresponding to v = 1,2,3, and shall later verify that the 
conditions involving the temporal component v = 0 are also satisfied. 

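Comparing (70.02) with (70.04) we may put

$$\sigma = \sigma_i = -g\,\nabla_\mu T^{\mu i} \tag{70.19}$$

and

$$\psi = \psi_i = \frac{c^3}{4\gamma}\frac{\partial\mathfrak{g}^{\alpha i}}{\partial x_\alpha} \tag{70.20}$$

To make the quantities (70.20) small outside the masses it is necessary to satisfy
a set of conditions of the form (70.18). In the first place we require the following
integrals over the volumes of the separate masses to vanish

$$\mu^{(a)} = -\int_{(a)} g\,\nabla_\mu T^{\mu i}\,dx_1\,dx_2\,dx_3 = 0 \tag{70.21}$$

But, by (69.09)

$$g\,\nabla_\mu T^{\mu i} = \frac{\partial(gT^{i0})}{\partial t} + \frac{\partial(gT^{ik})}{\partial x_k} + \left(\frac{\partial U^{*}}{\partial x_i} + \frac{4}{c^2}\frac{\partial U_i}{\partial t}\right)\left(c^2T^{00} + T^{kk}\right) + 4\left(\frac{\partial U_i}{\partial x_k} - \frac{\partial U_k}{\partial x_i}\right)T^{0k} \tag{70.22}$$

Therefore the equations (70.21) may be written as

$$-\frac{d}{dt}\int_{(a)} gT^{i0}\,(dx)^3 = \int_{(a)}\left(\frac{\partial U^{*}}{\partial x_i} + \frac{4}{c^2}\frac{\partial U_i}{\partial t}\right)\left(c^2T^{00} + T^{kk}\right)(dx)^3 + 4\int_{(a)}\left(\frac{\partial U_i}{\partial x_k} - \frac{\partial U_k}{\partial x_i}\right)T^{0k}\,(dx)^3 \tag{70.23}$$

where we have put for brevity

$$(dx)^3 = dx_1\,dx_2\,dx_3 \tag{70.24}$$

We should also remember that

$$g = -c^2 - 4U \tag{70.25}$$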

Expressions for the components of the mass tensor of an elastic body were 
obtained in Section 66. According to (66.07) they have the form 

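$$ c^2 T^{00} = \rho\Bigl\{1 + \frac{1}{c^2}\bigl(\tfrac12 v^2 + \Pi - U\bigr)\Bigr\} $$

$$ c^2 T^{0i} = \rho v_i\Bigl\{1 + \frac{1}{c^2}\bigl(\tfrac12 v^2 + \Pi - U\bigr)\Bigr\} - \frac{1}{c^2}\,p_{ik}v_k \qquad (70.26) $$

$$ c^2 T^{ik} = \rho v_i v_k - p_{ik} $$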

If we confine ourselves to leading terms the result of inserting (70.26) into
(70.23) has the form

$$ \frac{d}{dt}\int_{(a)} \rho v_i\,(dx)^3 = \int_{(a)} \rho\,\frac{\partial U}{\partial x_i}\,(dx)^3 \qquad (70.27) $$

Hence it is clear that the relations (70.21) represent the equations of motion
of the mass centres of the various masses. If the number of masses is n there 
will be 3 n such equations. This will be the number of degrees of freedom of 
our mechanical system if the masses are treated as point-like. 

Passing on to the discussion of further conditions of the form (70.18) we can 
postulate that the following combinations of first moments vanish 

$$ \mu_{ik}^{(a)} = -\int_{(a)} g\,\bigl(x_i\nabla_\alpha T^{\alpha k} - x_k\nabla_\alpha T^{\alpha i}\bigr)(dx)^3 = 0 \qquad (70.28) $$

If here also we confine ourselves to the main terms, equations (70.28) assume 
the form 

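$$ \frac{d}{dt}\int_{(a)} \rho\,(x_i v_k - x_k v_i)(dx)^3 = \int_{(a)} \rho\Bigl(x_i\frac{\partial U}{\partial x_k} - x_k\frac{\partial U}{\partial x_i}\Bigr)(dx)^3 \qquad (70.29) $$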

Evidently they represent the law of the variation of angular momentum for 
each body. Here the angular momentum includes both the orbital moment 
arising from the motion of the mass in its orbit and the intrinsic moment arising 
from its rotation. The orbital moment can be separated out by suitably combining
equations (70.27) and (70.29). The number of equations of the form
(70.28) is also 3n, therefore if it is assumed that the masses rotate as rigid
bodies the number of equations is equal to the number of rotational degrees 
of freedom. 


71. The Internal and the External Problems in the Mechanics of 
Systems of Bodies. Newton’s Equations for Translational Motion 

In the following we shall distinguish between the internal and the external 
problem of mechanics. We shall relate the equations of motion within a body to 
the internal and the equations of the motion of a body as a whole to the external 
problem. 

We shall consider in various approximations the influence of the internal 
structure of a body on its motion as a whole. 

We saw that, in order to derive the integral forms (70.27) and (70.29) of the 
Newtonian equations of motion, it was sufficient to use the equations of the 
internal problem to the extent of the equation of continuity 


$$ \frac{\partial\rho}{\partial t} + \frac{\partial(\rho v_i)}{\partial x_i} = 0 \qquad (71.01) $$


which determines the zero approximation for some of the components of the 
mass tensor, namely 

$$ c^2 T^{00} = \rho\,; \qquad c^2 T^{0i} = \rho v_i \qquad (71.02) $$

Thus, in the approximation in question it was hardly necessary to make any 
assumptions concerning the internal structure of the body. Incidentally, a 
more detailed discussion of the internal structure of a body is itself impossible 
without knowledge of the metric to a corresponding accuracy. If one calls 
the Euclidean metric the zero approximation, then the first approximation 
beyond it already requires the introduction of the Newtonian gravitational 
potential $U$ and also of the vector potential $U_i$. This is what was done in
Section 55 on the basis of the expressions (71.02) for the mass tensor used there.
Physically, the first approximation to the metric corresponds to taking account 
of the force of gravity in the study of the internal structure. 
In Section 66, when we derived the mass tensor (as given above in (70.26))
we required not only the first approximation to the metric but also some
definite assumptions as to the internal structure of the body. In fact we took
the body to be elastic. In addition to the equation of continuity (71.01) we 
had to assume that the Newtonian equations of motion were satisfied within 
the body : 


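$$ \rho\Bigl(\frac{\partial v_i}{\partial t} + v_k\frac{\partial v_i}{\partial x_k}\Bigr) - \rho\,\frac{\partial U}{\partial x_i} = \frac{\partial p_{ik}}{\partial x_k} \qquad (71.03) $$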


The equation of continuity for the internal problem enables one to derive 
the Newtonian equations of motion (70.27) for the external problem. On 
the other hand the non-relativistic equations (71.03) of the internal problem 
and the mass tensor (70.26) based on them ensure that in the external problem 
one can obtain the second (relativistic) approximation to the equations of 
motion of the body as a whole. This is quite natural, since the non-relativistic 
equations give the distribution of energy and of related quantities within the 
body, and energy possesses mass which influences the gravitational field in 
relativistic approximation. 

Our next problem is to obtain in an explicit form the equations of motion 
that follow from the integral relations established at the end of the preceding 
section. 

To construct these equations it is first of all necessary to state what degrees 
of freedom it is intended to study, or in other words by what parameters the 
mechanical system is to be characterized. The considerations that determine 
the choice of parameters have already been stated at the beginning of the 
preceding section. Based on these considerations we shall discuss those 
degrees of freedom which correspond to, first, the translational motion of each 
body separately and, second, the rotation of each body about its mass centre. 
In doing this we shall assume that the bodies rotate as if they were rigid. (Of
course, this does not mean that the bodies are assumed to be rigid; fluids
can also perform such a motion.)†

By virtue of the equation of continuity the mass of a body 

$$ M_a = \int_{(a)} \rho\,(dx)^3 \qquad (71.04) $$

is a constant; therefore it is not to be numbered among the variable parameters.
The translational motion of a body is determined by the change in the
coordinates $a_i$ of its mass centre. These quantities may be introduced by the
relations

$$ M_a a_i = \int_{(a)} \rho\,x_i\,(dx)^3 \qquad (71.05) $$

which may also be written as

$$ \int_{(a)} \rho\,(x_i - a_i)(dx)^3 = 0 \qquad (71.06) $$

† We note that our calculation is of an approximate nature. The difficulties connected
with defining the concept of a rigid body in the Theory of Relativity do not yet arise in the 
approximation considered. 
Differentiating the integral (71.05) with respect to time and using the equation
of continuity, we obtain

$$ \frac{d}{dt}\int_{(a)} \rho\,x_i\,(dx)^3 = \int_{(a)} \frac{\partial\rho}{\partial t}\,x_i\,(dx)^3 = -\int_{(a)} \frac{\partial(\rho v_k)}{\partial x_k}\,x_i\,(dx)^3 = \int_{(a)} \rho v_i\,(dx)^3 \qquad (71.07) $$

whence

$$ \int_{(a)} \rho v_i\,(dx)^3 = M_a\dot a_i \qquad (71.08) $$

It follows from (71.08) that regardless of the distribution of velocities within
the body the left-hand side of equation (70.27) is equal to the product of the
mass of the body and the acceleration of the mass centre:

$$ \frac{d}{dt}\int_{(a)} \rho v_i\,(dx)^3 = M_a\ddot a_i \qquad (71.09) $$


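Let us calculate the integral on the right-hand side of (70.27). The Newtonian
potential $U$ may be split into two terms

$$ U(\mathbf{r}) = u_a(\mathbf{r}) + U^{(a)}(\mathbf{r}) \qquad (71.10) $$

Here $u_a$ is due to the mass $M_a$ while $U^{(a)}$ is due to all the other masses. In
accordance with this splitting we get

$$ \int_{(a)} \rho\,\frac{\partial U}{\partial x_i}(dx)^3 = \int_{(a)} \rho\,\frac{\partial u_a}{\partial x_i}(dx)^3 + \int_{(a)} \rho\,\frac{\partial U^{(a)}}{\partial x_i}(dx)^3 \qquad (71.11) $$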

Within the mass $M_a$ the potential $u_a$ satisfies the equation

$$ \Delta u_a = -4\pi\gamma\rho_a \qquad (71.12) $$

where $\rho_a$ is the density associated with that mass. The potential may be written
as the integral

$$ u_a = \gamma\int_{(a)} \frac{\rho'\,(dx')^3}{|\mathbf{r} - \mathbf{r}'|} \qquad (71.13) $$


On the other hand the potential $U^{(a)}$ which originates in the other masses
will be a slowly varying function of the coordinates within the mass under
consideration and it may be expanded in a Taylor series in powers of $(x_j - a_j)$.
This makes it possible to evaluate approximately the second integral in (71.11).
Inserting the expression (71.13) for $u_a$ into the first of the integrals we get

$$ \int_{(a)} \rho\,\frac{\partial u_a}{\partial x_i}(dx)^3 = -\gamma\int_{(a)}\int_{(a)} \rho\rho'\,\frac{x_i - x'_i}{|\mathbf{r} - \mathbf{r}'|^3}\,(dx)^3(dx')^3 \qquad (71.14) $$

but the double integral on the right is zero because the integrand is antisymmetric
in the coordinates of the two points. Thus

$$ \int_{(a)} \rho\,\frac{\partial u_a}{\partial x_i}(dx)^3 = 0 \qquad (71.15) $$


This relation can be interpreted as the statement that the resultant of the 
internal gravitational forces vanishes. 
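
The cancellation behind (71.14) and (71.15) can be illustrated numerically. The sketch below is not part of the original argument: it assumes the body is replaced by a finite set of point masses (the names `masses`, `points` and `internal_force_resultant` are hypothetical), so that the double integral becomes a double sum whose terms cancel in pairs.

```python
import random


def internal_force_resultant(masses, points, gamma=1.0):
    """Return sum_p m_p * grad u(x_p), where u is the potential of the
    remaining particles of the same body (a point-mass stand-in for (71.14))."""
    total = [0.0, 0.0, 0.0]
    for p, (m_p, x_p) in enumerate(zip(masses, points)):
        for q, (m_q, x_q) in enumerate(zip(masses, points)):
            if p == q:
                continue
            d = [x_p[i] - x_q[i] for i in range(3)]
            r3 = sum(c * c for c in d) ** 1.5
            for i in range(3):
                # Gradient of gamma*m_q/|x_p - x_q| with respect to x_p.
                total[i] -= gamma * m_p * m_q * d[i] / r3
    return total


# Assumed test data: 20 random particles standing in for one body.
random.seed(0)
masses = [random.uniform(0.5, 2.0) for _ in range(20)]
points = [[random.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(20)]
resultant = internal_force_resultant(masses, points)
print(resultant)  # each component is zero up to rounding error
```

Each ordered pair (p, q) contributes the negative of the pair (q, p), which is the discrete counterpart of the antisymmetry of the integrand in (71.14).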

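We give another proof of this relation. We put

$$ q_{ik}^{(a)} = \tfrac12\,\delta_{ik}\,(\operatorname{grad} u_a)^2 - \frac{\partial u_a}{\partial x_i}\frac{\partial u_a}{\partial x_k} \qquad (71.16) $$

Then

$$ \frac{\partial q_{ik}^{(a)}}{\partial x_k} = -\Delta u_a\,\frac{\partial u_a}{\partial x_i} \qquad (71.17) $$

or

$$ \frac{1}{4\pi\gamma}\frac{\partial q_{ik}^{(a)}}{\partial x_k} = \rho_a\,\frac{\partial u_a}{\partial x_i} \qquad (71.18) $$

as a result of (71.12). If in the integral (71.15) we take the density $\rho$ to be
$\rho_a$, the density of the particular mass under consideration, we can extend the
integral over the whole volume of space, which gives

$$ \int_{(a)} \rho\,\frac{\partial u_a}{\partial x_i}(dx)^3 = \int_{(\infty)} \rho_a\,\frac{\partial u_a}{\partial x_i}(dx)^3 = \frac{1}{4\pi\gamma}\int_{(\infty)} \frac{\partial q_{ik}^{(a)}}{\partial x_k}(dx)^3 = 0 \qquad (71.19) $$

since at infinity the $q_{ik}^{(a)}$ become zero.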

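We now turn to the evaluation of the second integral in (71.11). Expanding
the potential $U^{(a)}$ of the external masses in a Taylor series we get

$$ \frac{\partial U^{(a)}}{\partial x_i} = \Bigl(\frac{\partial U^{(a)}}{\partial x_i}\Bigr)_a + \Bigl(\frac{\partial^2 U^{(a)}}{\partial x_i\,\partial x_k}\Bigr)_a (x_k - a_k) + \frac12\Bigl(\frac{\partial^3 U^{(a)}}{\partial x_i\,\partial x_k\,\partial x_l}\Bigr)_a (x_k - a_k)(x_l - a_l) + \ldots \qquad (71.20) $$

Multiplying this expression by $\rho$, integrating over the volume of the mass
(a) and using (71.06), we obtain

$$ \int_{(a)} \rho\,\frac{\partial U^{(a)}}{\partial x_i}(dx)^3 = M_a\Bigl(\frac{\partial U^{(a)}}{\partial x_i}\Bigr)_a + \frac12\,I_{kl}^{(a)}\Bigl(\frac{\partial^3 U^{(a)}}{\partial x_i\,\partial x_k\,\partial x_l}\Bigr)_a \qquad (71.21) $$

where $I_{kl}^{(a)}$ denotes the quantities

$$ I_{kl}^{(a)} = \int_{(a)} \rho\,(x_k - a_k)(x_l - a_l)(dx)^3 \qquad (71.22) $$

which we shall call the moments of inertia of the mass (a). (In mechanics this
name is applied to somewhat different quantities.) We now have to find the
value of the potential $U^{(a)}$. We have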


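$$ U^{(a)}(\mathbf{r}) = {\sum_b}'\,u_b(\mathbf{r}) \qquad (71.23) $$

where the prime attached to the summation sign signifies that the mass (a)
is excluded from the summation. Each term in the sum represents the potential
due to the corresponding mass. We consider a single term such as

$$ u_b(\mathbf{r}) = \gamma\int_{(b)} \frac{\rho'\,(dx')^3}{|\mathbf{r} - \mathbf{r}'|} \qquad (71.24) $$

We insert the expansion

$$ \frac{1}{|\mathbf{r} - \mathbf{r}'|} = \frac{1}{|\mathbf{r} - \mathbf{b}|} - (x'_k - b_k)\frac{\partial}{\partial x_k}\frac{1}{|\mathbf{r} - \mathbf{b}|} + \frac12\,(x'_k - b_k)(x'_l - b_l)\frac{\partial^2}{\partial x_k\,\partial x_l}\frac{1}{|\mathbf{r} - \mathbf{b}|} + \ldots \qquad (71.25) $$

integrate term by term and use (71.06) and the definition (71.22). This gives

$$ u_b(\mathbf{r}) = \frac{\gamma M_b}{|\mathbf{r} - \mathbf{b}|} + \frac12\,\gamma I_{kl}^{(b)}\frac{\partial^2}{\partial x_k\,\partial x_l}\frac{1}{|\mathbf{r} - \mathbf{b}|} + \ldots \qquad (71.26) $$

Therefore we have

$$ U^{(a)}(\mathbf{r}) = {\sum_b}'\,\frac{\gamma M_b}{|\mathbf{r} - \mathbf{b}|} + \frac12\,{\sum_b}'\,\gamma I_{kl}^{(b)}\frac{\partial^2}{\partial x_k\,\partial x_l}\frac{1}{|\mathbf{r} - \mathbf{b}|} + \ldots \qquad (71.27) $$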

Let us estimate the order of the terms in the two sums occurring here. Let 
L be a length that characterizes the linear dimensions of the bodies and R a 
length of the order of magnitude of the distances between them. (See Section 
64.) Let q characterize the order of magnitude of the velocities of the bodies. 
We shall take it that in orders of magnitude 

$$ \frac{\gamma M}{R} \sim q^2 \qquad (71.28) $$

The order of magnitude of the moments of inertia will evidently be given by

$$ I_{kl} \sim ML^2 \qquad (71.29) $$

therefore the first sum in (71.27) will be of order $q^2$ and the second of order
$q^2L^2/R^2$. The omitted terms indicated by the dots will be of yet higher order
in the small quantity $L/R$ and we neglect them. On the other hand it is readily
seen that in (71.21) the second term will be of order $L^2/R^2$ relative to the first.
Therefore when calculating the second term we can discard terms in $U^{(a)}$ that
are of order $q^2L^2/R^2$ whereas they must be retained in the first term. With a
little manipulation we get

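$$ \int_{(a)} \rho\,\frac{\partial U^{(a)}}{\partial x_i}(dx)^3 = \frac{\partial}{\partial a_i}{\sum_b}'\Bigl\{\frac{\gamma M_a M_b}{|\mathbf{a} - \mathbf{b}|} + \frac12\,\gamma\bigl(M_a I_{kl}^{(b)} + M_b I_{kl}^{(a)}\bigr)\frac{\partial^2}{\partial a_k\,\partial a_l}\frac{1}{|\mathbf{a} - \mathbf{b}|}\Bigr\} \qquad (71.30) $$

Since the quantity

$$ \frac{\partial^2}{\partial a_k\,\partial a_l}\frac{1}{|\mathbf{a} - \mathbf{b}|} = -\frac{\delta_{kl}}{|\mathbf{a} - \mathbf{b}|^3} + \frac{3(a_k - b_k)(a_l - b_l)}{|\mathbf{a} - \mathbf{b}|^5} \qquad (71.31) $$

is symmetric in a and b the expression in curly brackets in (71.30) will also
be symmetric. Introducing the double sum

$$ \Phi = -\frac12\sum_{\substack{a,b \\ (a \neq b)}}\Bigl\{\frac{\gamma M_a M_b}{|\mathbf{a} - \mathbf{b}|} + \frac12\,\gamma\bigl(M_a I_{kl}^{(b)} + M_b I_{kl}^{(a)}\bigr)\frac{\partial^2}{\partial a_k\,\partial a_l}\frac{1}{|\mathbf{a} - \mathbf{b}|}\Bigr\} \qquad (71.32) $$

we can therefore write

$$ \int_{(a)} \rho\,\frac{\partial U}{\partial x_i}(dx)^3 = \int_{(a)} \rho\,\frac{\partial U^{(a)}}{\partial x_i}(dx)^3 = -\frac{\partial\Phi}{\partial a_i} \qquad (71.33) $$

Equation (70.27), which we restate,

$$ \frac{d}{dt}\int_{(a)} \rho v_i\,(dx)^3 = \int_{(a)} \rho\,\frac{\partial U}{\partial x_i}(dx)^3 \qquad (71.34) $$

then assumes the form

$$ M_a\ddot a_i = -\frac{\partial\Phi}{\partial a_i} \qquad (71.35) $$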
The quantity $\Phi$ is evidently the Newtonian potential energy of our mechanical
system expressed in terms of the coordinates and the moments of inertia. We
note that our derivation has as yet not required any assumptions concerning
the distribution of velocity within the body, apart from what is implied by
(71.01).

If all bodies under discussion possess spherical symmetry the inertia tensor
$I_{kl}$ of each body will be proportional to the unit tensor

$$ I_{kl} = I\,\delta_{kl} \qquad (71.36) $$

In that case the potential energy will not contain the moments of inertia at
all, but will depend only on the coordinates $a_i$ (and, of course, on the constant
masses $M_a$). Then the system of equations (71.35) is fully determinate: it
contains as many equations as unknown mass centre coordinates. It is just the
usual system of equations for the motion of a set of mass points subject to an
attraction given by Newton’s Law.
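
The statement that the moments of inertia drop out for spherical bodies can be checked directly from (71.31) and (71.32): the tensor of second derivatives of $1/|\mathbf{a} - \mathbf{b}|$ has zero trace, so its contraction with $I\,\delta_{kl}$ vanishes. The following is a minimal numerical sketch, not taken from the book; the function names and the sample masses, centres and moments are assumptions chosen for illustration.

```python
import math


def hessian_inv_dist(a, b):
    """Second derivatives d^2/da_k da_l of 1/|a - b|, following (71.31)."""
    d = [a[i] - b[i] for i in range(3)]
    r = math.sqrt(sum(c * c for c in d))
    return [[3.0 * d[k] * d[l] / r**5 - (1.0 if k == l else 0.0) / r**3
             for l in range(3)] for k in range(3)]


def potential_energy(masses, centres, moments, gamma=1.0):
    """Phi of (71.32): sum over ordered pairs a != b, with the factor 1/2."""
    phi = 0.0
    n = len(masses)
    for a in range(n):
        for b in range(n):
            if a == b:
                continue
            d = [centres[a][i] - centres[b][i] for i in range(3)]
            r = math.sqrt(sum(c * c for c in d))
            h = hessian_inv_dist(centres[a], centres[b])
            quad = sum((masses[a] * moments[b][k][l] + masses[b] * moments[a][k][l]) * h[k][l]
                       for k in range(3) for l in range(3))
            phi -= 0.5 * gamma * (masses[a] * masses[b] / r + 0.5 * quad)
    return phi


def isotropic(value):
    """Moments of the form I * delta_kl, as in (71.36)."""
    return [[value if k == l else 0.0 for l in range(3)] for k in range(3)]


# Assumed sample data: two bodies a distance 3 apart.
masses = [2.0, 3.0]
centres = [[0.0, 0.0, 0.0], [1.0, 2.0, 2.0]]
phi_points = potential_energy(masses, centres, [isotropic(0.0), isotropic(0.0)])
phi_spherical = potential_energy(masses, centres, [isotropic(0.4), isotropic(0.7)])
flattened = [[1.0, 0.0, 0.0], [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
phi_flattened = potential_energy(masses, centres, [flattened, isotropic(0.0)])
print(phi_points, phi_spherical, phi_flattened)
```

With isotropic moments the result coincides with the point-mass value, whereas a non-isotropic tensor changes it, in line with the distinction drawn between the spherically symmetric and the general case.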

In the general case, on the other hand, when the bodies are not spherically 
symmetric the equations of motion involve not only the mass centre coordinates 
of the masses but also their moments of inertia, and the system of equations 
(71.35) is then not fully determinate. In that case a determinate system of 
equations may be obtained if it is assumed that the bodies rotate about their 
mass centres in the manner of rigid bodies. This means that the distribution 
of velocities within each body has the form 

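$$ v_i = \dot a_i + \omega_{ji}^{(a)}(x_j - a_j) \qquad (71.37) $$

where $\omega_{ji}^{(a)}$ is the three-dimensional antisymmetric tensor of angular velocity of the
body (a). Omitting the label (a) we can alternatively write its components as

$$ \omega_{23} = \omega_1\,; \qquad \omega_{31} = \omega_2\,; \qquad \omega_{12} = \omega_3 \qquad (71.38) $$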

Under such an assumption the complete system of equations is obtained by 
adjoining to (71.35) some equations which prescribe the rates of change of the 
angular momentum of each body. The derivation of these equations will be 
given in the following section. 


72. Newton’s Equations for Rotational Motion 


We put 

$$ M_{ik}^{(a)} = \int_{(a)} \rho\,[(x_i - a_i)v_k - (x_k - a_k)v_i](dx)^3 \qquad (72.01) $$

Evidently, this is the angular momentum of body (a) referred to its own mass
centre. The law governing the change of this quantity can be obtained from
(70.29) after separating terms referring to the orbital angular momentum.
This is effected by use of the relation

$$ \frac{d}{dt}\Bigl\{a_i\int_{(a)} \rho v_k\,(dx)^3 - a_k\int_{(a)} \rho v_i\,(dx)^3\Bigr\} = a_i\int_{(a)} \rho\,\frac{\partial U}{\partial x_k}(dx)^3 - a_k\int_{(a)} \rho\,\frac{\partial U}{\partial x_i}(dx)^3 \qquad (72.02) $$

which follows from the equations of motion (71.34) and from equation (71.08),
by virtue of which

$$ \dot a_i\int_{(a)} \rho v_k\,(dx)^3 - \dot a_k\int_{(a)} \rho v_i\,(dx)^3 = 0 \qquad (72.03) $$
We restate equations (70.29)

$$ \frac{d}{dt}\int_{(a)} \rho\,(x_i v_k - x_k v_i)(dx)^3 = \int_{(a)} \rho\Bigl(x_i\frac{\partial U}{\partial x_k} - x_k\frac{\partial U}{\partial x_i}\Bigr)(dx)^3 \qquad (72.04) $$

Subtracting from them the relations (72.02) we get

$$ \frac{d}{dt}\int_{(a)} \rho\,[(x_i - a_i)v_k - (x_k - a_k)v_i](dx)^3 = \int_{(a)} \rho\Bigl[(x_i - a_i)\frac{\partial U}{\partial x_k} - (x_k - a_k)\frac{\partial U}{\partial x_i}\Bigr](dx)^3 \qquad (72.05) $$

or, using the definition (72.01):

$$ \frac{d}{dt}M_{ik}^{(a)} = \int_{(a)} \rho\Bigl[(x_i - a_i)\frac{\partial U}{\partial x_k} - (x_k - a_k)\frac{\partial U}{\partial x_i}\Bigr](dx)^3 \qquad (72.06) $$

We now calculate the quantity $M_{ik}^{(a)}$. Inserting the expression (71.37) for the
velocity into (72.01) and using the definition (71.22) for the three-dimensional
inertia tensor we get

$$ M_{ik}^{(a)} = I_{ij}^{(a)}\omega_{jk}^{(a)} - I_{kj}^{(a)}\omega_{ji}^{(a)} \qquad (72.07) $$

In more detailed notation these relations appear as 

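$$ M_{23} = (I_{22} + I_{33})\,\omega_{23} - I_{12}\,\omega_{31} - I_{13}\,\omega_{12} $$

$$ M_{31} = -I_{12}\,\omega_{23} + (I_{33} + I_{11})\,\omega_{31} - I_{23}\,\omega_{12} \qquad (72.08) $$

$$ M_{12} = -I_{13}\,\omega_{23} - I_{23}\,\omega_{31} + (I_{11} + I_{22})\,\omega_{12} $$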

For brevity the label (a) has been omitted. These are the well known equations 
in the mechanics of a rigid body, which relate angular momentum to angular 
velocity. 
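
As a cross-check of (72.07) and (72.08), the tensor $M_{ik}$ built from the second moments (71.22) can be compared with the angular momentum vector summed directly over particles. The sketch below is an illustration only: it assumes a body made of a few point masses, with `points` holding the offsets $x_i - a_i$, and all function names are hypothetical.

```python
def omega_tensor(w):
    """Antisymmetric tensor with omega_23 = w1, omega_31 = w2, omega_12 = w3 (71.38)."""
    w1, w2, w3 = w
    return [[0.0, w3, -w2],
            [-w3, 0.0, w1],
            [w2, -w1, 0.0]]


def second_moments(masses, points):
    """I_kl = sum m * xi_k * xi_l, the discrete form of (71.22)."""
    return [[sum(m * x[k] * x[l] for m, x in zip(masses, points))
             for l in range(3)] for k in range(3)]


def angular_momentum_tensor(I, om):
    """M_ik = I_ij * omega_jk - I_kj * omega_ji, following (72.07)."""
    return [[sum(I[i][j] * om[j][k] - I[k][j] * om[j][i] for j in range(3))
             for k in range(3)] for i in range(3)]


def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]


def angular_momentum_vector(masses, points, w):
    """Sum of m * xi x (w x xi) over the particles."""
    total = [0.0, 0.0, 0.0]
    for m, x in zip(masses, points):
        term = cross(x, cross(w, x))
        total = [total[i] + m * term[i] for i in range(3)]
    return total


# Assumed sample data.
masses = [1.0, 2.0, 1.5, 0.5]
points = [[1.0, 0.0, 0.5], [-0.5, 1.0, 0.0], [0.0, -1.0, 1.0], [0.3, 0.2, -2.0]]
w = [0.2, -0.4, 0.9]

M = angular_momentum_tensor(second_moments(masses, points), omega_tensor(w))
L = angular_momentum_vector(masses, points, w)
print(M[1][2], L[0])
```

The components $M_{23}$, $M_{31}$, $M_{12}$ agree with the vector components $L_1$, $L_2$, $L_3$, which also shows how the second moments used here differ from the conventional inertia tensor.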

We now evaluate the right-hand side of equation (72.06); it evidently represents
the moment of the forces acting on the body. In the sense of the separation
into external and internal forces that was given in (71.10) let us first consider
the moment of the internal forces. With the aid of (71.18) it is easy to verify 
that it vanishes. For by the same argument as was used to derive (71.19) we 
obtain the equation 

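$$ \int_{(a)} \rho\Bigl[(x_i - a_i)\frac{\partial u_a}{\partial x_k} - (x_k - a_k)\frac{\partial u_a}{\partial x_i}\Bigr](dx)^3 = \frac{1}{4\pi\gamma}\int_{(\infty)} \Bigl\{(x_i - a_i)\frac{\partial q_{kl}^{(a)}}{\partial x_l} - (x_k - a_k)\frac{\partial q_{il}^{(a)}}{\partial x_l}\Bigr\}(dx)^3 = 0 \qquad (72.09) $$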


The moment of the external forces remains to be calculated. Using the expansion 
(71.20) of which only the constant and the linear terms need be retained, we get 

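$$ \int_{(a)} \rho\Bigl[(x_i - a_i)\frac{\partial U^{(a)}}{\partial x_k} - (x_k - a_k)\frac{\partial U^{(a)}}{\partial x_i}\Bigr](dx)^3 = I_{ij}^{(a)}\Bigl(\frac{\partial^2 U^{(a)}}{\partial x_j\,\partial x_k}\Bigr)_a - I_{kj}^{(a)}\Bigl(\frac{\partial^2 U^{(a)}}{\partial x_j\,\partial x_i}\Bigr)_a \qquad (72.10) $$

We insert into this the expression (71.27) for $U^{(a)}$, of which only the first sum
need be retained. This gives

$$ \int_{(a)} \rho\Bigl[(x_i - a_i)\frac{\partial U^{(a)}}{\partial x_k} - (x_k - a_k)\frac{\partial U^{(a)}}{\partial x_i}\Bigr](dx)^3 = {\sum_b}'\,\gamma M_b\Bigl\{I_{ij}^{(a)}\frac{\partial^2}{\partial a_j\,\partial a_k}\frac{1}{|\mathbf{a} - \mathbf{b}|} - I_{kj}^{(a)}\frac{\partial^2}{\partial a_j\,\partial a_i}\frac{1}{|\mathbf{a} - \mathbf{b}|}\Bigr\} \qquad (72.11) $$

Since by (72.09) the moment of the external forces is equal to the total moment,
we may replace $U^{(a)}$ by $U$; then (72.11) gives the right-hand side of (72.06),
so that we may write the law for the rate of change of the angular momentum
of body (a) as†

$$ \frac{d}{dt}M_{ik}^{(a)} = {\sum_b}'\,\gamma M_b\Bigl\{I_{ij}^{(a)}\frac{\partial^2}{\partial a_j\,\partial a_k}\frac{1}{|\mathbf{a} - \mathbf{b}|} - I_{kj}^{(a)}\frac{\partial^2}{\partial a_j\,\partial a_i}\frac{1}{|\mathbf{a} - \mathbf{b}|}\Bigr\} \qquad (72.12) $$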


Introducing the explicit expression (71.31) for the second derivatives we can 
also write 


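$$ \frac{d}{dt}M_{ik}^{(a)} = {\sum_b}'\,\frac{3\gamma M_b\,(a_j - b_j)}{|\mathbf{a} - \mathbf{b}|^5}\bigl[(a_k - b_k)\,I_{ij}^{(a)} - (a_i - b_i)\,I_{kj}^{(a)}\bigr] \qquad (72.13) $$

These equations supplement the previously derived equations of motion of the
mass centres

$$ M_a\ddot a_i = -\frac{\partial\Phi}{\partial a_i} \qquad (72.14) $$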


and render the system of equations determinate. That this is so may be seen 
in various ways. We can consider as our unknowns characterizing the motion 
of each mass the quantities 

$$ a_i\,, \qquad I_{ik}^{(a)}\,, \qquad \omega_{ik}^{(a)} \qquad (72.15) $$

twelve quantities in all. (The components of angular momentum can be 
expressed in terms of these by equation (72.07)). For these quantities we have : 
the three equations (72.14) (the mass centre motion), the three equations 
(72.13) (the law describing the change of angular momentum) and, in addition, 
the six equations 

$$ \frac{dI_{ik}^{(a)}}{dt} = \omega_{ji}^{(a)}I_{jk}^{(a)} + \omega_{jk}^{(a)}I_{ij}^{(a)} \qquad (72.16) $$


which hold for any tensor whose components are constant in a coordinate 
system fixed to the body. (These equations will be derived below.) Thus we 
have twelve equations for the twelve unknowns, so that the system of equations 
is fully determinate. 

We could also have argued differently by considering for each body a coordinate
system $(x_1^*, x_2^*, x_3^*)$ rotating with the body (we again omit the label
(a) for brevity). Denoting by $\alpha_{ri}$ the direction cosines which satisfy the
relations

$$ \alpha_{ri}\alpha_{rj} = \delta_{ij}\,; \qquad \alpha_{ri}\alpha_{si} = \delta_{rs} \qquad (72.17) $$

we have

$$ x_r^* = \alpha_{ri}(x_i - a_i)\,; \qquad x_i - a_i = \alpha_{ri}x_r^* \qquad (72.18) $$

† This equation was first derived from Einstein's equations (by another method) by
V. P. Kashkarov [39].

The derivatives of the direction cosines are related to the components of the
angular velocity by the relations

$$ \dot\alpha_{ri} = \alpha_{rj}\,\omega_{ji}\,; \qquad \omega_{ji} = \alpha_{rj}\,\dot\alpha_{ri} \qquad (72.19) $$

By well-known relations in the kinematics of a rigid body the nine direction
cosines may be expressed in terms of the three Eulerian angles $\vartheta$, $\varphi$ and $\psi$.
For each of the masses we could have replaced the unknown functions (72.15)
by the six quantities

$$ a_1\,,\ a_2\,,\ a_3\,,\ \vartheta^{(a)}\,,\ \varphi^{(a)}\,,\ \psi^{(a)} \qquad (72.20) $$

and could have expressed all other quantities in terms of these, in particular 
those given in (72.15). For instance, (72.19) already expresses the angular 
velocity in terms of the direction cosines and their derivatives. As for the 
moments of inertia, they may be expressed as 

$$ I_{ik} = \alpha_{ri}\,\alpha_{sk}\,I_{rs}^* \qquad (72.21) $$


where the $I_{rs}^*$ are the constant values of the components of this tensor in a
coordinate system fixed in the body. For the six quantities (72.20) related to 
each mass we would then have six equations (72.13) and (72.14), i.e. the required 
number of equations to make the scheme determinate. 

The calculations just described of the equations of motion in Newtonian
approximation presuppose the possibility of discarding (in addition to relativistic
corrections) all terms of higher order in the small quantity $L/R$. If we wished
to retain these terms we would have to discuss third and higher moments
such as

$$ I_{ikl}^{(a)} = \int_{(a)} \rho\,(x_i - a_i)(x_k - a_k)(x_l - a_l)(dx)^3 \qquad (72.22) $$

However this would not destroy the determinateness of the system of equations,
for we could regard the additional quantities introduced as functions of the
Eulerian angles, e.g.

$$ I_{ikl} = \alpha_{ri}\,\alpha_{sk}\,\alpha_{ul}\,I_{rsu}^* \qquad (72.23) $$

where the $I_{rsu}^*$ are constants. Then the quantities (72.20) would remain the
only unknown functions, so that the number of these would not increase.
Using the other procedure we would include the newly introduced quantities
among the unknowns and correspondingly we would augment the system of
equations by writing down, e.g. for the quantities (72.22) the equations

$$ \frac{dI_{ikl}}{dt} = \omega_{ji}I_{jkl} + \omega_{jk}I_{ijl} + \omega_{jl}I_{ikj} \qquad (72.24) $$

In the general case of an arbitrary three-dimensional tensor $A_{i_1 i_2 \ldots i_n}$ having
the constant components $A^*_{r_1 r_2 \ldots r_n}$ in the system fixed to the body we would
have

$$ \frac{d}{dt}A_{i_1 i_2 \ldots i_n} = \omega_{j i_1}A_{j i_2 \ldots i_n} + \omega_{j i_2}A_{i_1 j \ldots i_n} + \ldots + \omega_{j i_n}A_{i_1 i_2 \ldots j} \qquad (72.25) $$

These equations follow directly from the transformation equation

$$ A_{i_1 i_2 \ldots i_n} = \alpha_{r_1 i_1}\alpha_{r_2 i_2}\cdots\alpha_{r_n i_n}\,A^*_{r_1 r_2 \ldots r_n} \qquad (72.26) $$

in conjunction with (72.19).
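
For the second moments the rule (72.16), which is the case n = 2 of (72.25), can be verified numerically. The sketch below assumes, purely for illustration, a body turning about the third axis at a constant rate, so that $\alpha_{ri}$ is the transpose of an ordinary rotation matrix; the body-frame moments and all names are hypothetical.

```python
import math


def omega_tensor(w):
    """Antisymmetric tensor with omega_23 = w1, omega_31 = w2, omega_12 = w3 (71.38)."""
    w1, w2, w3 = w
    return [[0.0, w3, -w2],
            [-w3, 0.0, w1],
            [w2, -w1, 0.0]]


def rotation_z(theta):
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def moments_in_space(theta, body_moments):
    """I_ik = alpha_ri alpha_sk I*_rs of (72.21), with alpha_ri = R_ir."""
    R = rotation_z(theta)
    return [[sum(R[i][r] * R[k][s] * body_moments[r][s]
                 for r in range(3) for s in range(3))
             for k in range(3)] for i in range(3)]


def moments_rate(I, om):
    """Right-hand side of (72.16): omega_ji I_jk + omega_jk I_ij."""
    return [[sum(om[j][i] * I[j][k] + om[j][k] * I[i][j] for j in range(3))
             for k in range(3)] for i in range(3)]


# Assumed sample data: constant body-frame moments and a spin rate about x_3.
body_moments = [[2.0, 0.3, 0.1], [0.3, 1.0, -0.2], [0.1, -0.2, 0.5]]
rate, t, h = 0.7, 0.3, 1e-6

plus = moments_in_space(rate * (t + h), body_moments)
minus = moments_in_space(rate * (t - h), body_moments)
numeric = [[(plus[i][k] - minus[i][k]) / (2.0 * h) for k in range(3)] for i in range(3)]
formula = moments_rate(moments_in_space(rate * t, body_moments), omega_tensor([0.0, 0.0, rate]))
max_error = max(abs(numeric[i][k] - formula[i][k]) for i in range(3) for k in range(3))
print(max_error)
```

The finite-difference derivative and the right-hand side of (72.16) agree to within the discretization error of the central difference.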

In conclusion, we verify that the law of conservation of energy is satisfied 
for the system of equations (72.13) and (72.14). We introduce the rotational 
kinetic energy of a body about its mass centre 

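$$ T_a = \tfrac14\,\omega_{ik}^{(a)}M_{ik}^{(a)} = \tfrac12\,\omega_{ik}^{(a)}\omega_{jk}^{(a)}I_{ij}^{(a)} \qquad (72.27) $$

When forming the time-derivative of $T_a$ one must bear in mind that by (72.16)

$$ \omega_{ik}^{(a)}\omega_{jk}^{(a)}\frac{dI_{ij}^{(a)}}{dt} = 0 \qquad (72.28) $$

so that in the differentiation of $T_a$ the $I_{ij}^{(a)}$ may be treated as constants. Remembering
this, it is easy to obtain the relation

$$ \frac{dT_a}{dt} = \frac12\,\omega_{ik}^{(a)}\frac{dM_{ik}^{(a)}}{dt} \qquad (72.29) $$

If we introduce the expression (72.12) for the derivative of $M_{ik}^{(a)}$ into this
equation and if we use the antisymmetry of the $\omega_{ik}^{(a)}$ we get

$$ \frac{dT_a}{dt} = \omega_{ik}^{(a)}\,{\sum_b}'\,\gamma M_b\,I_{ij}^{(a)}\frac{\partial^2}{\partial a_j\,\partial a_k}\frac{1}{|\mathbf{a} - \mathbf{b}|} \qquad (72.30) $$

or, after symmetrization with respect to j and k and subsequent relabelling

$$ \frac{dT_a}{dt} = \frac12\,{\sum_b}'\,\gamma M_b\bigl(\omega_{ji}^{(a)}I_{jk}^{(a)} + \omega_{jk}^{(a)}I_{ij}^{(a)}\bigr)\frac{\partial^2}{\partial a_i\,\partial a_k}\frac{1}{|\mathbf{a} - \mathbf{b}|} \qquad (72.31) $$

or, finally, by (72.16)

$$ \frac{dT_a}{dt} = \frac12\,{\sum_b}'\,\gamma M_b\,\frac{dI_{ik}^{(a)}}{dt}\,\frac{\partial^2}{\partial a_i\,\partial a_k}\frac{1}{|\mathbf{a} - \mathbf{b}|} \qquad (72.32) $$

We recall that the expression (71.32) for the potential energy was

$$ \Phi = -\frac12\sum_{\substack{a,b \\ (a \neq b)}}\Bigl\{\frac{\gamma M_a M_b}{|\mathbf{a} - \mathbf{b}|} + \frac12\,\gamma\bigl(M_a I_{ik}^{(b)} + M_b I_{ik}^{(a)}\bigr)\frac{\partial^2}{\partial a_i\,\partial a_k}\frac{1}{|\mathbf{a} - \mathbf{b}|}\Bigr\} \qquad (72.33) $$

The quantity $\Phi$ depends on time through the coordinates $a_i$ and through the
moments of inertia $I_{ik}^{(a)}$. This latter dependence leads to the following terms in
the total derivative of $\Phi$ with respect to time:

$$ \sum_a \frac{\partial\Phi}{\partial I_{ik}^{(a)}}\frac{dI_{ik}^{(a)}}{dt} = -\sum_a \frac{dT_a}{dt} \qquad (72.34) $$

this may be seen by comparing (72.33) and (72.32). Hence using the equations of
motion for the mass centres we get

$$ \frac{d}{dt}\Bigl\{\sum_a\bigl(\tfrac12 M_a\dot a_i^2 + T_a\bigr) + \Phi\Bigr\} = 0 \qquad (72.35) $$

and, therefore, we have the law of conservation of energy in the form

$$ \sum_a\bigl(\tfrac12 M_a\dot a_i^2 + T_a\bigr) + \Phi = E \qquad (72.36) $$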

where $E$ is the energy constant. One should note that even for a very fast
rotation of the bodies, when the linear velocities of the translational and the
rotational motions are of the same order, the rotational terms in the equation
of energy balance (72.34) still remain smaller by a factor $L/R$ than the main
terms, while in the integrated equation (72.36) they are of the same order as
the main term.


73. The Internal Structure of a Body. Liapunov’s Equation 

At the end of Section 70 we found the second approximation to the equations 
for translational motion in integral form. We now write them down in more 
detail. By inserting the components of the mass tensor of an elastic body (70.26) 
into the equations of motion (70.23) we obtain 





1 + (l« 2 + II + 317) 


— -o PikVicHdx) 


(a) 

i nw, ev t \ 

+J te - 


(73.01) 


According to (68.30) the quantity U* appearing here satisfies the equation 

At/ * - = - 4 *t{p + £ (i* 2 + 11 - V) - (73.02) 

In first approximation U* is equal to the Newtonian potential U, but we require 
U* in second approximation, including retardation corrections and including 
terms arising from the additional terms on the right of (73.02). On the other 
hand it is sufficient to know the first approximation to the quantities Ui which 
satisfy the equations 

$$ \Delta U_i = -4\pi\gamma\rho v_i \qquad (73.03) $$


We have pointed out in Section 71 that in order to obtain from (73.01) the 
explicit form of the relativistic equations of motion it is necessary to consider 
the internal structure of the body in the Newtonian approximation described 
by the equations (71.03) 


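$$ \rho\Bigl(\frac{\partial v_i}{\partial t} + v_k\frac{\partial v_i}{\partial x_k}\Bigr) - \rho\,\frac{\partial U}{\partial x_i} = \frac{\partial p_{ik}}{\partial x_k} \qquad (73.04) $$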


We confine ourselves to the study of those cases in which the body rotates as 
a whole, in the manner of a rigid body. Then (see (71.37)) the distribution of 
velocities within the body has the form 

$$ v_i = \dot a_i + \omega_{ji}(x_j - a_j) \qquad (73.05) $$


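(We again take the label (a) on $\omega_{ji}$ for granted.) Hence the acceleration of a
particle in the body

$$ w_i = \frac{\partial v_i}{\partial t} + v_k\frac{\partial v_i}{\partial x_k} \qquad (73.06) $$

is given by

$$ w_i = \ddot a_i + (\dot\omega_{ji} - \omega_{ik}\omega_{jk})(x_j - a_j) \qquad (73.07) $$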
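
The bracketed part of (73.07) is the familiar rigid-body acceleration written in tensor form. A small sketch, with hypothetical names and arbitrary sample numbers, compares it with the vector expression $\dot{\boldsymbol\omega} \times \boldsymbol\xi + \boldsymbol\omega \times (\boldsymbol\omega \times \boldsymbol\xi)$, where $\xi_j = x_j - a_j$ and the components of $\boldsymbol\omega$ are assigned as in (71.38).

```python
def omega_tensor(w):
    """Antisymmetric tensor with omega_23 = w1, omega_31 = w2, omega_12 = w3 (71.38)."""
    w1, w2, w3 = w
    return [[0.0, w3, -w2],
            [-w3, 0.0, w1],
            [w2, -w1, 0.0]]


def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]


def relative_acceleration(om, om_dot, xi):
    """(d omega_ji/dt - omega_ik omega_jk) * xi_j, the bracketed part of (73.07)."""
    return [sum((om_dot[j][i] - sum(om[i][k] * om[j][k] for k in range(3))) * xi[j]
                for j in range(3)) for i in range(3)]


# Assumed sample values for the angular velocity, its rate of change and the offset.
w = [0.2, -0.4, 0.9]
w_dot = [0.05, 0.1, -0.3]
xi = [1.0, 2.0, -0.5]

tensor_form = relative_acceleration(omega_tensor(w), omega_tensor(w_dot), xi)
centripetal = cross(w, cross(w, xi))
tangential = cross(w_dot, xi)
vector_form = [tangential[i] + centripetal[i] for i in range(3)]
print(tensor_form, vector_form)
```

The term $-\omega_{ik}\omega_{jk}(x_j - a_j)$ is thus the centripetal acceleration, which is why its negative later appears as the gradient of a centrifugal potential.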


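We again split the Newtonian potential $U$ appearing in (73.04) into an internal
and an external potential, $u_a$ and $U^{(a)}$ respectively, and replace the latter by the
first terms in its Taylor expansion about the point $x_j = a_j$. We also insert the
expression (73.07) for the acceleration $w_i$. Then we get

$$ \rho\,\ddot a_i - \rho\Bigl(\frac{\partial U^{(a)}}{\partial x_i}\Bigr)_a + \rho\Bigl[\dot\omega_{ji} - \omega_{ik}\omega_{jk} - \Bigl(\frac{\partial^2 U^{(a)}}{\partial x_i\,\partial x_j}\Bigr)_a\Bigr](x_j - a_j) - \rho\,\frac{\partial u_a}{\partial x_i} = \frac{\partial p_{ik}}{\partial x_k} \qquad (73.08) $$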


We now estimate the order of magnitude of the various terms, retaining 
only the leading ones. A simple consequence of the Newtonian equations of 
motion (71.35) is that 

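$$ \ddot a_i \sim \frac{q^2}{R} \qquad (73.09) $$

The quantity $(\partial U^{(a)}/\partial x_i)_a$ in the second term of (73.08) will be of the same
order as $\ddot a_i$, but the difference of the two quantities will be small. A comparison
of (71.21) and (72.33) shows that

$$ \ddot a_i - \Bigl(\frac{\partial U^{(a)}}{\partial x_i}\Bigr)_a \sim \frac{q^2}{R}\,\frac{L^2}{R^2} \qquad (73.10) $$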


(For almost spherical bodies this difference will be even smaller because its 
value is determined by differences of moments of inertia and not by the moments 
themselves.) We can therefore take the first two terms of (73.08) to cancel. 
We now proceed to estimate the terms in the square brackets. Here the main
term is $\omega_{ik}\omega_{jk}$; it has the order of magnitude of the angular velocity squared.
We assume that the order of the angular velocity is given by

$$ \omega \sim \frac{q}{L} \qquad (73.11) $$

The order of magnitude of the time derivative of the angular velocity is determined
by the rate of change of the angular momentum. It is easy to obtain
the estimate

$$ \dot\omega \sim \frac{q^2}{R^2} \qquad (73.12) $$

Comparing this with the estimate for $\omega$ we get

$$ \dot\omega \sim \omega^2\,\frac{L^2}{R^2} \qquad (73.13) $$

Further, the second derivatives of the external potential with respect to the
coordinates will be of the following order:

$$ \Bigl(\frac{\partial^2 U^{(a)}}{\partial x_i\,\partial x_j}\Bigr)_a \sim \frac{q^2}{R^2} \qquad (73.14) $$

i.e. of the same order as $\dot\omega$. Neglecting terms of this order† we retain in the
square brackets only the term $\omega_{ik}\omega_{jk}$. With these simplifications we can write
the internal equations of motion (73.08) in the form

$$ \rho\Bigl\{\frac{\partial u_a}{\partial x_i} + \omega_{ik}\omega_{jk}(x_j - a_j)\Bigr\} + \frac{\partial p_{ik}}{\partial x_k} = 0 \qquad (73.15) $$

† This approximation is made only in the equations of the internal problem (see (80.13)).

Let us put 

$$ u_a + \tfrac12\,\omega_{ik}\omega_{jk}(x_i - a_i)(x_j - a_j) = V_a \qquad (73.16) $$

This is the sum of the gravitational potential and the potential of the centrifugal 
force. Equation (73.15) may be written as 


$$ \rho\,\frac{\partial V_a}{\partial x_i} + \frac{\partial p_{ik}}{\partial x_k} = 0 \qquad (73.17) $$

This equation can be satisfied on the assumption that the stress tensor within
the body reduces to an isotropic pressure $p$:

$$ p_{ik} = -p\,\delta_{ik} \qquad (73.18) $$

(a liquid always satisfies this condition). If we had not neglected the quantities
$\dot\omega_{ji}$ in (73.08) we would have been obliged to consider non-diagonal elements
in the stress tensor. Indeed it is evident that a change of the angular velocity
of an elastic body must give rise to stresses that do not reduce to an isotropic
pressure.

Under the condition (73.18) equation (73.17) reduces to the following 


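$$ \rho\,\frac{\partial V_a}{\partial x_i} = \frac{\partial p}{\partial x_i} \qquad (73.19) $$

The equation

$$ \frac{\partial\rho}{\partial x_k}\frac{\partial V_a}{\partial x_i} - \frac{\partial\rho}{\partial x_i}\frac{\partial V_a}{\partial x_k} = 0 \qquad (73.20) $$

follows from it and shows that $V_a$ and $\rho$ must be connected by a relation not
involving the coordinates, so that if $\rho$ is a function of one parameter $\alpha$, the
potential $V_a$ must be a function of the same quantity

$$ \rho = \rho(\alpha)\,; \qquad V_a = V_a(\alpha) \qquad (73.21) $$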

The internal potential $u_a$ is a functional of $\rho$; inserting its explicit expression
into (73.16) we obtain

$$ \gamma\int_{(a)} \frac{\rho'\,(dx')^3}{|\mathbf{r} - \mathbf{r}'|} + \tfrac12\,\omega_{ik}\omega_{jk}(x_i - a_i)(x_j - a_j) = V_a \qquad (73.22) $$

Our problem consists in finding the shape of the body, i.e. the shape of the
region of integration (a), for a given density $\rho$. This shape must be such that
condition (73.21) is satisfied at all points within the body, i.e. such that the
value of the left-hand side of (73.22) depends only on the density, or on the
parameter on which the density also depends. In particular, the conditions
$\alpha$ = const. and $V_a$ = const. must hold on the surface of the body.

If the body does not rotate, so that $\omega_{ik} = 0$, all conditions are evidently
satisfied by a spherically symmetric distribution of density over a spherical
body. In the case of rotation ($\omega_{ik} \neq 0$) the problem of finding the shape of the
body is a very difficult mathematical problem which has been studied by
many mathematicians. The most complete results were obtained by A. I.
Liapunov who studied the near-ellipsoidal equilibrium shapes of a rotating
liquid, and also the question of their stability [21, 22]. We shall therefore call
equation (73.22) Liapunov’s equation.

If condition (73.20) or (73.21) is satisfied the pressure p can be found from the 
equation 

$$ dp = \rho\,dV_a \qquad (73.23) $$

or

$$ p = \int \rho\,dV_a \qquad (73.24) $$


We determine the additive constant from the condition that the pressure 
shall vanish on the surface of the body. 

The expression for the mass tensor and also the equations (73.01) and (73.02)
involve the elastic energy per unit mass $\Pi$ which, according to (30.11), is determined
by the equation

$$ \Pi = \int \frac{dp}{\rho} - \frac{p}{\rho} \qquad (73.25) $$

As a result of (73.23) the integral appearing here is just $V_a$ or a quantity differing
from $V_a$ by a constant. This constant can be so determined that one has

$$ p = \rho\,(V_a - \Pi) \qquad (73.26) $$

This expression will be of use in deriving the equations of motion from the 
integral relations (73.01). 
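
To make (73.25) concrete, one may evaluate it for a simple equation of state. The sketch below is an illustration under an assumption that is not made in the text: a power law $p = K\rho^n$ with $n > 1$, for which the integral can be done in closed form and gives $\Pi = K\rho^{n-1}/(n-1)$. The lower limit of the integral is taken at zero density, and the function names are hypothetical.

```python
def elastic_energy(rho, pressure, dpressure, steps=2000):
    """Pi = integral from 0 to rho of dp/rho' minus p/rho, as in (73.25).

    The integral is evaluated with the midpoint rule; `pressure` and
    `dpressure` are p(rho) and dp/d(rho) for the assumed equation of state.
    """
    h = rho / steps
    integral = sum(dpressure((n + 0.5) * h) / ((n + 0.5) * h) for n in range(steps)) * h
    return integral - pressure(rho) / rho


# Assumed power-law equations of state p = K * rho**n.
K = 0.8
pi_n2 = elastic_energy(1.5, lambda r: K * r**2, lambda r: 2.0 * K * r)
pi_n3 = elastic_energy(1.5, lambda r: K * r**3, lambda r: 3.0 * K * r**2)
print(pi_n2, pi_n3)
```

For n = 2 the integral in (73.25) equals $2K\rho$, so that with $V_a$ identified with this integral the relation (73.26) gives $\rho(2K\rho - K\rho) = K\rho^2$, which is the assumed pressure.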


74. Evaluation of Some Integrals that Characterize the Internal 
Structure of a Body 

In order to derive equations of motion from the integral relations (73.01) 
it is necessary to evaluate a series of integrals whose values depend on the 
density distribution inside a body and on its internal structure in general. 
To avoid interrupting our subsequent discussion we shall collect all such
calculations of integrals in the present section.

We begin with integrals that depend on the moments of inertia of the body
and on its angular velocity. We denote by $\Omega_a$ the potential of the centrifugal
forces

$$ \Omega_a = \tfrac12\,\omega_{ik}\omega_{jk}(x_i - a_i)(x_j - a_j) \qquad (74.01) $$

Liapunov’s equation (73.16) then assumes the form

$$ u_a + \Omega_a = V_a \qquad (74.02) $$

We consider the integral

$$ T_a = \int_{(a)} \rho\,\Omega_a\,(dx)^3 \qquad (74.03) $$

We have, evidently,

$$ T_a = \tfrac12\,\omega_{ik}\omega_{jk}I_{ij}^{(a)} \qquad (74.04) $$

so that $T_a$ is the rotational kinetic energy of body (a). We consider also the
first moments weighted by the function $\rho\Omega_a$, i.e. the quantities

$$ T_{ai} = \int_{(a)} \rho\,\Omega_a\,(x_i - a_i)(dx)^3 \qquad (74.05) $$

Using the definition (72.22) we can write

$$ T_{ai} = \tfrac12\,\omega_{jk}\omega_{lk}I_{ijl}^{(a)} \qquad (74.06) $$

These quantities vanish if the body has three planes of symmetry.

We now consider the integral

$$ \varepsilon_a = \frac12\int_{(a)} \rho\,u_a\,(dx)^3 \qquad (74.07) $$

which represents the negative of the energy of mutual attraction of the
constituent particles of the body and we consider also the first moments

$$ \varepsilon_{ai} = \frac12\int_{(a)} \rho\,u_a\,(x_i - a_i)(dx)^3 \qquad (74.08) $$

With the use of Poisson’s equation (71.12) the quantity $\varepsilon_a$ may be represented
in the form

$$ \varepsilon_a = \frac{1}{8\pi\gamma}\int_{(\infty)} (\operatorname{grad} u_a)^2\,(dx)^3 \qquad (74.09) $$

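The moments $\varepsilon_{ai}$ may be similarly transformed as follows:

$$ \varepsilon_{ai} = \frac{1}{8\pi\gamma}\int_{(\infty)} (\operatorname{grad} u_a)^2\,(x_i - a_i)(dx)^3 \qquad (74.10) $$

Remembering the definition (71.16) of the quantities $q_{ik}^{(a)}$

$$ q_{ik}^{(a)} = \tfrac12\,\delta_{ik}\,(\operatorname{grad} u_a)^2 - \frac{\partial u_a}{\partial x_i}\frac{\partial u_a}{\partial x_k} \qquad (74.11) $$

with

$$ q_{kk}^{(a)} = \tfrac12\,(\operatorname{grad} u_a)^2 \qquad (74.12) $$

we can replace (74.09) and (74.10) by

$$ \varepsilon_a = \frac{1}{4\pi\gamma}\int_{(\infty)} q_{kk}^{(a)}\,(dx)^3 \qquad (74.13) $$

and

$$ \varepsilon_{ai} = \frac{1}{4\pi\gamma}\int_{(\infty)} q_{kk}^{(a)}\,(x_i - a_i)(dx)^3 \qquad (74.14) $$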


These integrals are special cases of the more general integrals 

#*? = 4 ^; J <lu( dx ? 


(CO) 


and 


B[ a h = 4 ^ J (*< ~ 


(CO) 


In addition to the integrals discussed above, 


(74.10) 

(74.11) 

(74.12) 

(74.13) 

(74.14) 

(74.15) 

(74.16) 

(74.17) 


other useful integrals may be expressed in terms of the quantities (74.15) and 
(74.16). 
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W e consider the volume integral of the pressure p 

I = J p(dx) 3 


(74.18) 


(.a) 


Since the pressure vanishes outside the mass we obtain by partial integration

$$3I = -\int_{(a)} (x_i - a_i)\,\frac{\partial p}{\partial x_i}\,(dx)^3 \tag{74.19}$$

Using the relations (73.19) we can also write

$$3I = -\int_{(a)} \rho\,(x_i - a_i)\,\frac{\partial V_a}{\partial x_i}\,(dx)^3 \tag{74.20}$$

However, in view of the fact that $\Omega_a$ is a homogeneous quadratic function of
the differences $x_i - a_i$, it follows from Liapunov's equation that

$$(x_i - a_i)\,\frac{\partial V_a}{\partial x_i} = (x_i - a_i)\,\frac{\partial u_a}{\partial x_i} + 2\,\Omega_a \tag{74.21}$$

On the other hand we have by (71.18)

$$\rho\,\frac{\partial u_a}{\partial x_i} = \frac{1}{4\pi\gamma}\,\frac{\partial q^{(a)}_{ik}}{\partial x_k} \tag{74.22}$$

Inserting (74.21) into (74.20) and using (74.22) we get

$$3I = -\frac{1}{4\pi\gamma}\int_{(\infty)} (x_i - a_i)\,\frac{\partial q^{(a)}_{ik}}{\partial x_k}\,(dx)^3 - 2\int_{(a)} \rho\,\Omega_a\,(dx)^3 \tag{74.23}$$

Integrating by parts and using the expressions (74.03) and (74.13) for $T_a$ and
$\varepsilon_a$ we finally get

$$3\int_{(a)} p\,(dx)^3 = \varepsilon_a - 2\,T_a \tag{74.24}$$

This relation shows that when the body rotates the mean pressure within it
is less than in the absence of rotation, as is to be expected.
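As a rough numerical illustration of (74.24), the sketch below checks the relation for a non-rotating sphere of uniform density, for which $T_a = 0$. The uniform-density model, the closed forms used for the interior potential and the hydrostatic pressure, the unit values and the helper name `shell_integral` are assumptions made only for this example; they are not taken from the text.

```python
import math

def shell_integral(f, radius, n=20000):
    """Midpoint-rule estimate of the volume integral of f(r) over a ball."""
    h = radius / n
    total = 0.0
    for j in range(n):
        r = (j + 0.5) * h
        total += f(r) * 4.0 * math.pi * r * r * h
    return total

# Illustrative units (assumption): gamma = rho = R = 1.
gamma, rho, R = 1.0, 1.0, 1.0

def potential(r):
    """Newtonian potential u_a inside a uniform sphere (positive sign convention)."""
    return 2.0 * math.pi * gamma * rho * (R * R - r * r / 3.0)

def pressure(r):
    """Hydrostatic pressure of the uniform sphere, vanishing at the surface."""
    return (2.0 / 3.0) * math.pi * gamma * rho ** 2 * (R * R - r * r)

# eps_a = (1/2) * integral of rho * u_a, as in (74.07)
eps_a = 0.5 * shell_integral(lambda r: rho * potential(r), R)
# left-hand side of (74.24) with T_a = 0
three_I = 3.0 * shell_integral(pressure, R)

print(eps_a, three_I)
```

Both numbers should agree to within the quadrature error; a rotating configuration would require a model for $\Omega_a$ and is not attempted here.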

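Similarly we get the relation

$$2\int_{(a)} p\,(x_i - a_i)\,(dx)^3 = \eta_{ai} - T_{ai} \tag{74.25}$$

where

$$\eta_{ai} = \frac{1}{8\pi\gamma}\int_{(\infty)} \left\{(x_k - a_k)\,q^{(a)}_{ik} + (x_i - a_i)\,q^{(a)}_{kk}\right\}(dx)^3 \tag{74.26}$$

or, in the notation of (74.16),

$$\eta_{ai} = \tfrac{1}{2}\left(B^{(a)}_{k,ik} + B^{(a)}_{i,kk}\right) \tag{74.27}$$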

Equations (74.15) and (74.16) give a representation of $B^{(a)}_{ik}$ and $B^{(a)}_{i,jk}$ as
integrals over all space, but they may also be represented as integrals extended
only over the volume occupied by the mass (a). To make the transformation
to this form we introduce a function $w_a$ which is defined by the equation

$$w_a = \tfrac{1}{2}\,\gamma\int_{(a)} \rho'\,|\mathbf{r} - \mathbf{r}'|\,(dx')^3 \tag{74.28}$$

by virtue of which we have

$$\Delta w_a = u_a \tag{74.29}$$

It is then not difficult to prove the equation

$$\frac{1}{4\pi\gamma}\int_{(\infty)} \frac{\partial u_a}{\partial x_i}\,\frac{\partial u_a}{\partial x_k}\,(dx)^3 = \int_{(a)} \rho\,\frac{\partial^2 w_a}{\partial x_i\,\partial x_k}\,(dx)^3 \tag{74.30}$$

whence, by the use of (74.11), we get

$$B^{(a)}_{ik} = \int_{(a)} \rho\left(\tfrac{1}{2}\,\delta_{ik}\,u_a - \frac{\partial^2 w_a}{\partial x_i\,\partial x_k}\right)(dx)^3 \tag{74.31}$$

Here the integral extends only over the volume of the mass (a).

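Inserting the integral representation (74.28) for $w_a$ into (74.31) and
performing the differentiations we are able to write

$$B^{(a)}_{ik} = \tfrac{1}{2}\,\gamma\int_{(a)}\!\int_{(a)} \rho\,\rho'\,\frac{(x_i - x_i')(x_k - x_k')}{|\mathbf{r} - \mathbf{r}'|^3}\,(dx)^3\,(dx')^3 \tag{74.32}$$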
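The step from (74.31) to (74.32) rests on the second derivatives of the distance, $\partial^2 |\mathbf{r}| / \partial x_i \partial x_k = \delta_{ik}/|\mathbf{r}| - x_i x_k/|\mathbf{r}|^3$, whose trace $2/|\mathbf{r}|$ also gives (74.29). The following sketch compares this closed form with central finite differences at one sample point. The sample point, the step size and the helper names are choices made for this illustration only.

```python
import math

def dist(x):
    """Euclidean length of a 3-vector."""
    return math.sqrt(sum(c * c for c in x))

def hessian_fd(f, x, i, k, h=1e-3):
    """Central-difference estimate of d2f/dx_i dx_k at the point x."""
    def shifted(di, dk):
        y = list(x)
        y[i] += di
        y[k] += dk
        return f(y)
    return (shifted(h, h) - shifted(h, -h)
            - shifted(-h, h) + shifted(-h, -h)) / (4.0 * h * h)

x = [1.0, 2.0, 2.0]   # sample point with |x| = 3
r = dist(x)

max_err = 0.0
laplacian = 0.0
for i in range(3):
    for k in range(3):
        exact = (1.0 if i == k else 0.0) / r - x[i] * x[k] / r ** 3
        approx = hessian_fd(dist, x, i, k)
        max_err = max(max_err, abs(approx - exact))
        if i == k:
            laplacian += approx

print(max_err, laplacian, 2.0 / r)
```

The trace of the numerical matrix approximates $2/|\mathbf{r}|$, which is the property that makes $w_a$ a solution of $\Delta w_a = u_a$.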

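In analogy with equation (74.31), which may be written as

$$\int_{(a)} \rho\,\frac{\partial^2 w_a}{\partial x_i\,\partial x_k}\,(dx)^3 = \delta_{ik}\,\varepsilon_a - B^{(a)}_{ik} \tag{74.33}$$

one can also prove the relation

$$\int_{(a)} \rho\,(x_j - a_j)\,\frac{\partial^2 w_a}{\partial x_i\,\partial x_k}\,(dx)^3 = \delta_{ik}\,\varepsilon_{aj} - B^{(a)}_{j,ik} \tag{74.34}$$

These relations will also be needed in the following.

In conclusion we note that as a result of (74.22) and of the equations of
motion (73.04) the quantity $q^{(a)}_{ik}$ divided by $4\pi\gamma$ may be interpreted, within
the framework of Newtonian theory, as the stress tensor of the gravitational
field of the mass (a).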


75. Transformation of the Integral Form of the Equations of Motion 

We take as our starting point the integral form of the equations of motion 
that was given at the beginning of Section 73. We restate these equations 
taking into account that by (73.18) the stresses reduce to an isotropic pressure 
p. We have 



with 


(75.02) 



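and

$$\Delta U_i = -4\pi\gamma\,\rho\,v_i \tag{75.03}$$

Putting for brevity

$$\sigma = \rho + \frac{\rho}{c^2}\left(\tfrac{3}{2}\,v^2 + \Pi - U\right) + \frac{3p}{c^2} \tag{75.04}$$

we can write (75.02) as

$$\Delta U^* - \frac{1}{c^2}\,\frac{\partial^2 U^*}{\partial t^2} = -4\pi\gamma\,\sigma \tag{75.05}$$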


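Neglecting small quantities we get from (75.01)

$$\frac{d}{dt}\int_{(a)} \left\{\rho v_i\left[1 + \frac{1}{c^2}\left(\tfrac{1}{2}v^2 + \Pi + 3U\right)\right] + \frac{1}{c^2}\,p\,v_i\right\}(dx)^3 = \int_{(a)} \frac{\partial U^*}{\partial x_i}\,\sigma\,(dx)^3 + \frac{4}{c^2}\int_{(a)} \rho\left(\frac{\partial U_i}{\partial t} + v_k\,\frac{\partial U_i}{\partial x_k}\right)(dx)^3 - \frac{4}{c^2}\int_{(a)} \rho\,v_k\,\frac{\partial U_k}{\partial x_i}\,(dx)^3 \tag{75.06}$$

We note first that

$$\int_{(a)} \rho\left(\frac{\partial U_i}{\partial t} + v_k\,\frac{\partial U_i}{\partial x_k}\right)(dx)^3 = \frac{d}{dt}\int_{(a)} \rho\,U_i\,(dx)^3 \tag{75.07}$$

so that this term may be taken over on to the left-hand side and combined
with the other terms that are differentiated with respect to time.

We then consider the first term on the right of (75.06). It is a generalization
of the expression (71.11) evaluated in Section 71 which referred to the
Newtonian approximation. As in (71.10) we split the potential $U^*$ into an internal and
an external part

$$U^* = u_a^* + U^{*(a)} \tag{75.08}$$

and then have

$$\int_{(a)} \frac{\partial U^*}{\partial x_i}\,\sigma\,(dx)^3 = \int_{(a)} \frac{\partial u_a^*}{\partial x_i}\,\sigma\,(dx)^3 + \int_{(a)} \frac{\partial U^{*(a)}}{\partial x_i}\,\sigma\,(dx)^3 \tag{75.09}$$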

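In Newtonian approximation the first term on the right would vanish by
(71.19), being the resultant of the internal gravitational forces. But in
relativistic approximation, when Poisson's equation has to be replaced by (75.05),
this is no longer so owing to the retardation. We insert into our integral the
value for $\sigma$ given by the relation

$$\Delta u_a^* - \frac{1}{c^2}\,\frac{\partial^2 u_a^*}{\partial t^2} = -4\pi\gamma\,\sigma \tag{75.10}$$

which holds within the mass (a) and we use the fact that the integral of the
term involving the Laplace operator vanishes. This gives

$$\int_{(a)} \frac{\partial u_a^*}{\partial x_i}\,\sigma\,(dx)^3 = \frac{1}{4\pi\gamma c^2}\int_{(\infty)} \frac{\partial u_a^*}{\partial x_i}\,\frac{\partial^2 u_a^*}{\partial t^2}\,(dx)^3 \tag{75.11}$$

or

$$\int_{(a)} \frac{\partial u_a^*}{\partial x_i}\,\sigma\,(dx)^3 = \frac{1}{4\pi\gamma c^2}\,\frac{d}{dt}\int_{(\infty)} \frac{\partial u_a^*}{\partial x_i}\,\frac{\partial u_a^*}{\partial t}\,(dx)^3 \tag{75.12}$$

because

$$\int_{(\infty)} \frac{\partial^2 u_a^*}{\partial x_i\,\partial t}\,\frac{\partial u_a^*}{\partial t}\,(dx)^3 = 0 \tag{75.13}$$

Owing to the smallness of the factor outside the integral we can replace the
quantity $u_a^*$ on the right-hand side by the Newtonian potential $u_a$; equation
(75.12) then becomes

$$\int_{(a)} \frac{\partial u_a^*}{\partial x_i}\,\sigma\,(dx)^3 = \frac{1}{4\pi\gamma c^2}\,\frac{d}{dt}\int_{(\infty)} \frac{\partial u_a}{\partial x_i}\,\frac{\partial u_a}{\partial t}\,(dx)^3 \tag{75.14}$$

Following (74.28) we introduce the quantity $w_a$ related to $u_a$ by

$$\Delta w_a = u_a \tag{75.15}$$

This allows us to transform the integral on the right-hand side of (75.14) and
to write that equation as

$$\int_{(a)} \frac{\partial u_a^*}{\partial x_i}\,\sigma\,(dx)^3 = \frac{1}{c^2}\,\frac{d}{dt}\int_{(a)} \rho\,\frac{\partial^2 w_a}{\partial x_i\,\partial t}\,(dx)^3 \tag{75.16}$$

The last relation could have been derived more directly by inserting the
approximate solution of (75.10) into the integral on the left-hand side.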

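Thus the first term on the right-hand side of (75.06) may be written as

$$\int_{(a)} \frac{\partial U^*}{\partial x_i}\,\sigma\,(dx)^3 = \int_{(a)} \frac{\partial U^{*(a)}}{\partial x_i}\,\sigma\,(dx)^3 + \frac{1}{c^2}\,\frac{d}{dt}\int_{(a)} \rho\,\frac{\partial^2 w_a}{\partial x_i\,\partial t}\,(dx)^3 \tag{75.17}$$

We make yet another transformation. We introduce the function

$$W = \tfrac{1}{2}\,\gamma\int_{(\infty)} \rho'\,|\mathbf{r} - \mathbf{r}'|\,(dx')^3 \tag{75.18}$$

which is a solution of the equation

$$\Delta W = U \tag{75.19}$$

$U$ being the Newtonian potential. It is evident from a comparison of (74.28)
and (75.18) that

$$W = \sum_a w_a \tag{75.20}$$

just as the Newtonian potential $U$ is the sum of the potentials of the separate
masses

$$U = \sum_a u_a \tag{75.21}$$

The analogue of the splitting (71.10) of the Newtonian potential into an internal
and an external part is

$$W = w_a + W^{(a)} \tag{75.22}$$

Using this, we can replace (75.17) by

$$\int_{(a)} \frac{\partial U^*}{\partial x_i}\,\sigma\,(dx)^3 = \int_{(a)} \frac{\partial U^{*(a)}}{\partial x_i}\,\sigma\,(dx)^3 - \frac{1}{c^2}\,\frac{d}{dt}\int_{(a)} \rho\,\frac{\partial^2 W^{(a)}}{\partial x_i\,\partial t}\,(dx)^3 + \frac{1}{c^2}\,\frac{d}{dt}\int_{(a)} \rho\,\frac{\partial^2 W}{\partial x_i\,\partial t}\,(dx)^3 \tag{75.23}$$

Repeating the arguments that led us to equation (75.16), now applied to the
total mass, we obtain the relation

$$\int_{(\infty)} \frac{\partial U^*}{\partial x_i}\,\sigma\,(dx)^3 = \frac{1}{c^2}\,\frac{d}{dt}\int_{(\infty)} \rho\,\frac{\partial^2 W}{\partial x_i\,\partial t}\,(dx)^3 \tag{75.24}$$

whence it follows that

$$\sum_a \left\{\int_{(a)} \frac{\partial U^{*(a)}}{\partial x_i}\,\sigma\,(dx)^3 - \frac{1}{c^2}\,\frac{d}{dt}\int_{(a)} \rho\,\frac{\partial^2 W^{(a)}}{\partial x_i\,\partial t}\,(dx)^3\right\} = 0 \tag{75.25}$$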

We now consider the last term in (75.06). We divide the "vector potential"
$U_k$ into an internal and an external part

$$U_k = u_{ak} + U_k^{(a)} \tag{75.26}$$

and conclude as before (see Section 71) that by virtue of Poisson's equation
(75.03) we have

$$\int_{(a)} \rho\,v_k\,\frac{\partial u_{ak}}{\partial x_i}\,(dx)^3 = 0 \tag{75.27}$$

and that therefore the last term in (75.06) will be equal to

$$-\frac{4}{c^2}\int_{(a)} \rho\,v_k\,\frac{\partial U_k^{(a)}}{\partial x_i}\,(dx)^3 \tag{75.28}$$

We also have

$$\int_{(\infty)} \rho\,v_k\,\frac{\partial U_k}{\partial x_i}\,(dx)^3 = 0 \tag{75.29}$$


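We have discussed all the terms on the right-hand side of (75.06). Of these
terms we transfer (75.07) and the last term on the right of (75.23), both of the
form of time derivatives, to the left-hand side; then the equations of motion
(75.06) assume the form

$$\frac{dP_{ai}}{dt} = F_{ai} \tag{75.30}$$

where

$$P_{ai} = \int_{(a)} \left\{\rho v_i\left[1 + \frac{1}{c^2}\left(\tfrac{1}{2}v^2 + \Pi + 3U\right)\right] + \frac{1}{c^2}\,p\,v_i - \frac{4}{c^2}\,\rho\,U_i - \frac{1}{c^2}\,\rho\,\frac{\partial^2 W}{\partial x_i\,\partial t}\right\}(dx)^3 \tag{75.31}$$

and

$$F_{ai} = \int_{(a)} \frac{\partial U^{*(a)}}{\partial x_i}\,\sigma\,(dx)^3 - \frac{1}{c^2}\,\frac{d}{dt}\int_{(a)} \rho\,\frac{\partial^2 W^{(a)}}{\partial x_i\,\partial t}\,(dx)^3 - \frac{4}{c^2}\int_{(a)} \rho\,v_k\,\frac{\partial U_k^{(a)}}{\partial x_i}\,(dx)^3 \tag{75.32}$$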

As a consequence of equations (75.25) to (75.29) we have

$$\sum_a F_{ai} = 0 \tag{75.33}$$

and therefore

$$\sum_a P_{ai} = P_i = \text{const.} \tag{75.34}$$

By analogy with Newtonian mechanics the quantity $P_{ai}$ may be interpreted
as a component of the momentum of the mass (a). Then the quantity $F_{ai}$
will be the component of the force acting on that mass. Such an interpretation
is very natural in Newtonian approximation, but here it is somewhat artificial
because according to (75.31) the momentum $P_{ai}$ then depends not only on the
internal structure of the body and its velocity, but also on the potentials $U$,
$U_i$ and $W$.

We shall now separate off those terms of $P_{ai}$ in (75.31) which depend only
on the internal structure.

Within the mass (a) the quantities $u_a$ and $u_{ai}$ satisfy the equations

$$\Delta u_a = -4\pi\gamma\,\rho\,; \qquad \Delta u_{ai} = -4\pi\gamma\,\rho\,v_i \tag{75.35}$$

Hence it follows that

$$\int_{(a)} \rho\,v_i\,u_a\,(dx)^3 = \int_{(a)} \rho\,u_{ai}\,(dx)^3 \tag{75.36}$$

Using this relation we can express $P_{ai}$ as

$$P_{ai} = \int_{(a)} \left\{\rho v_i + \frac{1}{c^2}\,v_i\left(\tfrac{1}{2}\rho v^2 + \rho\Pi - \rho u_a + p\right) - \frac{1}{c^2}\,\rho\,\frac{\partial^2 w_a}{\partial x_i\,\partial t}\right\}(dx)^3 + \frac{3}{c^2}\int_{(a)} \rho\,v_i\,U^{(a)}\,(dx)^3 - \frac{4}{c^2}\int_{(a)} \rho\,U_i^{(a)}\,(dx)^3 - \frac{1}{c^2}\int_{(a)} \rho\,\frac{\partial^2 W^{(a)}}{\partial x_i\,\partial t}\,(dx)^3 \tag{75.37}$$

Here the first integral depends only on the motion and the internal structure
of body (a) whereas the remaining three terms depend also on the potentials
of the external field, i.e. on the interaction of (a) with the other bodies. We
shall deal with the evaluation of these integrals in the following section.

76. Evaluation of the Momentum in Second Approximation 

In the preceding section we reduced the equations of motion to the form

$$\frac{dP_{ai}}{dt} = F_{ai} \tag{76.01}$$

where the momentum $P_{ai}$ and the force $F_{ai}$ were expressed as the integrals
(75.31) and (75.32). We must now evaluate these integrals and express them
in terms of the parameters that characterize the motion of the bodies as a whole.

In order to evaluate $P_{ai}$ we use its representation in the form (75.37). We
split the momentum into the part intrinsic or "proper" to the body and the
part arising from the interaction, writing

$$P_{ai} = (P_{ai})_{\text{prop}} + (P_{ai})_{\text{inter}} \tag{76.02}$$

with

$$(P_{ai})_{\text{prop}} = \int_{(a)} \rho\,v_i\,(dx)^3 + \frac{1}{c^2}\int_{(a)} v_i\left(\tfrac{1}{2}\rho v^2 + \rho\Pi - \rho u_a + p\right)(dx)^3 - \frac{1}{c^2}\int_{(a)} \rho\,\frac{\partial^2 w_a}{\partial x_i\,\partial t}\,(dx)^3 \tag{76.03}$$

and

$$(P_{ai})_{\text{inter}} = \frac{3}{c^2}\int_{(a)} \rho\,v_i\,U^{(a)}\,(dx)^3 - \frac{4}{c^2}\int_{(a)} \rho\,U_i^{(a)}\,(dx)^3 - \frac{1}{c^2}\int_{(a)} \rho\,\frac{\partial^2 W^{(a)}}{\partial x_i\,\partial t}\,(dx)^3 \tag{76.04}$$

We first calculate the integrals appearing in (76.03). The first gives the
momentum in Newtonian approximation; it is

$$\int_{(a)} \rho\,v_i\,(dx)^3 = M_a\,\dot a_i \tag{76.05}$$

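in agreement with (71.08). In the evaluation of the second integral we utilize
the relation

$$\rho\Pi - \rho u_a + p = \rho\,\Omega_a \tag{76.06}$$

which follows from (73.26) and (74.02) and also the relation

$$\tfrac{1}{2}v^2 = \tfrac{1}{2}\dot{\mathbf a}^2 + \dot a_k\,\omega^{(a)}_{jk}\,(x_j - a_j) + \Omega_a \tag{76.07}$$

which is a consequence of the equation

$$v_i = \dot a_i + \omega^{(a)}_{ji}\,(x_j - a_j) \tag{76.08}$$

for the velocity within the body (a). The second integral may be seen to be equal to

$$\int_{(a)} \rho\,v_i\left(\tfrac{1}{2}v^2 + \Omega_a\right)(dx)^3 = \left(\tfrac{1}{2}M_a\dot{\mathbf a}^2\right)\dot a_i + 2\,T_a\,\dot a_i + \omega^{(a)}_{ji}\,\omega^{(a)}_{lk}\,I^{(a)}_{jl}\,\dot a_k + 2\,\omega^{(a)}_{ji}\,T_{aj} \tag{76.09}$$

with the notation of (74.03) and (74.05). The last integral in (76.03) may be
transformed as follows:

$$-\int_{(a)} \rho\,\frac{\partial^2 w_a}{\partial x_i\,\partial t}\,(dx)^3 = \int_{(a)} \rho\,v_k\,\frac{\partial^2 w_a}{\partial x_i\,\partial x_k}\,(dx)^3 \tag{76.10}$$

With the use of (74.33) and (74.34) we then get

$$-\int_{(a)} \rho\,\frac{\partial^2 w_a}{\partial x_i\,\partial t}\,(dx)^3 = \varepsilon_a\,\dot a_i - B^{(a)}_{ik}\,\dot a_k + \omega^{(a)}_{ji}\,\varepsilon_{aj} - \omega^{(a)}_{jk}\,B^{(a)}_{j,ik} \tag{76.11}$$

We collect all three terms and introduce the notation

$$Z^{(a)}_{ik} = (2\,T_a + \varepsilon_a)\,\delta_{ik} + \omega^{(a)}_{ji}\,\omega^{(a)}_{lk}\,I^{(a)}_{jl} - B^{(a)}_{ik} \tag{76.12}$$

and

$$Z^{(a)}_{i} = \omega^{(a)}_{ji}\,(2\,T_{aj} + \varepsilon_{aj}) - \omega^{(a)}_{jk}\,B^{(a)}_{j,ik} \tag{76.13}$$

This gives for the intrinsic part of the momentum

$$(P_{ai})_{\text{prop}} = M_a\,\dot a_i + \frac{1}{c^2}\left(\tfrac{1}{2}M_a\dot{\mathbf a}^2\right)\dot a_i + \frac{1}{c^2}\left(Z^{(a)}_{ik}\,\dot a_k + Z^{(a)}_{i}\right) \tag{76.14}$$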

Here the first term represents the Newtonian expression for the momentum, 
as we have already noted. The second term gives a correction well-known from
point mechanics. The last term may be interpreted in terms of the notion of 
a tensor of effective mass, which is the matrix occurring when the components 
of momentum are expressed as linear functions of the components of velocity. 

If we put

$$K = \sum_a \left(\tfrac{1}{2}M_a\dot{\mathbf a}^2 + T_a\right) + \frac{1}{c^2}\sum_a \left[\tfrac{1}{8}M_a(\dot{\mathbf a}^2)^2 + \tfrac{1}{2}Z^{(a)}_{ik}\,\dot a_i\,\dot a_k + Z^{(a)}_{i}\,\dot a_i\right] \tag{76.15}$$

we evidently have

$$(P_{ai})_{\text{prop}} = \frac{\partial K}{\partial \dot a_i} \tag{76.16}$$

This equation determines the quantity $K$ apart from a function independent
of the $\dot a_i$. In (76.15) we have included a term $\sum T_a$ in order to ensure that in
non-relativistic approximation $K$ should go over into the usual expression
for the kinetic energy of the system of bodies.

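We note that for non-rotating bodies with spherical symmetry we have

$$B^{(a)}_{ik} = \tfrac{1}{3}\,\delta_{ik}\,\varepsilon_a\,; \qquad Z^{(a)}_{ik} = \tfrac{2}{3}\,\delta_{ik}\,\varepsilon_a\,; \qquad Z^{(a)}_{i} = 0 \tag{76.17}$$

Therefore, if we introduce the effective mass

$$m_a = M_a + \frac{2}{3c^2}\,\varepsilon_a \tag{76.18}$$

we shall have, apart from small corrections,

$$K = \sum_a \tfrac{1}{2}\,m_a\,\dot{\mathbf a}^2 + \frac{1}{8c^2}\sum_a m_a\,(\dot{\mathbf a}^2)^2 \tag{76.19}$$

just as for a set of mass points. Equation (76.18) shows that in this case the
effective mass tensor reduces to a scalar.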

We now go on to the evaluation of that part of the momentum which depends 
on the interaction. We begin by estimating the order of magnitude of this 
part. It is readily seen that all three terms in (76.04) will be of the same order 
of magnitude, namely 

= (76.20) 

where q is the quantity already used on several occasions which is of the order 
of the velocities in question. When calculating the integrals in (76.04) we retain 
not only the leading terms, which are of the order of magnitude just mentioned, 
but also terms of order $L/R$ in relation to the main terms; terms of higher order
in L/R will be discarded. 

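To calculate the integrals to the required accuracy it is sufficient to retain
only the first term in the expression (71.27) for the Newtonian potential, so
that we take

$$U^{(a)}(\mathbf r) = \sum_{b\,(b\neq a)} \frac{\gamma M_b}{|\mathbf r - \mathbf b|} \tag{76.21}$$

In the expression

$$U_i^{(a)}(\mathbf r) = \sum_{b\,(b\neq a)} \gamma\int_{(b)} \frac{\rho'\,v_i'}{|\mathbf r - \mathbf r'|}\,(dx')^3 \tag{76.22}$$

for the vector potential, on the other hand, we must retain not only the main
term but one more. Using (71.25) we get

$$U_i^{(a)}(\mathbf r) = \sum_{b\,(b\neq a)} \frac{\gamma M_b\,\dot b_i}{|\mathbf r - \mathbf b|} - \sum_{b\,(b\neq a)} \gamma\,\omega^{(b)}_{ji}\,I^{(b)}_{jk}\,\frac{\partial}{\partial x_k}\,\frac{1}{|\mathbf r - \mathbf b|} \tag{76.23}$$

Finally the function

$$W^{(a)}(\mathbf r) = \sum_{b\,(b\neq a)} \tfrac{1}{2}\,\gamma\int_{(b)} \rho'\,|\mathbf r - \mathbf r'|\,(dx')^3 \tag{76.24}$$

will be equal to

$$W^{(a)}(\mathbf r) = \tfrac{1}{2}\sum_{b\,(b\neq a)} \gamma M_b\,|\mathbf r - \mathbf b| + \tfrac{1}{4}\sum_{b\,(b\neq a)} \gamma\,I^{(b)}_{jk}\,\frac{\partial^2 |\mathbf r - \mathbf b|}{\partial x_j\,\partial x_k} \tag{76.25}$$

to the accuracy required.

Here we have retained one term beyond the first in order to ensure that the
derivative of $W^{(a)}$ with respect to time may be obtained to the required accuracy.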

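Inserting into the first integral of (76.04) the expansion of $U^{(a)}(\mathbf r)$ into a
Taylor series about the point $x_k = a_k$, i.e.

$$U^{(a)}(\mathbf r) = U^{(a)}(\mathbf a) + (x_k - a_k)\left(\frac{\partial U^{(a)}}{\partial x_k}\right)_a + \ldots \tag{76.26}$$

we get

$$3\int_{(a)} \rho\,v_i\,U^{(a)}(\mathbf r)\,(dx)^3 = 3\,M_a\,\dot a_i\,U^{(a)}(\mathbf a) + 3\,\omega^{(a)}_{ji}\,I^{(a)}_{jk}\left(\frac{\partial U^{(a)}}{\partial x_k}\right)_a \tag{76.27}$$

Here the expression (76.21) for the Newtonian potential due to the external
masses must be inserted. We then get

$$3\int_{(a)} \rho\,v_i\,U^{(a)}(\mathbf r)\,(dx)^3 = 3\sum_{b\,(b\neq a)} \frac{\gamma M_a M_b\,\dot a_i}{|\mathbf a - \mathbf b|} + 3\sum_{b\,(b\neq a)} \gamma M_b\,\omega^{(a)}_{ji}\,I^{(a)}_{jk}\,\frac{\partial}{\partial a_k}\,\frac{1}{|\mathbf a - \mathbf b|} \tag{76.28}$$

The second integral in (76.04) may be evaluated similarly. We have

$$-4\int_{(a)} \rho\,U_i^{(a)}(\mathbf r)\,(dx)^3 = -4\,M_a\,U_i^{(a)}(\mathbf a) \tag{76.29}$$

and if the expression (76.23) for $U_i^{(a)}$ is inserted:

$$-4\int_{(a)} \rho\,U_i^{(a)}(\mathbf r)\,(dx)^3 = -4\sum_{b\,(b\neq a)} \frac{\gamma M_a M_b\,\dot b_i}{|\mathbf a - \mathbf b|} + 4\sum_{b\,(b\neq a)} \gamma M_a\,\omega^{(b)}_{ji}\,I^{(b)}_{jk}\,\frac{\partial}{\partial a_k}\,\frac{1}{|\mathbf a - \mathbf b|} \tag{76.30}$$

Finally, the last integral in (76.04) is

$$-\int_{(a)} \rho\,\frac{\partial^2 W^{(a)}}{\partial x_i\,\partial t}\,(dx)^3 = -M_a\left(\frac{\partial^2 W^{(a)}}{\partial x_i\,\partial t}\right)_a \tag{76.31}$$

Differentiating (76.25) with respect to time we get

$$\frac{\partial W^{(a)}}{\partial t} = -\tfrac{1}{2}\sum_{b\,(b\neq a)} \gamma M_b\,\dot b_k\,\frac{\partial |\mathbf r - \mathbf b|}{\partial x_k} + \tfrac{1}{4}\sum_{b\,(b\neq a)} \gamma\,\dot I^{(b)}_{jk}\,\frac{\partial^2 |\mathbf r - \mathbf b|}{\partial x_j\,\partial x_k} \tag{76.32}$$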

To the accuracy here considered we must neglect terms containing third 
derivatives of [r — b|. According to (72.16) the quantity has the value 

( 76 - 33 ) 

By differentiating (76.32) we obtain 
0 2 W(“) 

8xi 8t 

and therefore 
8 2 W <a> 


- -1 y y M b b t a 'l- r -H +ly T W ^ ~ b l (76.34) 

9 ^ ^ 3Xi dX]c 4 - ^ fir* fir.* fir.u 


!" - i ? A»A - j Z 


0a{ Say 0a* 

0 3 |a — b| 


dat 8a] dak 
(76.35) 




According to (76.04) the sum of the expressions (76.28), (76.30) and (76.35),
divided by $c^2$, gives the quantity $(P_{ai})_{\text{inter}}$.

We now introduce two functions $K_1$ and $K_2$ such that the first is
homogeneous quadratic in the velocities and homogeneous of degree $-1$ in the
coordinates, while the second is homogeneous and linear in the velocities and
homogeneous of degree $(-2)$ in the coordinates.

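We do this by putting

$$K_1 = \frac{\gamma}{4c^2}\sum_{\substack{a,\,b\\ (a\neq b)}} \frac{M_a M_b}{|\mathbf a - \mathbf b|}\left(3\,\dot{\mathbf a}^2 + 3\,\dot{\mathbf b}^2 - 8\,\dot a_k\,\dot b_k\right) + \frac{\gamma}{4c^2}\sum_{\substack{a,\,b\\ (a\neq b)}} M_a M_b\,\dot a_i\,\dot b_k\,\frac{\partial^2 |\mathbf a - \mathbf b|}{\partial a_i\,\partial a_k} \tag{76.36}$$

and

$$K_2 = \frac{\gamma}{2c^2}\sum_{\substack{a,\,b\\ (a\neq b)}} \left[M_b\,\omega^{(a)}_{ji}\,I^{(a)}_{jk}\,(3\,\dot a_i - 4\,\dot b_i) - M_a\,\omega^{(b)}_{ji}\,I^{(b)}_{jk}\,(3\,\dot b_i - 4\,\dot a_i)\right]\frac{\partial}{\partial a_k}\,\frac{1}{|\mathbf a - \mathbf b|} + \frac{\gamma}{8c^2}\sum_{\substack{a,\,b\\ (a\neq b)}} \left(M_b\,\dot I^{(a)}_{jk}\,\dot b_i - M_a\,\dot I^{(b)}_{jk}\,\dot a_i\right)\frac{\partial^3 |\mathbf a - \mathbf b|}{\partial a_i\,\partial a_j\,\partial a_k} \tag{76.37}$$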


It is then easy to verify that

$$(P_{ai})_{\text{inter}} = \frac{\partial K_1}{\partial \dot a_i} + \frac{\partial K_2}{\partial \dot a_i} \tag{76.38}$$

Remembering the estimates

$$I \sim M L^2\,; \qquad \dot I \sim M q L \tag{76.39}$$

it is readily seen that the order of the functions $K_1$ and $K_2$ is given by

$$K_1 \sim M\,\frac{q^4}{c^2}\,; \qquad K_2 \sim M\,\frac{q^4}{c^2}\,\frac{L}{R} \tag{76.40}$$

Thus $K_2$ will be small in comparison with $K_1$.
Using (76.16) and (76.38) we can write the total momentum in the form

$$P_{ai} = \frac{\partial}{\partial \dot a_i}\,(K + K_1 + K_2) \tag{76.41}$$

where $K$, $K_1$ and $K_2$ have the values (76.15), (76.36) and (76.37) respectively.
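The relation between the interaction momentum and the function $K_1$ can be checked numerically for point-like, non-rotating bodies. The sketch below assumes the reading $K_1 = \frac{\gamma}{4c^2}\sum_{a\neq b} M_a M_b \left[(3\dot{\mathbf a}^2 + 3\dot{\mathbf b}^2 - 8\,\dot a_k \dot b_k)/|\mathbf a - \mathbf b| + \dot a_i \dot b_k\,\partial^2|\mathbf a - \mathbf b|/\partial a_i \partial a_k\right]$ and compares $\partial K_1/\partial \dot a_i$, taken by finite differences, with the leading terms of $(P_{ai})_{\text{inter}}$. The rotation-dependent terms collected in $K_2$ are left out, and all names, units and sample values are hypothetical.

```python
import math

GAMMA, C = 1.0, 10.0   # illustrative units, not physical values

def hessian_of_distance(d):
    """Second derivatives of |d| with respect to the components of d."""
    r = math.sqrt(sum(x * x for x in d))
    return [[(1.0 if i == k else 0.0) / r - d[i] * d[k] / r ** 3
             for k in range(3)] for i in range(3)]

def k1(masses, pos, vel):
    """K_1 as a double sum over ordered pairs a != b."""
    total = 0.0
    n = len(masses)
    for a in range(n):
        for b in range(n):
            if a == b:
                continue
            d = [pos[a][i] - pos[b][i] for i in range(3)]
            r = math.sqrt(sum(x * x for x in d))
            h = hessian_of_distance(d)
            va2 = sum(x * x for x in vel[a])
            vb2 = sum(x * x for x in vel[b])
            vab = sum(x * y for x, y in zip(vel[a], vel[b]))
            quad = sum(vel[a][i] * h[i][k] * vel[b][k]
                       for i in range(3) for k in range(3))
            total += masses[a] * masses[b] * (
                (3 * va2 + 3 * vb2 - 8 * vab) / r + quad)
    return GAMMA / (4 * C * C) * total

def p_inter(masses, pos, vel, a):
    """Leading interaction momentum of body a (rotation terms omitted)."""
    p = [0.0, 0.0, 0.0]
    for b in range(len(masses)):
        if b == a:
            continue
        d = [pos[a][i] - pos[b][i] for i in range(3)]
        r = math.sqrt(sum(x * x for x in d))
        h = hessian_of_distance(d)
        for i in range(3):
            p[i] += GAMMA / (C * C) * masses[a] * masses[b] * (
                (3 * vel[a][i] - 4 * vel[b][i]) / r
                + 0.5 * sum(h[i][k] * vel[b][k] for k in range(3)))
    return p

def dk1_dvel(masses, pos, vel, a, i, step=1e-3):
    """Central difference of K_1 with respect to component i of velocity a."""
    up = [list(v) for v in vel]
    dn = [list(v) for v in vel]
    up[a][i] += step
    dn[a][i] -= step
    return (k1(masses, pos, up) - k1(masses, pos, dn)) / (2 * step)

masses = [1.0, 2.0, 3.0]
pos = [[0.0, 0.0, 0.0], [3.0, 1.0, 0.0], [-1.0, 4.0, 2.0]]
vel = [[0.1, 0.2, 0.0], [-0.3, 0.0, 0.1], [0.0, -0.1, 0.2]]

max_diff = max(abs(dk1_dvel(masses, pos, vel, a, i) - p_inter(masses, pos, vel, a)[i])
               for a in range(3) for i in range(3))
print(max_diff)
```

Because $K_1$ is quadratic in the velocities, the central difference is exact up to rounding, so the two expressions should agree to machine precision.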


77. Evaluation of the Force 

In order to evaluate the integrals which according to (75.32) are involved in
the expression for the force

$$F_{ai} = \int_{(a)} \frac{\partial U^{*(a)}}{\partial x_i}\,\sigma\,(dx)^3 - \frac{1}{c^2}\,\frac{d}{dt}\int_{(a)} \rho\,\frac{\partial^2 W^{(a)}}{\partial x_i\,\partial t}\,(dx)^3 - \frac{4}{c^2}\int_{(a)} \rho\,v_k\,\frac{\partial U_k^{(a)}}{\partial x_i}\,(dx)^3 \tag{77.01}$$

we must first of all find the potential $U^*$ to sufficient approximation. As regards
the potentials $W$ and $U_k$ the first order expressions already derived for them,
(76.32) and (76.25), will be sufficient.

According to (75.05) the potential $U^*$ satisfies the equation

$$\Delta U^* - \frac{1}{c^2}\,\frac{\partial^2 U^*}{\partial t^2} = -4\pi\gamma\,\sigma \tag{77.02}$$

where $\sigma$ is given by

$$\sigma = \rho + \frac{1}{c^2}\left(\tfrac{3}{2}\rho v^2 + \rho\Pi - \rho U + 3p\right) \tag{77.03}$$

and differs little from $\rho$. If we know the Newtonian potential $U$ that satisfies
the equation

$$\Delta U = -4\pi\gamma\,\rho \tag{77.04}$$

the generalized Newtonian potential $U^*$ may be obtained from $U$ by introducing
two corrections. The first is for retardation, the second arises from the
replacement of $\rho$ by $\sigma$. The latter can be written as

$$U_{\text{add}} = \gamma\int_{(\infty)} \frac{\sigma' - \rho'}{|\mathbf r - \mathbf r'|}\,(dx')^3 \tag{77.05}$$

As regards the retardation correction, it can be expressed in terms of the
function $W$ already introduced by (75.18)

$$W = \tfrac{1}{2}\,\gamma\int_{(\infty)} \rho'\,|\mathbf r - \mathbf r'|\,(dx')^3 \tag{77.06}$$

We then have

$$U^* = U + \frac{1}{c^2}\,\frac{\partial^2 W}{\partial t^2} + U_{\text{add}} \tag{77.07}$$

We now deal with the evaluation of $U_{\text{add}}$. We write the difference of $\sigma$ and $\rho$ as

$$\sigma - \rho = \frac{1}{c^2}\left(\tfrac{3}{2}\rho v^2 + \rho\Pi - \rho u_a + 3p\right) - \frac{1}{c^2}\,\rho\,U^{(a)} \tag{77.08}$$

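Here the first term depends only on the internal structure of the body and on
its velocity while the second depends also on the external potential. Using the
equations (76.06) and (76.07) we get

$$\int_{(a)} \left(\tfrac{3}{2}\rho v^2 + \rho\Pi - \rho u_a + 3p\right)(dx)^3 = \tfrac{3}{2}\,M_a\,\dot{\mathbf a}^2 + \xi_a \tag{77.09}$$

where

$$\xi_a = \int_{(a)} (4\rho\,\Omega_a + 2p)\,(dx)^3 \tag{77.10}$$

Also, the first moment becomes

$$\int_{(a)} \left(\tfrac{3}{2}\rho v^2 + \rho\Pi - \rho u_a + 3p\right)(x_i - a_i)\,(dx)^3 = 3\,\dot a_k\,\omega^{(a)}_{jk}\,I^{(a)}_{ij} + \xi_{ai} \tag{77.11}$$

where

$$\xi_{ai} = \int_{(a)} (4\rho\,\Omega_a + 2p)\,(x_i - a_i)\,(dx)^3 \tag{77.12}$$

The integrals appearing here were evaluated in Section 74:

$$\xi_a = \tfrac{2}{3}\,\varepsilon_a + \tfrac{8}{3}\,T_a \tag{77.13}$$

and

$$\xi_{ai} = \eta_{ai} + 3\,T_{ai} \tag{77.14}$$

Hence we get, approximately

$$\int_{(a)} \frac{\left(\tfrac{3}{2}\rho v^2 + \rho\Pi - \rho u_a + 3p\right)'}{|\mathbf r - \mathbf r'|}\,(dx')^3 = \left(\tfrac{3}{2}\,M_a\,\dot{\mathbf a}^2 + \xi_a\right)\frac{1}{|\mathbf r - \mathbf a|} + \left(3\,\dot a_k\,\omega^{(a)}_{jk}\,I^{(a)}_{ij} + \xi_{ai}\right)\frac{\partial}{\partial a_i}\,\frac{1}{|\mathbf r - \mathbf a|} \tag{77.15}$$

Further we have, to the accuracy required,

$$\int_{(a)} \frac{\rho'\,U^{(a)}(\mathbf r')}{|\mathbf r - \mathbf r'|}\,(dx')^3 = \frac{M_a\,U^{(a)}(\mathbf a)}{|\mathbf r - \mathbf a|} \tag{77.16}$$

Combining these results we can express the quantity (77.05) as

$$U_{\text{add}} = \frac{\gamma}{c^2}\sum_a \left(\tfrac{3}{2}\,M_a\,\dot{\mathbf a}^2 - M_a\,U^{(a)}(\mathbf a) + \xi_a\right)\frac{1}{|\mathbf r - \mathbf a|} + \frac{\gamma}{c^2}\sum_a \left(3\,\dot a_k\,\omega^{(a)}_{jk}\,I^{(a)}_{ij} + \xi_{ai}\right)\frac{\partial}{\partial a_i}\,\frac{1}{|\mathbf r - \mathbf a|} \tag{77.17}$$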


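The equation (77.01) for the force acting on mass (a) involves not the whole of
$U^*$ but only that part which is due to the external masses.

This part is

$$U^{*(a)} = U^{(a)} + \frac{1}{c^2}\,\frac{\partial^2 W^{(a)}}{\partial t^2} + U^{(a)}_{\text{add}} \tag{77.18}$$

where $U^{(a)}(\mathbf r)$ and $W^{(a)}(\mathbf r)$ are the quantities already determined in Section 76,
and the additional term $U^{(a)}_{\text{add}}$ is given by

$$U^{(a)}_{\text{add}} = \frac{\gamma}{c^2}\sum_{b\,(b\neq a)} \left(\tfrac{3}{2}\,M_b\,\dot{\mathbf b}^2 - M_b\,U^{(b)}(\mathbf b) + \xi_b\right)\frac{1}{|\mathbf r - \mathbf b|} + \frac{\gamma}{c^2}\sum_{b\,(b\neq a)} \left(3\,\dot b_k\,\omega^{(b)}_{jk}\,I^{(b)}_{ij} + \xi_{bi}\right)\frac{\partial}{\partial b_i}\,\frac{1}{|\mathbf r - \mathbf b|} \tag{77.19}$$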

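The values of potential obtained must now be inserted into the expression
(77.01) for the force. We get

$$F_{ai} = \int_{(a)} \frac{\partial U^{(a)}}{\partial x_i}\,\rho\,(dx)^3 + \int_{(a)} \frac{\partial U^{(a)}_{\text{add}}}{\partial x_i}\,\rho\,(dx)^3 + \int_{(a)} \frac{\partial U^{(a)}}{\partial x_i}\,(\sigma - \rho)\,(dx)^3 + \frac{1}{c^2}\int_{(a)} \rho\,\frac{\partial^3 W^{(a)}}{\partial x_i\,\partial t^2}\,(dx)^3 - \frac{1}{c^2}\,\frac{d}{dt}\int_{(a)} \rho\,\frac{\partial^2 W^{(a)}}{\partial x_i\,\partial t}\,(dx)^3 - \frac{4}{c^2}\int_{(a)} \rho\,v_k\,\frac{\partial U_k^{(a)}}{\partial x_i}\,(dx)^3 \tag{77.20}$$

Here the first integral is the Newtonian expression for the force; it has already
been evaluated in Section 71. We have, by (71.33)

$$\int_{(a)} \frac{\partial U^{(a)}}{\partial x_i}\,\rho\,(dx)^3 = -\frac{\partial \Phi}{\partial a_i} \tag{77.21}$$

where $\Phi$ is the Newtonian potential energy of a system of bodies, as in (71.32).
To the accuracy required the second integral may be written as

$$\int_{(a)} \frac{\partial U^{(a)}_{\text{add}}}{\partial x_i}\,\rho\,(dx)^3 = M_a\left(\frac{\partial U^{(a)}_{\text{add}}}{\partial x_i}\right)_a \tag{77.22}$$

Here the expression (77.19) can be taken for $U^{(a)}_{\text{add}}$. The third integral can be
evaluated by use of the relations (77.08) to (77.14). We obtain

$$\int_{(a)} \frac{\partial U^{(a)}}{\partial x_i}\,(\sigma - \rho)\,(dx)^3 = \frac{1}{c^2}\left(\tfrac{3}{2}\,M_a\,\dot{\mathbf a}^2 + \xi_a\right)\left(\frac{\partial U^{(a)}}{\partial x_i}\right)_a + \frac{1}{c^2}\left(3\,\dot a_k\,\omega^{(a)}_{lk}\,I^{(a)}_{jl} + \xi_{aj}\right)\left(\frac{\partial^2 U^{(a)}}{\partial x_i\,\partial x_j}\right)_a - \frac{1}{c^2}\,M_a\,U^{(a)}(\mathbf a)\left(\frac{\partial U^{(a)}}{\partial x_i}\right)_a \tag{77.23}$$

where the expression (76.21) may be used for $U^{(a)}$.

We add the expressions (77.22) and (77.23) and represent the sum as a
derivative with respect to $a_i$. In (77.23) we may put $x_i = a_i$ before
differentiating, but in (77.22) we must take into account that one of the terms in $U^{(b)}(\mathbf b)$
itself depends on $a_i$, namely

$$U^{(b)}(\mathbf b) = \frac{\gamma M_a}{|\mathbf a - \mathbf b|} + \ldots \tag{77.24}$$

where the dots denote terms not containing $a_i$. As a consequence of (77.24) we
have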


(77 - 26) 

Using this relation we get 

I nOr p '*>• + / - f){dxf ~h L ' + Lt) ~ S; 


where we have put 

L i = i 2 + H) + 2y(5 6 M fl + Wi^ 


(77.26) 


(77.27) 


= 2 ? 2 


(77.28) 
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and where the function Y is equal to 






\ 

a — b | v 


+ — M a V^Y+ . . . 

(77.29) 


with dots again denoting terms independent of $a_i$. We arrange these terms in
such a way as to make $\Psi$ symmetric in all masses and the corresponding position
vectors. We put

$$\Psi(a,b) = \frac{\gamma^2}{2c_0^2}\,\frac{M_a M_b\,(M_a + M_b)}{|\mathbf a - \mathbf b|^2} \tag{77.30}$$

and

$$\Psi(a,b,c) = \frac{\gamma^2}{c_0^2}\,M_a M_b M_c\left(\frac{1}{|\mathbf a - \mathbf b|\,|\mathbf a - \mathbf c|} + \frac{1}{|\mathbf b - \mathbf c|\,|\mathbf b - \mathbf a|} + \frac{1}{|\mathbf c - \mathbf a|\,|\mathbf c - \mathbf b|}\right) \tag{77.31}$$

where, temporarily, we have denoted the speed of light by $c_0$, to avoid confusion
with the suffix in $M_c$. Thus it is readily seen that in the function

$$\Psi = \tfrac{1}{2}\sum_{\substack{a,\,b\\ (a\neq b)}} \Psi(a,b) + \tfrac{1}{6}\sum_{\substack{a,\,b,\,c\\ (a\neq b,\ b\neq c,\ c\neq a)}} \Psi(a,b,c) \tag{77.32}$$

the terms depending on $a_i$ are the same as in (77.29). Thus in the equation
(77.26) for the sum of the two integrals $\Psi$ may be understood to stand for
(77.32). The terms $\Psi(a,b,c)$ give peculiar "triple interactions" of the masses.

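We now consider those integrals in (77.20) which involve the function $W^{(a)}$.
We have, evidently

$$\frac{1}{c^2}\int_{(a)} \rho\,\frac{\partial^3 W^{(a)}}{\partial x_i\,\partial t^2}\,(dx)^3 - \frac{1}{c^2}\,\frac{d}{dt}\int_{(a)} \rho\,\frac{\partial^2 W^{(a)}}{\partial x_i\,\partial t}\,(dx)^3 = -\frac{1}{c^2}\int_{(a)} \frac{\partial \rho}{\partial t}\,\frac{\partial^2 W^{(a)}}{\partial x_i\,\partial t}\,(dx)^3 = -\frac{1}{c^2}\int_{(a)} \rho\,v_j\,\frac{\partial^3 W^{(a)}}{\partial x_i\,\partial x_j\,\partial t}\,(dx)^3 \tag{77.33}$$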

This expression can be evaluated by use of the relation (76.34) for the 
derivative of W {a) . We get 


where 


and 


-iJ 

(a) 

L 3 ~ 4c 2 2 T MoMidjbk 


cPW^ dL 3 dL 4 

pvi - (dx) 3 =-- 

^ 1 dx t dx } dV ’ dot 


3a% 


d 2 |a 


da,] dajc 


— V y (M b bJ® - Mdjm ^—^ 

8c 2 n » * kl a * kU da, da. da, 

n. h ) K l 


a, b 
(a?b) 


(77.34) 

(77.35) 

(77.36)

It remains to examine the last integral in (77.20). Its evaluation presents no
difficulties and we get



4 f 8L. 8L 6 

cj pU * {dx) da t + dot 

(77.37) 

where 

fa) 

T 2 yM a Mb(dkbic) 

c* Z |a — b| 
fa *6) 

(77.38) 

and 

(77.39) 


a, b 
(a^b) 


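We collect all integrals and so get the following expression for the force

$$F_{ai} = \frac{\partial}{\partial a_i}\left[-\Phi - \Psi + (L_1 + L_3 + L_5) + (L_2 + L_4 + L_6)\right] \tag{77.40}$$

Let us compare this expression for the force with the previously derived
expression (76.41) for the momentum. We put

$$\Phi_1 = -\frac{\gamma}{2c^2}\sum_{\substack{a,\,b\\ (a\neq b)}} \frac{\xi_b\,M_a + \xi_a\,M_b}{|\mathbf a - \mathbf b|} \tag{77.41}$$

and

$$\Phi_2 = -\frac{\gamma}{2c^2}\sum_{\substack{a,\,b\\ (a\neq b)}} \left(M_b\,\xi_{ai} - M_a\,\xi_{bi}\right)\frac{\partial}{\partial a_i}\,\frac{1}{|\mathbf a - \mathbf b|} \tag{77.42}$$

The negative of these quantities also enters the expressions (77.27) and (77.28)
for $L_1$ and $L_2$.

It is simple to check that we have

$$L_1 + L_3 + L_5 = K_1 - \Phi_1 \tag{77.43}$$

and similarly

$$L_2 + L_4 + L_6 = K_2 - \Phi_2 \tag{77.44}$$

where $K_1$ and $K_2$ have the values (76.36) and (76.37). Therefore

$$F_{ai} = \frac{\partial}{\partial a_i}\left(K_1 + K_2 - \Phi - \Phi_1 - \Phi_2 - \Psi\right) \tag{77.45}$$

On the other hand, by (76.41)

$$P_{ai} = \frac{\partial}{\partial \dot a_i}\left(K + K_1 + K_2\right) \tag{77.46}$$

where $K$ is given by (76.15).

Here $K$ is independent of the coordinates $a_i$ while the quantities $\Phi$, $\Phi_1$,
$\Phi_2$ and $\Psi$ are independent of the velocities $\dot a_i$. Putting

$$L = K + K_1 + K_2 - \Phi - \Phi_1 - \Phi_2 - \Psi \tag{77.47}$$

we can therefore write

$$P_{ai} = \frac{\partial L}{\partial \dot a_i}\,; \qquad F_{ai} = \frac{\partial L}{\partial a_i} \tag{77.48}$$

The equations of motion

$$\frac{dP_{ai}}{dt} = F_{ai} \tag{77.49}$$

then appear in the Lagrangian form

$$\frac{d}{dt}\,\frac{\partial L}{\partial \dot a_i} - \frac{\partial L}{\partial a_i} = 0 \tag{77.50}$$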


78. The Equations of Translational Motion in Lagrangian Form 

In the preceding sections we have derived the equations of motion for the
mass centres of a set of masses, by use of the condition

$$\int_{(a)} g\,\nabla_\alpha T^{\alpha i}\,(dx)^3 = 0 \tag{78.01}$$

which in turn was derived from the harmonic condition (see Section 70). It
was assumed in the derivation that each mass rotates about its own mass centre
in the manner of a rigid body.

As we saw, the equations of motion are expressible in the Lagrangian form

$$\frac{d}{dt}\,\frac{\partial L}{\partial \dot a_i} - \frac{\partial L}{\partial a_i} = 0 \tag{78.02}$$

where the Lagrangian is obtained by inserting into (77.47) the expressions
previously found for $K$, $K_1$, $K_2$, $\Phi$, $\Phi_1$, $\Phi_2$ and $\Psi$. (See equations (76.15),
(76.36), (76.37), (71.32), (77.41), (77.42) and (77.32).)† We restate these
equations. We have

$$L = K + K_1 + K_2 - \Phi - \Phi_1 - \Phi_2 - \Psi \tag{78.03}$$

where the various terms have the values set out in the following:


$$K = \sum_a \left(\tfrac{1}{2}M_a\dot{\mathbf a}^2 + T_a\right) + \frac{1}{c^2}\sum_a \left[\tfrac{1}{8}M_a(\dot{\mathbf a}^2)^2 + \tfrac{1}{2}Z^{(a)}_{ik}\,\dot a_i\,\dot a_k + Z^{(a)}_{i}\,\dot a_i\right] \tag{78.04}$$

Here the first sum represents the usual Newtonian kinetic energy of the
translational and rotational motion of a system of bodies. (In forming the
left-hand side of Lagrange's equations it is inessential to include the term
$T_a$.) The second sum gives a correction to the "kinetic" part of the
Lagrangian, i.e. to that part which depends on the velocities. In the absence of rotation
this correction reduces, according to (76.19), to the usual correction term
well-known from the mechanics of mass points; when rotation is present this
correction also involves "mixed" terms which depend both on the velocities of
translational motion and on the angular velocities of rotation of the body.
The quantity $K$ does not depend on the position of the mass centre of the body.

† For non-rotating masses the reduction to Lagrangian form was first performed by
I. Fichtenholz [40].

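We now state the values of $K_1$ and $K_2$. According to (76.36) and (76.37) we
have:

$$K_1 = \frac{\gamma}{4c^2}\sum_{a,\,b} \frac{M_a M_b}{|\mathbf a - \mathbf b|}\left(3\,\dot{\mathbf a}^2 + 3\,\dot{\mathbf b}^2 - 8\,\dot a_k\,\dot b_k\right) + \frac{\gamma}{4c^2}\sum_{a,\,b} M_a M_b\,\dot a_i\,\dot b_k\,\frac{\partial^2 |\mathbf a - \mathbf b|}{\partial a_i\,\partial a_k} \tag{78.05}$$

and

$$K_2 = \frac{\gamma}{2c^2}\sum_{a,\,b} \left[M_b\,\omega^{(a)}_{ji}\,I^{(a)}_{jk}\,(3\,\dot a_i - 4\,\dot b_i) - M_a\,\omega^{(b)}_{ji}\,I^{(b)}_{jk}\,(3\,\dot b_i - 4\,\dot a_i)\right]\frac{\partial}{\partial a_k}\,\frac{1}{|\mathbf a - \mathbf b|} + \frac{\gamma}{8c^2}\sum_{a,\,b} \left(M_b\,\dot I^{(a)}_{jk}\,\dot b_i - M_a\,\dot I^{(b)}_{jk}\,\dot a_i\right)\frac{\partial^3 |\mathbf a - \mathbf b|}{\partial a_i\,\partial a_j\,\partial a_k} \tag{78.06}$$

These terms depend both on the coordinates and on the velocities and
represent, so to speak, the result of an interaction of kinetic and potential energies.
As we already noted in Section 76 the quantity $K_1$ is homogeneous and quadratic
in the velocities of translational motion $\dot a_i$ whereas $K_2$ is bilinear in the velocities
of translation and the angular velocities.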

The next term in the Lagrangian is:

$$\Phi = -\tfrac{1}{2}\sum_{a,\,b} \frac{\gamma M_a M_b}{|\mathbf a - \mathbf b|} - \tfrac{1}{4}\sum_{a,\,b} \gamma\left(M_a\,I^{(b)}_{ik} + M_b\,I^{(a)}_{ik}\right)\frac{\partial^2}{\partial a_i\,\partial a_k}\,\frac{1}{|\mathbf a - \mathbf b|} \tag{78.07}$$

It represents the Newtonian potential energy of the system of bodies. It is
augmented by the corrections $\Phi_1$ and $\Phi_2$ which, by (77.41) and (77.42) are

$$-\Phi_1 = \frac{\gamma}{2c^2}\sum_{a,\,b} \frac{M_a\,\xi_b + M_b\,\xi_a}{|\mathbf a - \mathbf b|} \tag{78.08}$$

and

$$-\Phi_2 = \frac{\gamma}{2c^2}\sum_{a,\,b} \left(M_b\,\xi_{ai} - M_a\,\xi_{bi}\right)\frac{\partial}{\partial a_i}\,\frac{1}{|\mathbf a - \mathbf b|} \tag{78.09}$$

The quantity $\Phi_1$ may be interpreted as the result of replacing the mass $M_a$ in
the Newtonian potential energy, or more precisely in its leading term†

$$\Phi_0 = -\tfrac{1}{2}\sum_{a,\,b} \frac{\gamma M_a M_b}{|\mathbf a - \mathbf b|} \tag{78.10}$$

by the effective mass $M_a + \delta M_a$, where

$$\delta M_a = \frac{\xi_a}{c^2} \tag{78.11}$$

On the other hand, the quantity $\Phi_2$ may be interpreted as the result of
displacing the mass centre of body (a), so that $a_i$ goes over into $a_i + \delta a_i$, where

$$\delta a_i = \frac{\xi_{ai}}{M_a\,c^2} \tag{78.12}$$

Indeed we have, apart from small quantities,

$$-\tfrac{1}{2}\sum_{a,\,b} \frac{\gamma\,(M_a + \delta M_a)(M_b + \delta M_b)}{|\mathbf a + \delta\mathbf a - \mathbf b - \delta\mathbf b|} = \Phi_0 + \Phi_1 + \Phi_2 \tag{78.13}$$

† In the equation (78.07) for $\Phi$ the correction term is of order $L^2/R^2$ relative to the main
term. Consequently relativistic corrections to the correction term will be of the same order
as terms already neglected. Therefore it is immaterial whether such corrections are
introduced into the complete expression for $\Phi$ or only into the main term $\Phi_0$.

We note that the quantity $\xi_a$ involved in the equation for the effective mass
is equal to the intrinsic energy of body (a), which is composed of the kinetic
energy of its rotation around its mass centre, of its elastic energy and of the
gravitational energy of its component particles. Indeed it can be shown that
the expression

$$\xi_a = T_a + \int_{(a)} \rho\,\Pi\,(dx)^3 - \tfrac{1}{2}\int_{(a)} \rho\,u_a\,(dx)^3 \tag{78.14}$$

for the intrinsic energy goes over into the form (77.10) for $\xi_a$ if Liapunov's
equation and (73.26) are used.

It might be expected that the presence of the mass related to the intrinsic
energy of a body would give rise not only to a change of the effective mass
but also to a displacement of its mass centre. However, in view of the fact
that the kinetic energy of rotation about its mass centre refers to the body as
a whole and is not related to the individual constituent particles, it would be
difficult to predict the correct expression for the displacement of the mass
centre. Actually the calculation has proved it to be given by (78.12) with
$\xi_{ai}$ given by (77.12).

It remains for us to write down the last term in the Lagrangian. It is
convenient to denote the three position vectors occurring in it by $\mathbf a$, $\mathbf b$ and $\mathbf c$ and
we therefore adopt the notation $c_0$ for the speed of light, as was done in Section
77. According to (77.30), (77.31) and (77.32) we have

$$\Psi = \frac{\gamma^2}{4c_0^2}\sum_{a,\,b} \frac{M_a M_b\,(M_a + M_b)}{|\mathbf a - \mathbf b|^2} + \frac{\gamma^2}{6c_0^2}\sum_{a,\,b,\,c} M_a M_b M_c\left(\frac{1}{|\mathbf a - \mathbf b|\,|\mathbf a - \mathbf c|} + \frac{1}{|\mathbf b - \mathbf c|\,|\mathbf b - \mathbf a|} + \frac{1}{|\mathbf c - \mathbf a|\,|\mathbf c - \mathbf b|}\right) \tag{78.15}$$

Here the last sum gives the triple interaction of the masses.† If in the sense
of a remark made above we call the terms $K_1$ and $K_2$ the "kinetic-potential"
part of the Lagrangian the quantity $\Psi$ may be called its "potential-potential"
part. These names stress the fact that in the second approximation of Einstein's
theory the additivity of kinetic and potential parts of the Lagrangian,
characteristic of Newtonian theory, no longer prevails.

† In the triple sum the labels satisfy the relations $a \neq b$, $b \neq c$, $c \neq a$, and in all double
sums we must take $a \neq b$.

According to (78.03) the total Lagrangian is the sum of the expressions (78.04)
to (78.09) and (78.15). To the present degree of approximation it leads to the
equations of translational motion with relativistic corrections.
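The symmetric form of the "potential-potential" term can be compared with a more compact nested sum. The sketch below evaluates $\Psi$ from the pair and triple sums as read from (78.15) and checks that it equals $\frac{\gamma^2}{2c_0^2}\sum_a M_a \bigl(\sum_{b\neq a} M_b/|\mathbf a - \mathbf b|\bigr)^2$; this nested form is an algebraic rearrangement introduced here for the check, not an expression quoted from the text. Function names, units and sample data are hypothetical.

```python
import itertools
import math

GAMMA, C0 = 1.0, 10.0   # illustrative units, not physical values

def psi_symmetric(masses, pos):
    """Pair sum plus triple sum, over ordered tuples of distinct labels."""
    n = len(masses)
    pair = 0.0
    for a, b in itertools.permutations(range(n), 2):
        r = math.dist(pos[a], pos[b])
        pair += masses[a] * masses[b] * (masses[a] + masses[b]) / r ** 2
    triple = 0.0
    for a, b, c in itertools.permutations(range(n), 3):
        rab = math.dist(pos[a], pos[b])
        rbc = math.dist(pos[b], pos[c])
        rca = math.dist(pos[c], pos[a])
        triple += masses[a] * masses[b] * masses[c] * (
            1.0 / (rab * rca) + 1.0 / (rbc * rab) + 1.0 / (rca * rbc))
    return (GAMMA ** 2 / (4 * C0 ** 2)) * pair + (GAMMA ** 2 / (6 * C0 ** 2)) * triple

def psi_nested(masses, pos):
    """Equivalent nested form: sum over a of M_a * (sum over b != a of M_b / r_ab)^2."""
    total = 0.0
    for a in range(len(masses)):
        s = sum(masses[b] / math.dist(pos[a], pos[b])
                for b in range(len(masses)) if b != a)
        total += masses[a] * s * s
    return (GAMMA ** 2 / (2 * C0 ** 2)) * total

masses = [1.0, 2.0, 3.0, 0.5]
pos = [(0.0, 0.0, 0.0), (3.0, 1.0, 0.0), (-1.0, 4.0, 2.0), (2.0, -2.0, 5.0)]

psi_a = psi_symmetric(masses, pos)
psi_b = psi_nested(masses, pos)
print(psi_a, psi_b)
```

The nested form makes it clear that the cost of evaluating $\Psi$ grows only quadratically with the number of bodies, although the symmetric form displays the triple interactions explicitly.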

As for the equations of rotational motion, they may be obtained from the 
relations (70.28) which have the form 

~ f flrteV a 2« - x k VJ*%lxf = 0 (78.16) 

(a) 

In Section 72 it was shown that in Newtonian approximation these equations 
determine the rate of change of the angular momentum 


of each body, giving 


—CTf 



«»* - W !f ~ to -><m 


(78.17) 


(78.18) 


(see equations (72.07) and (72.13)). 

The relativistic corrections to the Newtonian equations of rotational motion 
may be obtained by a more exact evaluation of the left-hand side of (78.16). 
These calculations can be performed on the pattern of those performed for 
translational motion in the preceding sections. They do not present any 
difficulties apart from the complexity of the manipulations involved. However,
since relativistic corrections to the rotational motion of celestial bodies are 
quite insignificant and are even harder to observe than the corrections to their 
translational motion, we shall not evaluate them here. 

79. The Integrals of the Equations of Motion for Systems of Bodies 

Just as was the case in Sections 26 to 28 for a system of particles interacting 
by means of the electromagnetic field, a system of gravitating bodies has the 
property that its equations of motion possess the ten classical integrals, or 
constants of the motion, namely the integrals of momentum and energy and 
those of angular momentum and of mass centre motion. 

We shall now derive general expressions for these constants of motion in the 
form of definite integrals. In doing this we shall proceed as in Section 71 and 
make use, for the internal motion, of the general equations of motion of an 
elastic body in non-relativistic approximation. We restate these equations. 
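We have the equation of continuity 

$$\frac{\partial \rho}{\partial t} + \frac{\partial (\rho v_i)}{\partial x_i} = 0 \tag{79.01}$$

the equations of motion proper 

$$\rho w_i - \rho \frac{\partial U}{\partial x_i} = \frac{\partial p_{ik}}{\partial x_k} \tag{79.02}$$

in which the acceleration $w_i$ is given by 

$$w_i = \frac{\partial v_i}{\partial t} + v_k \frac{\partial v_i}{\partial x_k} \tag{79.03}$$

and finally the relation giving the elastic potential energy: 

$$\rho \frac{d\Pi}{dt} = \frac{1}{2}\, p_{ik}\left(\frac{\partial v_i}{\partial x_k} + \frac{\partial v_k}{\partial x_i}\right) \tag{79.04}$$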


We shall not at present introduce the assumption that each body rotates in a 
rigid manner. 

To obtain the constants of motion we may start from the relations which, 
according to Section 70, result from the harmonic conditions. The relations 

$$\int_{(\infty)} g\,\nabla_\alpha T^{\alpha i}\,(dx)^3 = 0 \tag{79.05}$$

integrated with respect to time give us the three momentum integrals, and the 
relation 

$$\int_{(\infty)} g\,\nabla_\alpha T^{\alpha 0}\,(dx)^3 = 0 \tag{79.06}$$

integrated with respect to time gives the energy integral. Similarly the relations 

$$\int_{(\infty)} g\left(x_i \nabla_\alpha T^{\alpha k} - x_k \nabla_\alpha T^{\alpha i}\right)(dx)^3 = 0 \tag{79.07}$$

will give us the three integrals of angular momentum and finally the relation 

$$\int_{(\infty)} g\,x_i \nabla_\alpha T^{\alpha 0}\,(dx)^3 + x_0 \int_{(\infty)} g\,\nabla_\alpha T^{\alpha i}\,(dx)^3 = 0 \tag{79.08}$$

(in which, incidentally, the second term vanishes by virtue of (79.05)) leads 
to the three integrals for the motion of the mass centre. 

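We begin with the evaluation of the momentum. In deriving the conservation 
law from (79.05) we can use the results of Section 75. Generalizing equation 
(75.31) a little we put 

$$P_i = \int_{(\infty)} \left\{ \rho v_i \left[ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi + 3U \right) \right] - \frac{1}{c^2}\, p_{ik} v_k - \frac{4}{c^2}\, \rho U_i - \frac{1}{c^2}\, \rho \frac{\partial^2 W}{\partial x_i\, \partial t} \right\} (dx)^3 \tag{79.09}$$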


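and then form the derivative of this quantity with respect to time. Using 
(73.01) we first find 

$$\frac{dP_i}{dt} = \int \sigma \frac{\partial U^*}{\partial x_i}\,(dx)^3 - \frac{1}{c^2} \frac{d}{dt} \int \rho \frac{\partial^2 W}{\partial x_i\, \partial t}\,(dx)^3 - \frac{4}{c^2} \int \rho v_k \frac{\partial U_k}{\partial x_i}\,(dx)^3 \tag{79.10}$$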


where 

$$\sigma = \rho + \frac{\rho}{c^2}\left( \tfrac{3}{2} v^2 + \Pi - U \right) - \frac{p_{kk}}{c^2} \tag{79.11}$$

and the function $U^*$ satisfies the equation 

$$\Delta U^* - \frac{1}{c^2} \frac{\partial^2 U^*}{\partial t^2} = -4\pi\gamma\sigma \tag{79.12}$$

But since all integrals in (79.10) are to be taken over the whole of space, relations 
(75.24) and (75.29) may be applied to them, according to which 

$$\int \sigma \frac{\partial U^*}{\partial x_i}\,(dx)^3 = \frac{1}{c^2} \frac{d}{dt} \int \rho \frac{\partial^2 W}{\partial x_i\, \partial t}\,(dx)^3 \tag{79.13}$$

and 

$$\int \rho v_k \frac{\partial U_k}{\partial x_i}\,(dx)^3 = 0 \tag{79.14}$$

Hence it follows that 

$$\frac{dP_i}{dt} = 0; \qquad P_i = \text{const.} \tag{79.15}$$


so that Pi is a constant of the motion. This result represents a generalization 
of the relation (75.34) for the general case of a system of elastic bodies inter¬ 
acting by means of the gravitational field. 

We now go on to the formulation of the conservation law for the angular 
momentum of such a system. Introducing the expression (70.22) for $g\nabla_\alpha T^{\alpha i}$ 
into (79.07) and using the relation (70.26) for the components of the mass 
tensor we get 



P + + n + 317) j (Xiv k — x k Vi) — —fpjkVjXi — pnvfx^dxf 


4 


c 2 



(79.16) 


where for brevity we have put 

$$\frac{dU_i}{dt} = \frac{\partial U_i}{\partial t} + v_j \frac{\partial U_i}{\partial x_j} \tag{79.17}$$


If in (79.16) we extend all integrals over the region occupied by one of the masses 
we obtain a generalization of the relation (72.04) that gives the law of the change 
of angular momentum in Newtonian approximation. We are here interested 
in the total angular momentum so that the integrals must be extended over 
the whole of space. 

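We introduce the notation 

$$G_i = \rho v_i \left\{ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi + 3U \right) \right\} - \frac{1}{c^2}\, p_{ik} v_k - \frac{4}{c^2}\, \rho U_i - \frac{1}{c^2}\, \rho \frac{\partial^2 W}{\partial x_i\, \partial t} \tag{79.18}$$

using which, we can write equation (79.09) for the total momentum as 

$$P_i = \int G_i\,(dx)^3 \tag{79.19}$$

We can then represent (79.16) in the form 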


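$$\frac{d}{dt} \int (x_i G_k - x_k G_i)(dx)^3 = \int \left( x_i \frac{\partial U^*}{\partial x_k} - x_k \frac{\partial U^*}{\partial x_i} \right) \sigma\,(dx)^3 - \frac{1}{c^2} \frac{d}{dt} \int \rho \left( x_i \frac{\partial^2 W}{\partial x_k\, \partial t} - x_k \frac{\partial^2 W}{\partial x_i\, \partial t} \right)(dx)^3 - \frac{4}{c^2} \int \rho v_j \left( x_i \frac{\partial U_j}{\partial x_k} - x_k \frac{\partial U_j}{\partial x_i} \right)(dx)^3 - \frac{4}{c^2} \int \rho\,(v_i U_k - v_k U_i)(dx)^3 \tag{79.20}$$

We prove that the first two integrals on the right cancel and the remaining 
two are separately zero, so that the entire right-hand side vanishes. 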

In evaluating these integrals we shall use the following lemma: 

If the function $\psi$ may be expressed in terms of the density $\rho$ as 

$$\psi = \int f(|\mathbf{r} - \mathbf{r}'|)\,\rho(\mathbf{r}')\,(dx')^3 \tag{79.21}$$

where $f$ is some function of the distance $|\mathbf{r} - \mathbf{r}'|$, then the relation 

$$\int \left( x_i \frac{\partial \psi}{\partial x_k} - x_k \frac{\partial \psi}{\partial x_i} \right) \rho\,(dx)^3 = 0 \tag{79.22}$$

is true. It is assumed here that in equation (79.21) one may differentiate under 
the integral sign. To prove this it is sufficient to insert the values of the derivatives 
of $\psi$, as calculated from (79.21), into (79.22) and to note that in the 
resulting double integral the integrand is antisymmetric in the coordinates of 
the two points $\mathbf{r}$ and $\mathbf{r}'$. 
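
The antisymmetry argument of the lemma can be illustrated numerically. The following is only a sketch under stated assumptions: the continuous density is replaced by a handful of point masses, the self-term is dropped, and the helper names (`antisymmetric_moment`, `largest_entry`) are hypothetical and not part of the text. It checks the discrete analogue of (79.22) for two choices of $f$, namely $f = 1/r$ and $f = r/2$.

```python
import math
import random

def antisymmetric_moment(masses, points, dfdr):
    """Point-mass analogue of the left-hand side of (79.22).

    psi at body a is sum_{b != a} m_b * f(|r_a - r_b|); dfdr is f'(r).
    Returns T[i][k] = sum_a m_a * (x_i * dpsi/dx_k - x_k * dpsi/dx_i).
    """
    T = [[0.0] * 3 for _ in range(3)]
    for a, (ma, ra) in enumerate(zip(masses, points)):
        grad = [0.0, 0.0, 0.0]  # gradient of psi at the position of body a
        for b, (mb, rb) in enumerate(zip(masses, points)):
            if a == b:
                continue
            d = [ra[j] - rb[j] for j in range(3)]
            dist = math.sqrt(sum(c * c for c in d))
            for j in range(3):
                grad[j] += mb * dfdr(dist) * d[j] / dist
        for i in range(3):
            for k in range(3):
                T[i][k] += ma * (ra[i] * grad[k] - ra[k] * grad[i])
    return T

def largest_entry(matrix):
    """Largest absolute value among the entries of a nested list."""
    return max(abs(v) for row in matrix for v in row)

random.seed(1)  # fixed seed so the illustration is reproducible
masses = [random.uniform(0.5, 2.0) for _ in range(6)]
points = [[random.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(6)]

# f = 1/r (Newtonian kernel) and f = r/2 (kernel of the function W)
residual_newton = largest_entry(antisymmetric_moment(masses, points, lambda r: -1.0 / r**2))
residual_linear = largest_entry(antisymmetric_moment(masses, points, lambda r: 0.5))
print(residual_newton, residual_linear)
```

Both residuals should vanish up to rounding error, because each pair of bodies contributes two terms of opposite sign.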

We have already encountered a similar relation when deriving (71.14). 
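Remembering equation (77.07) we may write 

$$U^* = U_\sigma + \frac{1}{c^2} \frac{\partial^2 W}{\partial t^2} \tag{79.23}$$

where $U_\sigma = U + \ldots$ is a solution of the equation 

$$\Delta U_\sigma = -4\pi\gamma\sigma \tag{79.24}$$

The second term in (79.23), in which $W$ denotes the quantity 

$$W = \frac{1}{2}\gamma \int_{(\infty)} \rho'\,|\mathbf{r} - \mathbf{r}'|\,(dx')^3 \tag{79.25}$$

is a retardation correction. 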

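We consider the first integral on the right-hand side of (79.20). By the lemma 
just proved 

$$\int \left( x_i \frac{\partial U_\sigma}{\partial x_k} - x_k \frac{\partial U_\sigma}{\partial x_i} \right) \sigma\,(dx)^3 = 0 \tag{79.26}$$

Therefore we have 

$$\int \left( x_i \frac{\partial U^*}{\partial x_k} - x_k \frac{\partial U^*}{\partial x_i} \right) \sigma\,(dx)^3 = \frac{1}{c^2} \int \left( x_i \frac{\partial^3 W}{\partial t^2\, \partial x_k} - x_k \frac{\partial^3 W}{\partial t^2\, \partial x_i} \right) \rho\,(dx)^3 \tag{79.27}$$

On the other hand, for 

$$f = \tfrac{1}{2}\gamma\,|\mathbf{r} - \mathbf{r}'|; \qquad \rho \to \frac{\partial \rho}{\partial t}; \qquad \psi = \frac{\partial W}{\partial t} \tag{79.28}$$

the lemma gives the relation 

$$\int \left( x_i \frac{\partial^2 W}{\partial t\, \partial x_k} - x_k \frac{\partial^2 W}{\partial t\, \partial x_i} \right) \frac{\partial \rho}{\partial t}\,(dx)^3 = 0 \tag{79.29}$$

Using this we get 

$$\frac{d}{dt} \int \rho \left( x_i \frac{\partial^2 W}{\partial t\, \partial x_k} - x_k \frac{\partial^2 W}{\partial t\, \partial x_i} \right)(dx)^3 = \int \rho \left( x_i \frac{\partial^3 W}{\partial x_k\, \partial t^2} - x_k \frac{\partial^3 W}{\partial x_i\, \partial t^2} \right)(dx)^3 \tag{79.30}$$

and therefore 

$$\frac{1}{c^2} \frac{d}{dt} \int \rho \left( x_i \frac{\partial^2 W}{\partial t\, \partial x_k} - x_k \frac{\partial^2 W}{\partial t\, \partial x_i} \right)(dx)^3 = \int \left( x_i \frac{\partial U^*}{\partial x_k} - x_k \frac{\partial U^*}{\partial x_i} \right) \sigma\,(dx)^3 \tag{79.31}$$

so that the first two integrals on the right of (79.20) do indeed cancel. 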

The third integral on the right of (79.20) vanishes as a result of the same 
lemma; one has simply to apply it to 

$$f = \frac{\gamma}{|\mathbf{r} - \mathbf{r}'|}; \qquad \rho \to \rho v_j; \qquad \psi = U_j \tag{79.32}$$

Finally, the last integral vanishes because Poisson’s equation holds for $U_i$. 
Thus the entire right-hand side of (79.20) is zero and we have 

$$\frac{d}{dt} \int_{(\infty)} (x_i G_k - x_k G_i)(dx)^3 = 0 \tag{79.33}$$

If we introduce the total angular momentum of the system, 

$$M_{ik} = \int_{(\infty)} (x_i G_k - x_k G_i)(dx)^3 \tag{79.34}$$

we have 

$$M_{ik} = \text{const.} \tag{79.35}$$

We recall that in these equations the quantity $G_i$ has the value (79.18) and 
differs little from the momentum density $\rho v_i$. 

We now go on to the formulation of the law of energy conservation for a 
system of bodies. In contrast to the other conservation laws, the law of energy 
conservation can be derived from (79.06) only in Newtonian approximation, 
the accuracy of calculations being the same as for other laws. A more exact 
form of the energy conservation law can be obtained by discussing the Lagrangian 
form of the equations for the system of bodies. For non-rotating masses 
this will be done in the next section. 

Since the main terms $\rho$ and $\rho v_i$ in the components $c^2 T^{00}$ and $c^2 T^{0i}$ satisfy 
the equation of continuity (79.01) the relation (79.06) amounts to the same as 

$$c^2 \int \nabla_\alpha T^{\alpha 0}\,(dx)^3 = 0 \tag{79.36}$$

to the order of approximation considered. Introducing the expression (65.23) 
for the divergence into the last equation we find 

$$\frac{d}{dt} \int c^2 T^{00}\,(dx)^3 + \int \frac{\partial U}{\partial t}\, T^{00}\,(dx)^3 = 0 \tag{79.37}$$

and, using the value (70.26) for $T^{00}$ we get 

$$\frac{d}{dt} \int \rho \left\{ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - U \right) \right\}(dx)^3 + \frac{1}{c^2} \int \rho \frac{\partial U}{\partial t}\,(dx)^3 = 0 \tag{79.38}$$

Since separately 

$$M^0 = \int \rho\,(dx)^3 = \text{const.} \tag{79.39}$$

we must have 

$$\frac{d}{dt} \int \rho \left( \tfrac{1}{2} v^2 + \Pi - U \right)(dx)^3 + \int \rho \frac{\partial U}{\partial t}\,(dx)^3 = 0 \tag{79.40}$$

To obtain from this a conservation law in the narrower sense we must transform 
the second integral in (79.40) into the form of a time derivative. This is 
easily done with the aid of the relation 

$$\int \left( \rho \frac{\partial U}{\partial t} - U \frac{\partial \rho}{\partial t} \right)(dx)^3 = 0 \tag{79.41}$$

which follows from Poisson’s equation for $U$. Subtracting half of (79.41) from 
(79.40) and putting 

$$E = \int \rho \left( \tfrac{1}{2} v^2 + \Pi - \tfrac{1}{2} U \right)(dx)^3 \tag{79.42}$$

we get 

$$\frac{dE}{dt} = 0, \qquad E = \text{const.} \tag{79.43}$$

The integrals of the separate terms in (79.42) represent the kinetic, elastic 
and gravitational energies of the system of bodies in Newtonian approximation. 
If we do not separate off the terms (79.39) from (79.38) and put 

$$M = \int \rho \left\{ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - \tfrac{1}{2} U \right) \right\}(dx)^3 \tag{79.44}$$

we also get 

$$M = \text{const.} \tag{79.45}$$

The quantity $M$ is the total mass of the system of bodies. It is equal to 

$$M = M^0 + \frac{E}{c^2} \tag{79.46}$$

where $M^0$ and $E$ are defined by (79.39) and (79.42). 

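It remains for us to examine the integrals of the mass centre motion of the 
system of bodies. They may be obtained starting from the relation (79.08). 
This amounts to transforming the expression 

$$\frac{d}{dt} \int x_i \rho \left\{ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - U \right) \right\}(dx)^3 = \int \rho v_i \left\{ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - U \right) \right\}(dx)^3 + \frac{1}{c^2} \int x_i \rho v_j w_j\,(dx)^3 + \frac{1}{c^2} \int x_i \rho \left( \frac{d\Pi}{dt} - \frac{dU}{dt} \right)(dx)^3 \tag{79.47}$$

with the aid of the equations (79.01) to (79.04). Inserting the value for $\rho w_j$ 
given by (79.02) into (79.47) we get as the sum of the last two integrals 

$$\frac{1}{c^2} \int x_i \rho \left( v_j w_j + \frac{d\Pi}{dt} - \frac{dU}{dt} \right)(dx)^3 = -\frac{1}{c^2} \int p_{ik} v_k\,(dx)^3 - \frac{1}{c^2} \int x_i \rho \frac{\partial U}{\partial t}\,(dx)^3 \tag{79.48}$$

Using the notation (79.18) we can write the result of inserting (79.48) into 
(79.47) as 

$$\frac{d}{dt} \int x_i \rho \left\{ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - U \right) \right\}(dx)^3 = \int G_i\,(dx)^3 - \frac{4}{c^2} \int \rho v_i U\,(dx)^3 + \frac{4}{c^2} \int \rho U_i\,(dx)^3 + \frac{1}{c^2} \int \rho \frac{\partial^2 W}{\partial x_i\, \partial t}\,(dx)^3 - \frac{1}{c^2} \int x_i \rho \frac{\partial U}{\partial t}\,(dx)^3 \tag{79.49}$$

Adding to this the obvious equation 

$$\frac{1}{2c^2} \frac{d}{dt} \int x_i \rho U\,(dx)^3 = \frac{1}{2c^2} \int x_i \left( \rho \frac{\partial U}{\partial t} + U \frac{\partial \rho}{\partial t} \right)(dx)^3 \tag{79.50}$$

we can replace (79.49) by 

$$\frac{d}{dt} \int x_i \rho \left\{ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - \tfrac{1}{2} U \right) \right\}(dx)^3 = \int G_i\,(dx)^3 - \frac{4}{c^2} \int \rho v_i U\,(dx)^3 + \frac{4}{c^2} \int \rho U_i\,(dx)^3 + \frac{1}{c^2} \int \rho \frac{\partial^2 W}{\partial x_i\, \partial t}\,(dx)^3 - \frac{1}{2c^2} \int x_i \left( \rho \frac{\partial U}{\partial t} - U \frac{\partial \rho}{\partial t} \right)(dx)^3 \tag{79.51}$$

This equation is valid if the region of integration includes one or some of the 
masses, but if the integration is extended over all space it can be simplified. 
By using the Poisson equations satisfied by $U$ and $U_i$ we get 

$$\int \rho v_i U\,(dx)^3 = \int \rho U_i\,(dx)^3 \tag{79.52}$$

We also have 

$$\int x_i \left( \rho \frac{\partial U}{\partial t} - U \frac{\partial \rho}{\partial t} \right)(dx)^3 = 2 \int \rho \frac{\partial^2 W}{\partial x_i\, \partial t}\,(dx)^3 \tag{79.53}$$

The latter relation can be most simply verified by noting that the function 

$$Q_i = x_i U - 2 \frac{\partial W}{\partial x_i} \tag{79.54}$$

satisfies a Poisson equation of the form 

$$\Delta Q_i = -4\pi\gamma\rho x_i \tag{79.55}$$

As a result of (79.52) and (79.53) all integrals on the right-hand side of the 
equation, except the first, cancel and we get 

$$\frac{d}{dt} \int x_i \rho \left\{ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - \tfrac{1}{2} U \right) \right\}(dx)^3 = \int G_i\,(dx)^3 = P_i \tag{79.56}$$

where, by (79.15), $P_i$ is constant. Therefore equation (79.56) may be written 
in an integrated form, namely as 

$$\int x_i \rho \left\{ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - \tfrac{1}{2} U \right) \right\}(dx)^3 - t P_i = K_i \tag{79.57}$$

where $K_i$ is a new constant. Remembering that the total mass $M$ as defined 
by (79.44) is constant we can introduce three quantities $X_i$ by means of the 
relations 

$$M X_i = \int x_i \rho \left\{ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - \tfrac{1}{2} U \right) \right\}(dx)^3 \tag{79.58}$$

These quantities can be interpreted as the coordinates of the mass centre of 
the system of masses, and equation (79.57), written as 

$$M X_i - t P_i = K_i \tag{79.59}$$

can be interpreted as the law of motion for the mass centre. Equation (79.08), 
which was our starting point, may be considered to be the result of differentiating 
(79.59) with respect to time. 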
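
The symmetry that underlies relation (79.52) can be seen directly for point masses. The sketch below is an illustration only, assuming point bodies in place of the continuous density, units with $\gamma = 1$, and a hypothetical helper name `momentum_potential_sums`. It compares $\sum_a m_a v_a U(a)$ with $\sum_a m_a U_i(a)$, where $U$ and $U_i$ are built from the other bodies.

```python
import math
import random

GAMMA = 1.0  # gravitational constant in arbitrary units (illustrative assumption)

def momentum_potential_sums(masses, points, velocities):
    """Point-mass analogue of the two sides of (79.52)."""
    lhs = [0.0, 0.0, 0.0]  # sum_a m_a v_a U(a),  U(a)   = sum_b gamma m_b / r_ab
    rhs = [0.0, 0.0, 0.0]  # sum_a m_a U_i(a),    U_i(a) = sum_b gamma m_b v_b / r_ab
    n = len(masses)
    for a in range(n):
        for b in range(n):
            if a == b:
                continue
            weight = GAMMA * masses[a] * masses[b] / math.dist(points[a], points[b])
            for i in range(3):
                lhs[i] += weight * velocities[a][i]
                rhs[i] += weight * velocities[b][i]
    return lhs, rhs

random.seed(2)  # fixed seed so the illustration is reproducible
count = 5
masses = [random.uniform(0.5, 2.0) for _ in range(count)]
points = [[random.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(count)]
velocities = [[random.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(count)]

lhs, rhs = momentum_potential_sums(masses, points, velocities)
mismatch = max(abs(x - y) for x, y in zip(lhs, rhs))
print(lhs, rhs, mismatch)
```

The two sums agree because the pair weight $\gamma m_a m_b / r_{ab}$ is symmetric in the two labels, which is the discrete counterpart of applying Green's theorem to the two Poisson equations.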

80. Additional Remarks on the Problem of the Motion of a System of 
Bodies. The Explicit Form of the Integrals of Motion for the Case 
of Non-Rotating Masses 

In the preceding section we introduced the integrals of motion for a system 
of bodies, assuming the validity within each body of the non-relativistic 
equations of motion for a continuous medium (79.01) to (79.04). The question 
arises whether the integrals of motion so found are still constant if the equations 
of motion of a continuous medium are taken to be only approximately valid 
within the bodies, as was assumed in Section 73, and if instead it is only postulated 
that the equations of motion of the bodies as a whole are satisfied. In 
this case one has, of course, to reintroduce the assumption that the bodies 
rotate as if rigid. 

As regards the momentum, the question just posed is very simply answered. 
The equations of motion obtained in Sections 75 and 77 amounted precisely 
to the statement that the quantities 

$$P_{ai} = \int_{(a)} G_i\,(dx)^3 \tag{80.01}$$

satisfy the relation 

$$\frac{dP_{ai}}{dt} = F_{ai} \tag{80.02}$$

where, by (75.33) 

$$\sum_a F_{ai} = 0 \tag{80.03}$$

Therefore the fact that the quantities 

$$P_i = \int G_i\,(dx)^3 \tag{80.04}$$

are constant, is a consequence of the equations of motion of the bodies as a 
whole. 

The same applies to the quantities 

$$M_{ik} = \int (x_i G_k - x_k G_i)(dx)^3 \tag{80.05}$$

which represent the angular momentum of the system. In Section 72 we wrote 
down the equations of rotational motion only in Newtonian approximation. 
But the more precise equations for the sum of orbital and intrinsic angular 
momentum are precisely of the form 

$$\frac{d}{dt} \int_{(a)} (x_i G_k - x_k G_i)(dx)^3 = L^{(a)}_{ik} \tag{80.06}$$

where $L^{(a)}_{ik}$ is the sum of the same integrals that appear on the right-hand 
side of (79.20), except that here the integrals are extended over the region of 
mass $(a)$ and not over the whole of space. Now, as was proved in Section 79, 
the relation 

$$\sum_a L^{(a)}_{ik} = 0 \tag{80.07}$$

is valid and hence the constancy of the $M_{ik}$ is a consequence of the rotational 
equations (80.06). 

We now verify that the relation 

$$\int_{(a)} g\,\nabla_\alpha T^{\alpha 0}\,(dx)^3 = 0 \tag{80.08}$$

is satisfied. It must hold in order that the corresponding harmonic condition 
be satisfied. On the other hand, if it does hold, so will the relation (79.06) 
which is connected with the energy integral. 

The transformations that took us from (79.06) to (79.38) are also applicable 
in the case when all integrals are extended over the volume of a single mass. 
Therefore we may write immediately 

$$\frac{d}{dt} \int_{(a)} \rho \left\{ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - U \right) \right\}(dx)^3 + \frac{1}{c^2} \int_{(a)} \rho \frac{\partial U}{\partial t}\,(dx)^3 = 0 \tag{80.09}$$

and since, separately 

$$\int_{(a)} \rho\,(dx)^3 = M_a = \text{const.} \tag{80.10}$$

we must have 

$$\frac{d}{dt} \int_{(a)} \rho \left( \tfrac{1}{2} v^2 + \Pi - U \right)(dx)^3 + \int_{(a)} \rho \frac{\partial U}{\partial t}\,(dx)^3 = 0 \tag{80.11}$$

This relation is fulfilled as a result of the equations of motion of a continuous 
medium (79.01) to (79.04). We must show that it will also be approximately 
satisfied if the equations of motion of a continuous medium are satisfied only 
in the mean and the body is taken to be rotating as if rigid, but not necessarily 
with constant angular velocity (see Section 73). In this case our problem 
reduces to verifying the relation 

$$\int_{(a)} \rho v_i \left( w_i - \frac{\partial U}{\partial x_i} \right)(dx)^3 = 0 \tag{80.12}$$

which follows from (80.11) by differentiation when account is taken of the fact 
that the elastic energy remains constant. Here $w_i$ is the acceleration for which 
the expression (73.07) was deduced in Section 73; the integrand in (80.12) is 
$v_i$ times the left-hand side of (73.08). The equation (80.12) is readily verifiable 
with the use of the relation 

f T /8 2 U (a) \ (80.13) 

(a) 

which may be written as 

^ = W—^ (80.14) 

dt 2 11 \8xi dxjJ a 

This relation coincides with (72.32) and is satisfied by virtue of the equation 
of the rotational motion of the body. The verification of (80.11) may also be 
performed as follows. Dividing the potential $U$ into an internal and an external 
part and using equations (71.19) and (72.09) which express the fact that the 
resultant of the internal forces and of their moments is zero, we may write 

$$\int_{(a)} \rho \frac{\partial u_a}{\partial t}\,(dx)^3 = -\int_{(a)} \rho v_i \frac{\partial u_a}{\partial x_i}\,(dx)^3 = 0 \tag{80.15}$$

Further it is evident that 

$$\frac{d}{dt} \int_{(a)} \rho\,(\Pi - u_a)(dx)^3 = 0 \tag{80.16}$$

Therefore (80.11) reduces to an equation involving only the external potential 
$U^{(a)}$, namely 

$$\frac{d}{dt} \int_{(a)} \rho \left( \tfrac{1}{2} v^2 - U^{(a)} \right)(dx)^3 + \int_{(a)} \rho \frac{\partial U^{(a)}}{\partial t}\,(dx)^3 = 0 \tag{80.17}$$

Inserting the value (71.27) for $U^{(a)}(\mathbf{r})$ into this equation and using the equations 
(71.35) and (72.32) for the translational and rotational motions we see that 
(80.17) is satisfied. 

Thus the validity of the relation (80.08) which must be required to ensure 
that the harmonic condition is satisfied is, in fact, a consequence of the remaining 
relations of the form (70.21), just as in point mechanics the equation expressing 
the conservation of energy is a consequence of the equations of motion. 

Similarly it can be verified that the equations of translational and of rotational 
motion ensure the validity of the relation (79.51) and hence of the law of motion 
of the mass centre. 

As we have already noted in Section 79 the accuracy of our approximation 
permits a derivation of the law of conservation of energy from (79.06) only in 
Newtonian approximation. However, for non-rotating bodies with spherical 
symmetry it is not difficult to write down the energy integral in the next 
approximation also. In this case the motion will be purely translational, and 
for such motion we have derived the Lagrangian form of its equations. Therefore 
we can base the derivation of the energy integral on the Lagrangian 
examined in Section 78. We introduce the effective mass 


m a -Ma+\j 2 

(80.18) 

and then have, by (78.03) 

$$L = K + K_1 - \Phi - \Psi \tag{80.19}$$

where by (76.19) 

$$K = \sum_a \tfrac{1}{2} m_a \dot{\mathbf{a}}^2 + \frac{1}{8c^2} \sum_a m_a (\dot{\mathbf{a}}^2)^2 \tag{80.20}$$

As before, the quantity $K_1$ is given by the equation (78.05) in which we can 
replace $M_a$ and $M_b$ by the effective masses $m_a$ and $m_b$: 




1 V Y w « m & ,o • 


a, b 


la — b| 


( 3d* + 3 b\ - 8d^) +^2 WVA | a - b [ 

(80.21) 


a , b 


$\Phi$ is the Newtonian potential energy for the effective masses: 

$$\Phi = -\frac{1}{2} \sum_{a,b} \frac{\gamma m_a m_b}{|\mathbf{a} - \mathbf{b}|} \tag{80.22}$$

In view of (78.13) this expression includes the terms $\Phi_1$ and $\Phi_2$ (incidentally 
$\Phi_2 = 0$ because of the spherical symmetry). Finally, $\Psi$ has its previous value 
(78.15), but in the approximation considered the masses involved may also be 
replaced by effective masses: 


y = 4- y 


Y 2 V m a m bi m a + mb) X 


4c!! ^ 

0 a, b 


X 


(in 


a —b| 
1 


+ 


6 cl 


„2 

0 a, b.o 


a — c 


b —c b — a 


+ 


■=*.) 


Using the relation 

$$\sum_a \dot{a}_i \frac{\partial L}{\partial \dot{a}_i} - L = E \tag{80.24}$$

we obtain the energy integral (with the non-relativistic choice of the zero of 
energy) in the form 

$$E = \sum_a \tfrac{1}{2} m_a \dot{\mathbf{a}}^2 + \frac{3}{8c^2} \sum_a m_a (\dot{\mathbf{a}}^2)^2 + K_1 + \Phi + \Psi \tag{80.25}$$

The remaining integrals of the motion can also be readily derived from the 
Lagrangian. We obtain the momentum integrals 



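$$\sum_a P_{ai} = P_i \tag{80.26}$$

where 

$$P_{ai} = \frac{\partial L}{\partial \dot{a}_i} = \dot{a}_i \left( m_a + \frac{1}{2c^2} m_a \dot{\mathbf{a}}^2 - \frac{1}{2c^2} \sum_b \frac{\gamma m_a m_b}{|\mathbf{a} - \mathbf{b}|} \right) + \frac{7}{2c^2} \sum_b \frac{\gamma m_a m_b (\dot{a}_i - \dot{b}_i)}{|\mathbf{a} - \mathbf{b}|} - \frac{1}{2c^2} \sum_b \frac{\gamma m_a m_b (a_i - b_i)(a_k - b_k)\dot{b}_k}{|\mathbf{a} - \mathbf{b}|^3} \tag{80.27}$$

the angular momentum integrals 

$$\sum_a (a_i P_{ak} - a_k P_{ai}) = M_{ik} \tag{80.28}$$

and the integrals of the mass centre motion† 

$$M X_i - P_i t = K_i = \text{const.} \tag{80.29}$$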

where 

a b * • 

(80.30) 

and where 

a a. b 1 

(80.31) 

is the total mass of the system. 

A more exact value of the total mass is 

$$M = \sum_a m_a + \frac{E}{c^2} \tag{80.32}$$

where $E$ has the value (80.25). 


81. The Problem of Two Bodies of Finite Mass 

In the case of two bodies the equations of motion obtainable from the 
Lagrangian (80.19) can be integrated. In this connection it is convenient to 
change to a notation which is more usual in mechanics. To label the two masses 
we shall now use the numbers 1 and 2 instead of the letters $a$ and $b$, and we 
shall denote the coordinates of the two particles by $x_1, y_1, z_1$ and $x_2, y_2, z_2$ 
respectively instead of using $a$ and $b$ here also. 

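We shall also use the three-dimensional vector symbols $\mathbf{r}_1$ and $\mathbf{r}_2$ to describe 
the positions of the two masses. In the new notation the Lagrangian becomes 

$$L = \tfrac{1}{2} m_1 \dot{\mathbf{r}}_1^2 + \tfrac{1}{2} m_2 \dot{\mathbf{r}}_2^2 + \frac{m_1}{8c^2} (\dot{\mathbf{r}}_1^2)^2 + \frac{m_2}{8c^2} (\dot{\mathbf{r}}_2^2)^2 + \frac{\gamma m_1 m_2}{|\mathbf{r}_1 - \mathbf{r}_2|} + \frac{\gamma m_1 m_2}{2c^2 |\mathbf{r}_1 - \mathbf{r}_2|} \left( 3\dot{\mathbf{r}}_1^2 + 3\dot{\mathbf{r}}_2^2 - 7\,\dot{\mathbf{r}}_1 \cdot \dot{\mathbf{r}}_2 \right) - \frac{\gamma m_1 m_2}{2c^2 |\mathbf{r}_1 - \mathbf{r}_2|^3} \{\dot{\mathbf{r}}_1 \cdot (\mathbf{r}_1 - \mathbf{r}_2)\}\{\dot{\mathbf{r}}_2 \cdot (\mathbf{r}_1 - \mathbf{r}_2)\} - \frac{\gamma^2 m_1 m_2 (m_1 + m_2)}{2c^2 |\mathbf{r}_1 - \mathbf{r}_2|^2} \tag{81.01}$$

and the energy integral appears as 

$$E = \tfrac{1}{2} m_1 \dot{\mathbf{r}}_1^2 + \tfrac{1}{2} m_2 \dot{\mathbf{r}}_2^2 + \frac{3 m_1}{8c^2} (\dot{\mathbf{r}}_1^2)^2 + \frac{3 m_2}{8c^2} (\dot{\mathbf{r}}_2^2)^2 + \frac{\gamma m_1 m_2}{2c^2 |\mathbf{r}_1 - \mathbf{r}_2|} \left( 3\dot{\mathbf{r}}_1^2 + 3\dot{\mathbf{r}}_2^2 - 7\,\dot{\mathbf{r}}_1 \cdot \dot{\mathbf{r}}_2 \right) - \frac{\gamma m_1 m_2}{2c^2 |\mathbf{r}_1 - \mathbf{r}_2|^3} \{\dot{\mathbf{r}}_1 \cdot (\mathbf{r}_1 - \mathbf{r}_2)\}\{\dot{\mathbf{r}}_2 \cdot (\mathbf{r}_1 - \mathbf{r}_2)\} - \frac{\gamma m_1 m_2}{|\mathbf{r}_1 - \mathbf{r}_2|} + \frac{\gamma^2 m_1 m_2 (m_1 + m_2)}{2c^2 |\mathbf{r}_1 - \mathbf{r}_2|^2} \tag{81.02}$$

† The constants $K_i$ should not be confused with the quantities $K_1$ and $K_2$ appearing in the 
Lagrangian. 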


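The equation (80.27) for the momentum of one of the masses should also be 
restated in the new notation. The component $P^{(1)}_x$, for instance, is given by 

$$P^{(1)}_x = \dot{x}_1 \left( m_1 + \frac{1}{2c^2} m_1 \dot{\mathbf{r}}_1^2 - \frac{1}{2c^2} \frac{\gamma m_1 m_2}{|\mathbf{r}_1 - \mathbf{r}_2|} \right) + \frac{7}{2c^2} \frac{\gamma m_1 m_2 (\dot{x}_1 - \dot{x}_2)}{|\mathbf{r}_1 - \mathbf{r}_2|} - \frac{\gamma m_1 m_2 (x_1 - x_2)}{2c^2 |\mathbf{r}_1 - \mathbf{r}_2|^3} \{\dot{\mathbf{r}}_2 \cdot (\mathbf{r}_1 - \mathbf{r}_2)\} \tag{81.03}$$

The momentum of the other mass can be obtained from the last expression by 
interchanging the suffixes 1 and 2, and the total momentum will be given by 

$$P_x = P^{(1)}_x + P^{(2)}_x, \quad \text{etc.} \tag{81.04}$$

Further, we state the equation for one of the components of angular momentum, 
for instance 

$$M_{xy} = x_1 P^{(1)}_y - y_1 P^{(1)}_x + x_2 P^{(2)}_y - y_2 P^{(2)}_x \tag{81.05}$$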

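We have 

$$M_{xy} = (x_1 \dot{y}_1 - y_1 \dot{x}_1)\left( m_1 + \frac{1}{2c^2} m_1 \dot{\mathbf{r}}_1^2 + \frac{3}{c^2} \frac{\gamma m_1 m_2}{|\mathbf{r}_1 - \mathbf{r}_2|} \right) + (x_2 \dot{y}_2 - y_2 \dot{x}_2)\left( m_2 + \frac{1}{2c^2} m_2 \dot{\mathbf{r}}_2^2 + \frac{3}{c^2} \frac{\gamma m_1 m_2}{|\mathbf{r}_1 - \mathbf{r}_2|} \right) - \frac{7 \gamma m_1 m_2}{2c^2 |\mathbf{r}_1 - \mathbf{r}_2|} (x_1 \dot{y}_2 - y_1 \dot{x}_2 + x_2 \dot{y}_1 - y_2 \dot{x}_1) + \frac{\gamma m_1 m_2}{2c^2 |\mathbf{r}_1 - \mathbf{r}_2|^3} \{(\dot{\mathbf{r}}_1 + \dot{\mathbf{r}}_2) \cdot (\mathbf{r}_1 - \mathbf{r}_2)\}(x_1 y_2 - x_2 y_1) \tag{81.06}$$

If we introduce a system of coordinates fixed to the mass centre we can write 
the mass centre integral as 

$$\mathbf{r}_1 \left( m_1 + \frac{1}{2c^2} m_1 \dot{\mathbf{r}}_1^2 - \frac{1}{2c^2} \frac{\gamma m_1 m_2}{|\mathbf{r}_1 - \mathbf{r}_2|} \right) + \mathbf{r}_2 \left( m_2 + \frac{1}{2c^2} m_2 \dot{\mathbf{r}}_2^2 - \frac{1}{2c^2} \frac{\gamma m_1 m_2}{|\mathbf{r}_1 - \mathbf{r}_2|} \right) = 0 \tag{81.07}$$

We now introduce the coordinates of the mass centre in the Newtonian sense 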


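$$\mathbf{r}_0 = \frac{m_1 \mathbf{r}_1 + m_2 \mathbf{r}_2}{m_1 + m_2} \tag{81.08}$$

and also the relative coordinates 

$$\mathbf{r} = \mathbf{r}_1 - \mathbf{r}_2 \tag{81.09}$$

Then we have 

$$\mathbf{r}_1 = \mathbf{r}_0 + \frac{m_2}{m_0} \mathbf{r}; \qquad \mathbf{r}_2 = \mathbf{r}_0 - \frac{m_1}{m_0} \mathbf{r} \tag{81.10}$$

where 

$$m_0 = m_1 + m_2 \tag{81.11}$$

is the total mass. Further, we introduce the notation 

$$m^* = \frac{m_1 m_2}{m_1 + m_2} \tag{81.12}$$

for the reduced mass and note that 

$$m_0 m^* = m_1 m_2 \tag{81.13}$$

We also have 

$$m_1 \dot{\mathbf{r}}_1^2 + m_2 \dot{\mathbf{r}}_2^2 = m_0 \dot{\mathbf{r}}_0^2 + m^* \dot{\mathbf{r}}^2 \tag{81.14}$$

$$m_1 (x_1 \dot{y}_1 - y_1 \dot{x}_1) + m_2 (x_2 \dot{y}_2 - y_2 \dot{x}_2) = m_0 (x_0 \dot{y}_0 - y_0 \dot{x}_0) + m^* (x \dot{y} - y \dot{x}) \tag{81.15}$$

and 

$$x_1 y_2 - x_2 y_1 = x y_0 - y x_0 \tag{81.16}$$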
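
The identities (81.13) to (81.16) are purely algebraic consequences of the definitions (81.08) to (81.12), so they can be checked with arbitrary numbers. The following sketch does this for random positions and velocities; the helper names (`to_centre_and_relative`, `dot`, `cross_z`) and the mass values are illustrative assumptions.

```python
import random

def to_centre_and_relative(m1, m2, v1, v2):
    """Apply (81.08) and (81.09) to a pair of vectors (positions or velocities)."""
    m0 = m1 + m2
    centre = [(m1 * a + m2 * b) / m0 for a, b in zip(v1, v2)]
    relative = [a - b for a, b in zip(v1, v2)]
    return centre, relative

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cross_z(u, v):
    """xy-component u_x v_y - u_y v_x of the antisymmetric product."""
    return u[0] * v[1] - u[1] * v[0]

random.seed(0)  # fixed seed so the illustration is reproducible
m1, m2 = 3.0, 5.0               # arbitrary test masses
m0 = m1 + m2                    # total mass (81.11)
m_star = m1 * m2 / m0           # reduced mass (81.12)
r1, r2, v1, v2 = ([random.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(4))

r0, r = to_centre_and_relative(m1, m2, r1, r2)
v0, v = to_centre_and_relative(m1, m2, v1, v2)

# Each entry holds (left-hand side, right-hand side) of the numbered relation.
checks = {
    "81.13": (m0 * m_star, m1 * m2),
    "81.14": (m1 * dot(v1, v1) + m2 * dot(v2, v2),
              m0 * dot(v0, v0) + m_star * dot(v, v)),
    "81.15": (m1 * cross_z(r1, v1) + m2 * cross_z(r2, v2),
              m0 * cross_z(r0, v0) + m_star * cross_z(r, v)),
    "81.16": (cross_z(r1, r2), cross_z(r, r0)),
}
for label, (left, right) in checks.items():
    print(label, left, right)
```

In (81.14) and (81.15) the mass centre quantities enter only through terms quadratic in $\mathbf{r}_0$ and $\dot{\mathbf{r}}_0$, which is the property used later when these quantities are neglected.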


Equation (81.07) shows directly that the radius vector of the Newtonian mass 
centre will always remain small and its rate of change will also be small. If 
we take the orders of $r$ and $\dot{r}$ to be $R$ and $q$ respectively, we have 

$$r_0 \sim \frac{q^2}{c^2} R; \qquad \dot{r}_0 \sim \frac{q^3}{c^2} \tag{81.17}$$

Therefore in all correction terms that contain the factor $1/c^2$ we may replace 
(81.10) by 

$$\mathbf{r}_1 = \frac{m_2}{m_0} \mathbf{r}; \qquad \mathbf{r}_2 = -\frac{m_1}{m_0} \mathbf{r} \tag{81.18}$$

and 

$$\dot{\mathbf{r}}_1 = \frac{m_2}{m_0} \dot{\mathbf{r}}; \qquad \dot{\mathbf{r}}_2 = -\frac{m_1}{m_0} \dot{\mathbf{r}} \tag{81.19}$$

Making these approximations in equation (81.07) itself we can write it in the 
form† 

$$m_0 \mathbf{r}_0 = \frac{(m_1 - m_2)}{2c^2 m_0}\, m^* \left( \dot{\mathbf{r}}^2 - \frac{\gamma m_0}{r} \right) \mathbf{r} \tag{81.20}$$

Hence it can be concluded that in the case of a bounded motion the Newtonian 
mass centre $\mathbf{r}_0$ will oscillate about its mean position. 

† This form was first derived in the author’s paper of 1941 [38]. 

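We rewrite the integrals of energy and angular momentum using the assumption 
that $\mathbf{r}_0$ is small. We can neglect the quantities $\mathbf{r}_0$ and $\dot{\mathbf{r}}_0$ even in the main 
(Newtonian) terms of (81.02) and (81.06) since according to (81.14) and (81.15) 
they appear quadratically in these; this means that the values (81.18) and 
(81.19) may be used everywhere. Introducing these values into the energy integral 
(81.02) and dividing the result by the reduced mass $m^*$ we find for the quantity 

$$E_0 = \frac{E}{m^*} \tag{81.21}$$

the expression 

$$E_0 = \tfrac{1}{2} \dot{\mathbf{r}}^2 - \frac{\gamma m_0}{r} + \frac{3}{8c^2}\left( 1 - \frac{3m^*}{m_0} \right)(\dot{\mathbf{r}}^2)^2 + \frac{\gamma}{2c^2 r}(3m_0 + m^*)\,\dot{\mathbf{r}}^2 + \frac{\gamma m^*}{2c^2 r^3}(\mathbf{r} \cdot \dot{\mathbf{r}})^2 + \frac{\gamma^2 m_0^2}{2c^2 r^2} \tag{81.22}$$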

We now introduce the values of $\mathbf{r}_1$ and $\mathbf{r}_2$ into the angular momentum integral. 
By (81.16) and (81.20) the last term of (81.06) is equal to zero in high approximation 
and the remaining terms give 

$$\frac{M_{xy}}{m^*} = \left\{ 1 + \frac{1}{2c^2}\left( 1 - \frac{3m^*}{m_0} \right)\dot{\mathbf{r}}^2 + \frac{\gamma}{c^2 r}(3m_0 + m^*) \right\}(x\dot{y} - y\dot{x}) \tag{81.23}$$

Similar expressions result for the other components of angular momentum. 
It is evident from these expressions that in the space of the relative coordinates 
$x, y, z$ the orbit will be plane. Taking the plane of the orbit to be the $xy$ plane 
we can put in the usual way 

$$z = 0; \qquad \dot{z} = 0 \tag{81.24}$$

and we can introduce the polar coordinates $r$ and $\varphi$ by the equations 

$$x = r \cos\varphi; \qquad y = r \sin\varphi \tag{81.25}$$

Denoting the constant on the left-hand side of (81.23) by $\mu$ we get 

$$\mu = \left\{ 1 + \frac{1}{2c^2}\left( 1 - \frac{3m^*}{m_0} \right) v^2 + \frac{\gamma}{c^2 r}(3m_0 + m^*) \right\} r^2 \dot{\varphi} \tag{81.26}$$


Before continuing we compare the integrals of energy and angular momentum 
just derived with those obtained in Section 58 in our investigation of the 
motion of an infinitesimal mass in the field of a large or finite mass. We must 
expect that for m* = 0 the equations of the two-body problem go over into the 
equations of the one-body problem. Let us verify this. As the equations (58.26) 
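to (58.31) show, the following relations held in Section 58: 

$$\frac{r - \alpha}{r + \alpha} \frac{dt}{d\tau} = 1 + \frac{E_0}{c^2} \tag{81.27}$$

and 

$$(r + \alpha)^2 \frac{d\varphi}{d\tau} = \mu \tag{81.28}$$

where 

$$\left( \frac{d\tau}{dt} \right)^2 = \frac{r - \alpha}{r + \alpha} - \frac{1}{c^2}\left[ \frac{r + \alpha}{r - \alpha}\left( \frac{dr}{dt} \right)^2 + (r + \alpha)^2 \left( \frac{d\varphi}{dt} \right)^2 \right] \tag{81.29}$$

and where $\alpha$ is the gravitational radius of the large mass; it may be taken to be 

$$\alpha = \frac{\gamma m_0}{c^2} \tag{81.30}$$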

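Putting 

$$\left( \frac{dr}{dt} \right)^2 + r^2 \left( \frac{d\varphi}{dt} \right)^2 = v^2 \tag{81.31}$$

we may replace (81.29) approximately by 

$$\left( \frac{d\tau}{dt} \right)^2 = \frac{r - \alpha}{r + \alpha}\left[ 1 - \frac{v^2}{c^2}\left( 1 + \frac{4\alpha}{r} \right) \right] \tag{81.32}$$

whence 

$$\frac{dt}{d\tau} = \sqrt{\frac{r + \alpha}{r - \alpha}}\left[ 1 + \frac{v^2}{2c^2}\left( 1 + \frac{4\alpha}{r} \right) + \frac{3v^4}{8c^4} \right] \tag{81.33}$$

and then 

$$\frac{r - \alpha}{r + \alpha} \frac{dt}{d\tau} = 1 - \frac{\alpha}{r} + \frac{v^2}{2c^2} + \frac{3\alpha}{2c^2 r} v^2 + \frac{3v^4}{8c^4} + \frac{\alpha^2}{2r^2} \tag{81.34}$$

Introducing this expression into (81.27) and replacing $\alpha$ by its value (81.30) 
we obtain for $E_0$ the value 

$$E_0 = \tfrac{1}{2} v^2 - \frac{\gamma m_0}{r} + \frac{1}{c^2}\left( \tfrac{3}{8} v^4 + \tfrac{3}{2} \frac{\gamma m_0}{r} v^2 + \frac{\gamma^2 m_0^2}{2r^2} \right) \tag{81.35}$$

which agrees with that given in (81.22) if in the latter $m^*$ is equated to zero. 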
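
The expansion leading from (81.29) to (81.34) can be checked numerically. The sketch below is a limited illustration: it assumes purely tangential motion (so that $dr/dt = 0$ and $v = r\,d\varphi/dt$), works with the dimensionless variables $x = \alpha/r$ and $\beta^2 = v^2/c^2$, and uses hypothetical function names. It compares the exact value of $\frac{r-\alpha}{r+\alpha}\frac{dt}{d\tau}$ with the second-order series.

```python
import math

def exact_factor(x, beta2):
    """((r - alpha)/(r + alpha)) * dt/dtau from (81.29), tangential motion only.

    x = alpha / r and beta2 = (r * dphi/dt)**2 / c**2 are dimensionless.
    """
    a = (1.0 - x) / (1.0 + x)
    dtau_dt_squared = a - beta2 * (1.0 + x) ** 2
    return a / math.sqrt(dtau_dt_squared)

def series_factor(x, beta2):
    """Second-order expansion (81.34) written in the same variables."""
    return (1.0 - x + 0.5 * beta2 + 1.5 * x * beta2
            + 0.375 * beta2 ** 2 + 0.5 * x ** 2)

x, beta2 = 1.0e-3, 1.0e-3          # small illustrative values
difference = abs(exact_factor(x, beta2) - series_factor(x, beta2))
print(exact_factor(x, beta2), series_factor(x, beta2), difference)
```

The remaining difference is of third order in the small quantities, as expected from a series that keeps all terms up to second order.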

In order to compare the equations (81.26) and (81.28) we separate off the 
factor $r^2 \dot{\varphi}$ in the latter and write 

$$\left( 1 + \frac{\alpha}{r} \right)^2 \frac{dt}{d\tau}\, r^2 \frac{d\varphi}{dt} = \mu \tag{81.36}$$

The value of $dt/d\tau$ may be taken from equation (81.33) in which terms of order 
$v^4/c^4$ may be neglected. Then we get 

$$\left( 1 + \frac{\alpha}{r} \right)^2 \frac{dt}{d\tau} = 1 + \frac{3\alpha}{r} + \frac{v^2}{2c^2} \tag{81.37}$$

Here we replace the gravitational radius $\alpha$ by its value (81.30) and insert 
(81.37) into (81.36), so getting 

$$\left( 1 + \frac{v^2}{2c^2} + \frac{3\gamma m_0}{c^2 r} \right) r^2 \frac{d\varphi}{dt} = \mu \tag{81.38}$$

If $m^*$ is equated to zero in (81.26) that equation agrees with (81.38). 

Thus in the limit the integrals of motion in the two-body problem do indeed 
go over into the corresponding integrals of the equations of the geodesic which 
determine the motion of an infinitesimal mass in the field of a finite mass. We 
note that this comparison has been possible only because we have treated both 
problems in terms of the same, namely harmonic, coordinates. 

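We now continue the discussion of the equations for two finite masses and 
derive the equations of the trajectory of their relative motion. We shall retain 
the symbol $v^2$ in place of $\dot{\mathbf{r}}^2$ to denote the square of the velocity. If we put also 

$$u = \frac{1}{r} \tag{81.39}$$

we have the identity 

$$v^2 = (r^2 \dot{\varphi})^2 \left\{ \left( \frac{du}{d\varphi} \right)^2 + u^2 \right\} \tag{81.40}$$

whence, after multiplying by the square of the quantity that appears multiplied 
by $r^2 \dot{\varphi}$ in (81.26) we find 

$$v^2 + \frac{1}{c^2}\left( 1 - \frac{3m^*}{m_0} \right) v^4 + \frac{2\gamma}{c^2 r}(3m_0 + m^*)\, v^2 = \mu^2 \left\{ \left( \frac{du}{d\varphi} \right)^2 + u^2 \right\} \tag{81.41}$$

We can now insert on the left the value of $v^2$ given by the energy integral 
(81.22) with $(\mathbf{r} \cdot \dot{\mathbf{r}})^2$ expressed as 

$$(\mathbf{r} \cdot \dot{\mathbf{r}})^2 = r^2 v^2 - \mu^2 \tag{81.42}$$

This insertion reduces (81.22) for $E_0$ to 

$$E_0 = \tfrac{1}{2} v^2 - \frac{\gamma m_0}{r} + \frac{3}{8c^2}\left( 1 - \frac{3m^*}{m_0} \right) v^4 + \frac{\gamma}{2c^2 r}(3m_0 + 2m^*)\, v^2 + \frac{\gamma^2 m_0^2}{2c^2 r^2} - \frac{\gamma m^* \mu^2}{2c^2 r^3} \tag{81.43}$$

We solve this equation approximately for $v^2$ and insert the solution into (81.41). 
Replacing $1/r$ by $u$ we then get 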


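$$\mu^2 \left\{ \left( \frac{du}{d\varphi} \right)^2 + u^2 \right\} = 2E_0 + \frac{1}{c^2}\left( 1 - \frac{3m^*}{m_0} \right) E_0^2 + 2\gamma m_0 \left\{ 1 + \frac{E_0}{c^2}\left( 4 - \frac{3m^*}{m_0} \right) \right\} u + \frac{3\gamma^2 m_0}{c^2}(2m_0 - m^*)\, u^2 + \frac{\gamma m^* \mu^2}{c^2}\, u^3 \tag{81.44}$$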

For $m^* = 0$ this equation goes over into (58.36) in the approximation considered. 
To make the discussion of this equation more convenient we replace 
the integration constants $E_0$ and $\mu$ by two new constants $a$ and $p$, where 

$$E_0 = -\frac{\gamma m_0}{2a}; \qquad \mu^2 = \gamma m_0\, p \tag{81.45}$$

Further we put, as in (81.30): 

$$\frac{\gamma m_0}{c^2} = \alpha; \qquad \frac{\gamma m^*}{c^2} = \alpha^* \tag{81.46}$$

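In Newtonian approximation the quantities $a$ and $p$ represent the major semi-axis 
and the parameter of an ellipse (we confine ourselves to the case of a 
bounded motion). With the new notation (81.44) appears as 

$$\left( \frac{du}{d\varphi} \right)^2 = -\frac{1}{ap} + \frac{\alpha - 3\alpha^*}{4a^2 p} + \frac{2}{p}\left( 1 - \frac{4\alpha - 3\alpha^*}{2a} \right) u - \left( 1 - \frac{6\alpha - 3\alpha^*}{p} \right) u^2 + \alpha^* u^3 \tag{81.47}$$

For $a > p$ the polynomial on the right-hand side of this equation has two 
positive roots $u_1$ and $u_2$ that are close to the roots of the equation 

$$-\frac{1}{ap} + \frac{2}{p}\, u - u^2 = 0 \tag{81.48}$$

which are 

$$u_1 = \frac{1 + e}{p}; \qquad u_2 = \frac{1 - e}{p} \tag{81.49}$$

with 

$$e^2 = 1 - \frac{p}{a} \tag{81.50}$$

In addition there is a large root $u_3$ for which one easily finds the relation 

$$\alpha^* u_3 = 1 - \frac{6\alpha - \alpha^*}{p} \tag{81.51}$$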
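
The Newtonian roots can be verified with a few lines of arithmetic. The sketch below assumes the standard relation $e^2 = 1 - p/a$ between the eccentricity, the parameter and the major semi-axis of an ellipse; the function names and the sample values are illustrative only.

```python
import math

def newtonian_roots(a, p):
    """Roots of -1/(a p) + 2u/p - u**2 = 0, written as (1 +/- e)/p."""
    e = math.sqrt(1.0 - p / a)  # eccentricity, real for a > p
    return (1.0 + e) / p, (1.0 - e) / p, e

def quadratic(u, a, p):
    """Left-hand side of the Newtonian equation (81.48)."""
    return -1.0 / (a * p) + 2.0 * u / p - u * u

def large_root(alpha, alpha_star, p):
    """Approximate third root u_3 from the relation (81.51)."""
    return (1.0 - (6.0 * alpha - alpha_star) / p) / alpha_star

a, p = 2.0, 1.5                      # sample ellipse with e = 0.5
u1, u2, e = newtonian_roots(a, p)
print(u1, u2, e)
print(quadratic(u1, a, p), quadratic(u2, a, p))
print(1.0 / u1, 1.0 / u2)            # least and greatest distance: a(1 - e), a(1 + e)
print(large_root(1.0e-4, 2.5e-5, p)) # u_3 is of order 1/alpha*, far above u1 and u2
```

The reciprocals of $u_1$ and $u_2$ are the least and greatest separations $a(1 - e)$ and $a(1 + e)$, while $u_3$ is of order $1/\alpha^*$ and therefore lies far outside the physical range of $u$.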


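The differential equation (81.47) can therefore be written as 

$$\left( \frac{du}{d\varphi} \right)^2 = (u_1 - u)(u - u_2)\left( 1 - \frac{6\alpha - \alpha^*}{p} - \alpha^* u \right) \tag{81.52}$$

We make the substitution 

$$u = \frac{u_1 + u_2}{2} + \frac{u_1 - u_2}{2} \cos\psi \tag{81.53}$$

Then the equation for $\psi$ is 

$$\left( \frac{d\psi}{d\varphi} \right)^2 = 1 - \frac{6\alpha}{p} - \frac{\alpha^* e}{p} \cos\psi \tag{81.54}$$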

Here we have introduced the approximate values for the roots given by (81.49) 
into the correction term. Hence 

$$\frac{d\varphi}{d\psi} = 1 + \frac{3\alpha}{p} + \frac{\alpha^* e}{2p} \cos\psi \tag{81.55}$$

If $r$ varies from its greatest to its least value and back, $\psi$ changes by $2\pi$. In 
that time the angle $\varphi$ increases by somewhat more than $2\pi$, say by $2\pi + \Delta\varphi$ 
where 

$$\Delta\varphi = \frac{6\pi\alpha}{p} \tag{81.56}$$

This equation has the same form as the equation (58.43) which gives the dis¬ 
placement of the perihelion in one period of revolution, but the constants 
involved have a somewhat different meaning. For a given parameter p the 
displacement depends only on the sum of the masses of the two components 
of the system (e.g. a double star). Of the constants of integration only the 
angular momentum is involved in this expression for the displacement. 
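
As a numerical illustration of (81.56), the sketch below evaluates $\Delta\varphi = 6\pi\alpha/p$ with $\alpha = \gamma m_0 / c^2$ and cross-checks it by integrating $d\varphi/d\psi$ from (81.54) over one cycle. The function names are hypothetical, and the orbital data for Mercury (solar $\gamma m_0$, semi-axis, eccentricity, period) are commonly quoted values inserted here as assumptions; the planet's own mass is neglected in the sum of the masses.

```python
import math

C = 299_792_458.0  # speed of light in m/s

def perihelion_advance(gm_total, p):
    """Delta phi per revolution from (81.56), with alpha = gamma * m0 / c**2."""
    alpha = gm_total / C ** 2
    return 6.0 * math.pi * alpha / p

def advance_by_quadrature(alpha, alpha_star, e, p, steps=4000):
    """Integrate dphi/dpsi = (1 - 6a/p - (a* e/p) cos psi)**(-1/2) over 0..2 pi.

    Uses the midpoint rule and returns the excess of the result over 2 pi.
    """
    h = 2.0 * math.pi / steps
    total = 0.0
    for j in range(steps):
        psi = (j + 0.5) * h
        total += h / math.sqrt(1.0 - 6.0 * alpha / p
                               - (alpha_star * e / p) * math.cos(psi))
    return total - 2.0 * math.pi

# Cross-check of the first-order formula in dimensionless sample units.
alpha, alpha_star, e, p = 1.0e-4, 2.5e-5, 0.2, 1.0
first_order = 6.0 * math.pi * alpha / p
numerical = advance_by_quadrature(alpha, alpha_star, e, p)
relative_gap = abs(numerical - first_order) / first_order

# Illustrative application to Mercury (assumed input data, SI units).
GM_SUN = 1.32712440018e20      # gamma * m0 in m^3 / s^2, Sun only
A_MERCURY = 5.7909e10          # major semi-axis in m
E_MERCURY = 0.2056             # eccentricity
PERIOD_DAYS = 87.969           # orbital period in days

p_mercury = A_MERCURY * (1.0 - E_MERCURY ** 2)
per_orbit = perihelion_advance(GM_SUN, p_mercury)
arcsec_per_century = per_orbit * (36525.0 / PERIOD_DAYS) * (180.0 * 3600.0 / math.pi)
print(relative_gap, per_orbit, arcsec_per_century)
```

The term proportional to $\cos\psi$ averages to zero over a cycle, which is why $\alpha^*$ and hence the individual masses drop out of the first-order displacement.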

The presence of the cubic term in the differential equation (81.47) has the 
effect that the orbit of relative motion is not a precessing ellipse, but a precessing 
curve of a more complicated form which, however, does not differ greatly 
from an ellipse [19]. 



CHAPTER VII 


APPROXIMATE SOLUTIONS, CONSERVATION LAWS 
AND SOME QUESTIONS OF PRINCIPLE 


82. The Gravitational Potentials for Non-Rotating Bodies (Spatial 
Components) 

In the preceding chapter we determined the gravitational potentials only 
with the accuracy required to derive the equations of motion of a system of 
bodies. We shall now concern ourselves with a more precise evaluation of the 
gravitational potentials, but in view of the complexity of the problem we shall 
only deal with the case of non-rotating spherically symmetric masses. 

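In Section 67 we stated the equations 

$$\mathfrak{g}^{00} = \frac{1}{c} + \frac{4U}{c^3} + \frac{4S}{c^5} \tag{82.01}$$

$$\mathfrak{g}^{0i} = \frac{4U_i}{c^3} + \frac{4S_i}{c^5} \tag{82.02}$$

and 

$$\mathfrak{g}^{ik} = -c\,\delta_{ik} + \frac{4S_{ik}}{c^3} \tag{82.03}$$

in which $U$ is the Newtonian potential and $U_i$ the gravitational vector potential. 
The terms involving $S$, $S_i$ and $S_{ik}$ are corrections. We did not evaluate these in 
detail because it was sufficient for the derivation of the equations of motion to 
know, in addition to the vector potential $U_i$, the single quantity 

$$U^* = U + \frac{1}{c^2}\left( S + S_{kk} - 2U^2 \right) \tag{82.04}$$

(see equation (67.11)) which, by (68.30), satisfies the equation 

$$\Delta U^* - \frac{1}{c^2} \frac{\partial^2 U^*}{\partial t^2} = -4\pi\gamma\left( c^2 T^{00} + T^{kk} \right) \tag{82.05}$$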

Now we shall also evaluate the correction terms, taking as our starting point 
the approximate form of Einstein’s equations obtained in Section 68. 

$$\frac{1}{2c} \Delta \mathfrak{g}^{\mu\nu} - \frac{1}{2c^3} \frac{\partial^2 \mathfrak{g}^{\mu\nu}}{\partial t^2} + N^{\mu\nu} = \frac{8\pi\gamma g}{c^4}\, T^{\mu\nu} \tag{82.06}$$

We consider first the equation for the $\mathfrak{g}^{ik}$. In it we retain terms of fourth order 
in $1/c$, but discard all terms of higher orders. 

According to (66.07) we can then.put 

— gT ik = pViVk — pm 
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(82.07) 



Approximate Solutions 

and in the expression (68.16) for $N^{ik}$ we can take the main terms

$$N^{ik} = -\frac{2}{c^4}\,Q_{ik} \tag{82.08}$$

Here we have put

$$Q_{ik} = \tfrac12\,\delta_{ik}\,(\operatorname{grad}U)^2 - \frac{\partial U}{\partial x_i}\frac{\partial U}{\partial x_k} \tag{82.09}$$


as in (71.16). We can neglect the second derivative with respect to time in the
equation for $\mathfrak g^{ik}$. Introducing the value (82.03) for
$\mathfrak g^{ik}$, multiplying by $\tfrac12c^4$ and transferring the term
$Q_{ik}$ to the right-hand side we obtain the following equation for $S_{ik}$:

$$\Delta S_{ik} = Q_{ik} - 4\pi\gamma\,(\rho v_iv_k - p_{ik}) \tag{82.10}$$

We note that the gravitational vector potential $U_i$ satisfies the equation

$$\Delta U_i = -4\pi\gamma\,\rho v_i \tag{82.11}$$


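and that, since the Newtonian potential $U$ satisfies Poisson’s equation, we
have

$$\frac{\partial Q_{ik}}{\partial x_k} = 4\pi\gamma\,\rho\,\frac{\partial U}{\partial x_i} \tag{82.12}$$

Therefore equations (82.10) and (82.11) lead to

$$\Delta\left(\frac{\partial U_i}{\partial t} + \frac{\partial S_{ik}}{\partial x_k}\right) = -4\pi\gamma\left\{\frac{\partial(\rho v_i)}{\partial t} + \frac{\partial(\rho v_iv_k)}{\partial x_k} - \rho\,\frac{\partial U}{\partial x_i} - \frac{\partial p_{ik}}{\partial x_k}\right\} \tag{82.13}$$

and by virtue of the equations of motion for a continuous medium written
in the form (66.13) we get

$$\Delta\left(\frac{\partial U_i}{\partial t} + \frac{\partial S_{ik}}{\partial x_k}\right) = 0 \tag{82.14}$$

Since this last relation holds throughout space we must have

$$\frac{\partial U_i}{\partial t} + \frac{\partial S_{ik}}{\partial x_k} = 0 \tag{82.15}$$

i.e. the harmonic conditions are satisfied. When we obtain the explicit
expressions for the $S_{ik}$ we shall be able to verify by direct substitution
that (82.15) is valid.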

We shall seek to find the functions $S_{ik}$ in the form

$$S_{ik} = U_{ik} + V_{ik} \tag{82.16}$$

where $U_{ik}$ and $V_{ik}$ satisfy the equations

$$\Delta U_{ik} = -4\pi\gamma\,(\rho v_iv_k - p_{ik}) \tag{82.17}$$

and

$$\Delta V_{ik} = Q_{ik} \tag{82.18}$$

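Inserting the expression

$$U = \sum_a u_a \tag{82.19}$$

for the Newtonian potential into the equation (82.09) for $Q_{ik}$ we can write

$$Q_{ik} = \sum_a Q_{ik}^{(aa)} + \sum_{a\neq b} Q_{ik}^{(ab)} \tag{82.20}$$

where†

$$Q_{ik}^{(aa)} = \tfrac12\,\delta_{ik}\,(\operatorname{grad}u_a)^2 - \frac{\partial u_a}{\partial x_i}\frac{\partial u_a}{\partial x_k} \tag{82.21}$$

and

$$Q_{ik}^{(ab)} = \tfrac12\,\delta_{ik}\,(\operatorname{grad}u_a\cdot\operatorname{grad}u_b) - \tfrac12\left(\frac{\partial u_a}{\partial x_i}\frac{\partial u_b}{\partial x_k} + \frac{\partial u_a}{\partial x_k}\frac{\partial u_b}{\partial x_i}\right) \tag{82.22}$$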


In accordance with (82.20) we assume that the solution $V_{ik}$ of (82.18) is
of the form

$$V_{ik} = \sum_a V_{ik}^{(aa)} + \sum_{a\neq b} V_{ik}^{(ab)} \tag{82.23}$$

with the separate terms satisfying the equations

$$\Delta V_{ik}^{(aa)} = Q_{ik}^{(aa)} \tag{82.24}$$

and

$$\Delta V_{ik}^{(ab)} = Q_{ik}^{(ab)} \tag{82.25}$$

We shall find explicit solutions of these last equations for the case of
non-rotating, spherically symmetric masses. In this case the Newtonian
potential of mass $(a)$ in the space outside that mass is

$$u_a = \frac{\gamma M_a}{|\mathbf r-\mathbf a|} \tag{82.26}$$

and the expressions (82.21) and (82.22) for the $Q_{ik}^{(aa)}$ and the
$Q_{ik}^{(ab)}$ take on the form

$$Q_{ik}^{(aa)} = \tfrac12\gamma^2M_a^2\left\{\frac{\delta_{ik}}{|\mathbf r-\mathbf a|^4} - \frac{2(x_i-a_i)(x_k-a_k)}{|\mathbf r-\mathbf a|^6}\right\} \tag{82.27}$$

$$Q_{ik}^{(ab)} = \tfrac12\gamma^2M_aM_b\left\{\delta_{ik}\,\frac{(x_j-a_j)(x_j-b_j)}{|\mathbf r-\mathbf a|^3|\mathbf r-\mathbf b|^3} - \frac{(x_i-a_i)(x_k-b_k) + (x_i-b_i)(x_k-a_k)}{|\mathbf r-\mathbf a|^3|\mathbf r-\mathbf b|^3}\right\} \tag{82.28}$$

These expressions are valid outside the masses. Within the masses one must,
strictly speaking, use the more accurate expressions (82.21) and (82.22).

However, the difference between the exact and the approximate values of
the $Q_{ik}^{(ab)}$ will not appreciably influence the values of the
$V_{ik}^{(ab)}$, at least in the case that the linear dimensions of the masses
are small in comparison with their mutual distances. Therefore we shall take
the quantity $Q_{ik}^{(ab)}$ appearing on the right-hand side of (82.25) to be
given by the expression (82.28) throughout space. At the points
$\mathbf r = \mathbf a$ and $\mathbf r = \mathbf b$ this expression has
singularities that are no higher than of dipole character. Consequently, the
equation (82.25) for $V_{ik}^{(ab)}$ will have a solution which remains finite
throughout space, including the points $\mathbf r = \mathbf a$ and
$\mathbf r = \mathbf b$, and which tends to zero at infinity.

† In Section 71 the quantity now called $Q_{ik}^{(aa)}$ was denoted by $q_{ik}^{(a)}$.

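This solution can be given in closed form. To do this we express the quantity
(82.28) in terms of derivatives of the function
$1/(|\mathbf r-\mathbf a|\,|\mathbf r-\mathbf b|)$ with respect to the
parameters $a_i$ and $b_i$. We have

$$Q_{ik}^{(ab)} = \tfrac12\gamma^2M_aM_b\left(\delta_{ik}\frac{\partial^2}{\partial a_j\,\partial b_j} - \frac{\partial^2}{\partial a_i\,\partial b_k} - \frac{\partial^2}{\partial a_k\,\partial b_i}\right)\frac{1}{|\mathbf r-\mathbf a|\,|\mathbf r-\mathbf b|} \tag{82.29}$$

Therefore (82.25) may be reduced to the simpler equation

$$\Delta\varphi = \frac{1}{|\mathbf r-\mathbf a|\,|\mathbf r-\mathbf b|} \tag{82.30}$$

Indeed, if $\varphi$ is a solution of (82.30) then the quantity

$$V_{ik}^{(ab)} = \tfrac12\gamma^2M_aM_b\left(\delta_{ik}\frac{\partial^2\varphi}{\partial a_j\,\partial b_j} - \frac{\partial^2\varphi}{\partial a_i\,\partial b_k} - \frac{\partial^2\varphi}{\partial a_k\,\partial b_i}\right) \tag{82.31}$$

is a solution of (82.25).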

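Now, it is simple to write down the required solution of (82.30). We denote
by $s$ the perimeter of the triangle whose vertices are at the points
$\mathbf a$, $\mathbf b$ and $\mathbf r$:

$$s = |\mathbf r-\mathbf a| + |\mathbf r-\mathbf b| + |\mathbf a-\mathbf b| \tag{82.32}$$

Then the function

$$\varphi = \log s = \log\,(|\mathbf r-\mathbf a| + |\mathbf r-\mathbf b| + |\mathbf a-\mathbf b|) \tag{82.33}$$

satisfies (82.30), for we have

$$\Delta\log s = \frac{1}{|\mathbf r-\mathbf a|\,|\mathbf r-\mathbf b|} \tag{82.34}$$

Therefore the required solution of (82.25) has the form

$$V_{ik}^{(ab)} = \tfrac12\gamma^2M_aM_b\left(\delta_{ik}\frac{\partial^2\log s}{\partial a_j\,\partial b_j} - \frac{\partial^2\log s}{\partial a_i\,\partial b_k} - \frac{\partial^2\log s}{\partial a_k\,\partial b_i}\right) \tag{82.35}$$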

It is easy to verify that this expression remains finite everywhere and tends to 
zero at infinity. 
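
The key property used here, that the Laplacian of log s with respect to r equals 1/(|r - a| |r - b|), can be spot-checked numerically. The following is only an illustrative sketch: the points a, b, r and the step h are arbitrary assumed values, and the helper names are hypothetical.

```python
import math

def dist(p, q):
    """Euclidean distance between two points given as 3-sequences."""
    return math.sqrt(sum((pi - qi) ** 2 for pi, qi in zip(p, q)))

def log_s(r, a, b):
    """Logarithm of the perimeter s of the triangle with vertices a, b, r."""
    return math.log(dist(r, a) + dist(r, b) + dist(a, b))

def laplacian_r(r, a, b, h=1e-3):
    """Central-difference Laplacian of log s with respect to the point r."""
    f0 = log_s(r, a, b)
    total = 0.0
    for i in range(3):
        rp, rm = list(r), list(r)
        rp[i] += h
        rm[i] -= h
        total += (log_s(rp, a, b) - 2.0 * f0 + log_s(rm, a, b)) / h**2
    return total

# Arbitrary test configuration (illustrative values only).
a = (0.3, -0.2, 0.1)
b = (-0.5, 0.4, 0.7)
r = (1.2, 0.9, -0.6)

numeric = laplacian_r(r, a, b)
exact = 1.0 / (dist(r, a) * dist(r, b))
print(numeric, exact)
```

The two printed numbers should agree to within the finite-difference error, which is of order h squared.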

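We now turn to the equation (82.24) for $V_{ik}^{(aa)}$. Shifting the origin of
coordinates to the point $\mathbf a$ and using the spherical symmetry of our
problem we can write the equation as

$$\Delta V_{ik}^{(aa)} = \left(\tfrac12\,\delta_{ik} - \frac{x_ix_k}{r^2}\right)u'(r)^2 \tag{82.36}$$

For brevity we have put

$$u_a = u(r) \tag{82.37}$$

We try to find a solution of (82.36) of the form

$$V_{ik}^{(aa)} = -\delta_{ik}\,q_0(r) + (x_ix_k - \tfrac13\,\delta_{ik}r^2)\,q_2(r) \tag{82.38}$$

Then $q_0$ and $q_2$ must satisfy the equations

$$\frac{d^2q_0}{dr^2} + \frac{2}{r}\frac{dq_0}{dr} = -\frac16\,u'(r)^2 \tag{82.39}$$

and

$$\frac{d^2q_2}{dr^2} + \frac{6}{r}\frac{dq_2}{dr} = -\frac{1}{r^2}\,u'(r)^2 \tag{82.40}$$

Hence, using the boundary conditions, we obtain

$$q_0(r) = \frac16\int_r^\infty r\,u'(r)^2\,dr + \frac{1}{6r}\int_0^r r^2u'(r)^2\,dr \tag{82.41}$$

and

$$q_2(r) = \frac15\int_r^\infty \frac{1}{r}\,u'(r)^2\,dr + \frac{1}{5r^5}\int_0^r r^4u'(r)^2\,dr \tag{82.42}$$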


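If the radius of the body is $L$ we have for $r > L$:

$$u(r) = \frac{\gamma M_a}{r} \tag{82.43}$$

and the expressions for $q_0(r)$ and $q_2(r)$ reduce to the following:

$$q_0(r) = \frac{\gamma\varepsilon_a}{3r} - \frac{\gamma^2M_a^2}{12r^2} \tag{82.44}$$

and

$$q_2(r) = \frac{\gamma^2M_a^2}{4r^4} - \frac{\gamma^2M_a^2\lambda_a}{5r^5} \tag{82.45}$$

Here

$$\lambda_a = L - \frac{1}{\gamma^2M_a^2}\int_0^L r^4u'(r)^2\,dr \tag{82.46}$$

is a length of the order of $L$ and the quantity

$$\varepsilon_a = \frac{1}{8\pi\gamma}\int(\operatorname{grad}u_a)^2\,(dx)^3 = \frac{1}{2\gamma}\int_0^\infty r^2u'(r)^2\,dr \tag{82.47}$$

is the negative of the gravitational energy of the body. If we insert the
expressions obtained for $q_0(r)$ and $q_2(r)$ into (82.38) we get

$$V_{ik}^{(aa)} = \frac{\gamma^2M_a^2\,x_ix_k}{4r^4} - \frac{\gamma\varepsilon_a}{3}\,\frac{\delta_{ik}}{r} - \frac{\gamma^2M_a^2\lambda_a}{5r^5}\,(x_ix_k - \tfrac13\,\delta_{ik}r^2) \tag{82.48}$$

or if we return to our original variables by replacing $x_i$ by $x_i - a_i$:

$$V_{ik}^{(aa)} = \frac{\gamma^2M_a^2\,(x_i-a_i)(x_k-a_k)}{4|\mathbf r-\mathbf a|^4} - \frac{\gamma\varepsilon_a}{3}\,\frac{\delta_{ik}}{|\mathbf r-\mathbf a|} - \frac{\gamma^2M_a^2\lambda_a}{5|\mathbf r-\mathbf a|^5}\left[(x_i-a_i)(x_k-a_k) - \tfrac13\,\delta_{ik}(\mathbf r-\mathbf a)^2\right] \tag{82.49}$$

Here the last term is small compared to the first as long as the distance from
body $(a)$ is great compared to its radius. Strictly speaking that term should
be discarded because the same approximation has already been used in
calculating $V_{ik}^{(ab)}$.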
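
As a plausibility check on the exterior forms of q_0 and q_2, one can substitute them into the radial equations numerically. The sketch below assumes the forms q_0 = B/(3r) - A/(12 r^2) and q_2 = A/(4 r^4) - A*lam/(5 r^5), with A standing for gamma^2 M_a^2, B for gamma*epsilon_a and lam for lambda_a, together with the radial equations q_0'' + 2 q_0'/r = -u'^2/6 and q_2'' + 6 q_2'/r = -u'^2/r^2 (these forms are reconstructed from the surrounding derivation, since the scan is partly illegible at this point). All numerical values are arbitrary.

```python
def q0(r, A, B):
    """Assumed exterior q_0: B/(3r) - A/(12 r^2)."""
    return B / (3.0 * r) - A / (12.0 * r**2)

def q2(r, A, lam):
    """Assumed exterior q_2: A/(4 r^4) - A*lam/(5 r^5)."""
    return A / (4.0 * r**4) - A * lam / (5.0 * r**5)

def d1(f, r, h=1e-3):
    """Central-difference first derivative."""
    return (f(r + h) - f(r - h)) / (2.0 * h)

def d2(f, r, h=1e-3):
    """Central-difference second derivative."""
    return (f(r + h) - 2.0 * f(r) + f(r - h)) / h**2

# Illustrative values: A = gamma^2 M^2, B = gamma*epsilon, lam = lambda.
A, B, lam = 1.0, 0.7, 0.4
r = 2.0
u_prime_sq = A / r**4          # u'(r)^2 for u = gamma*M/r outside the body

f0 = lambda x: q0(x, A, B)
f2 = lambda x: q2(x, A, lam)

# Residuals of the two radial equations; both should be close to zero.
res0 = d2(f0, r) + 2.0 / r * d1(f0, r) + u_prime_sq / 6.0
res2 = d2(f2, r) + 6.0 / r * d1(f2, r) + u_prime_sq / r**2
print(res0, res2)
```

Both residuals vanish up to discretisation error, independently of the values chosen for B and lam, because the corresponding terms are homogeneous solutions.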

It remains to write down the solution of (82.17). For non-rotating masses
we can put

$$v_i = \dot a_i\,;\qquad p_{ik} = -p\,\delta_{ik} \tag{82.50}$$

with $p$ determined from the equations

$$dp = \rho\,du_a \tag{82.51}$$

whereby, according to (74.24)

$$\int_{(a)} p\,(dx)^3 = \tfrac13\,\varepsilon_a \tag{82.52}$$

We then have

$$U_{ik} = \sum_a\frac{\gamma M_a\dot a_i\dot a_k}{|\mathbf r-\mathbf a|} + \sum_a\frac{\gamma\varepsilon_a}{3}\,\frac{\delta_{ik}}{|\mathbf r-\mathbf a|} \tag{82.53}$$

Here the dipole terms are zero because of the spherical symmetry of the bodies
and terms of higher order in $L/R$ (i.e. of the same order as the last term in
(82.49)) will be discarded.

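We now put

$$S_{ik}^{(aa)} = \frac{\gamma M_a\dot a_i\dot a_k}{|\mathbf r-\mathbf a|} + \frac{\gamma^2M_a^2\,(x_i-a_i)(x_k-a_k)}{4|\mathbf r-\mathbf a|^4} \tag{82.54}$$

and

$$S_{ik}^{(ab)} = V_{ik}^{(ab)} = \tfrac12\gamma^2M_aM_b\left(\delta_{ik}\frac{\partial^2\log s}{\partial a_j\,\partial b_j} - \frac{\partial^2\log s}{\partial a_i\,\partial b_k} - \frac{\partial^2\log s}{\partial a_k\,\partial b_i}\right) \tag{82.55}$$

Denoting by $S_{ik}$ the sum

$$S_{ik} = \sum_a S_{ik}^{(aa)} + \sum_{a\neq b} S_{ik}^{(ab)} \tag{82.56}$$

and inserting this expression into the equation

$$\mathfrak g^{ik} = -c\,\delta_{ik} + \frac{4}{c^3}\,S_{ik} \tag{82.03}$$

we obtain explicit expressions for the spatial components of the metric tensor.
These expressions are also valid in the region of the system itself, outside the
actual masses.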

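For the mixed components of the metric tensor we had the approximate
expression

$$\mathfrak g^{0i} = \frac{4}{c^3}\,U_i \tag{82.57}$$

where $U_i$ is a solution of (82.11) which, in the approximation considered, is
equal to

$$U_i = \sum_a\frac{\gamma M_a\dot a_i}{|\mathbf r-\mathbf a|} \tag{82.58}$$

(see equation (76.23)).

We can now verify that the explicit expressions found for $S_{ik}$ and for
$U_i$ satisfy the harmonic condition (82.15). We have

$$\frac{\partial U_i}{\partial t} + \sum_a\frac{\partial S_{ik}^{(aa)}}{\partial x_k} = \sum_a\frac{\gamma M_a\ddot a_i}{|\mathbf r-\mathbf a|} \tag{82.59}$$

Also, using the relation

$$\frac{\partial^2\log s}{\partial a_j\,\partial b_j} = -\frac{|\mathbf r-\mathbf a| + |\mathbf r-\mathbf b| - |\mathbf a-\mathbf b|}{2\,|\mathbf r-\mathbf a|\,|\mathbf r-\mathbf b|\,|\mathbf a-\mathbf b|} \tag{82.60}$$

and the two further relations that may be obtained from (82.60) by permutation
of the letters $a$, $b$ and $r$, we can verify that

$$\frac{\partial S_{ik}^{(ab)}}{\partial x_k} = \tfrac12\gamma^2M_aM_b\left(\frac{1}{|\mathbf r-\mathbf a|}\frac{\partial}{\partial b_i} + \frac{1}{|\mathbf r-\mathbf b|}\frac{\partial}{\partial a_i}\right)\frac{1}{|\mathbf a-\mathbf b|} \tag{82.61}$$

Summing this expression over $a$ and $b$ we get

$$\sum_{a\neq b}\frac{\partial S_{ik}^{(ab)}}{\partial x_k} = \sum_{a\neq b}\gamma^2M_aM_b\,\frac{a_i-b_i}{|\mathbf a-\mathbf b|^3\,|\mathbf r-\mathbf a|} \tag{82.62}$$

Hence, putting

$$\Phi = -\tfrac12\sum_{a\neq b}\frac{\gamma M_aM_b}{|\mathbf a-\mathbf b|} \tag{82.63}$$

we have

$$\sum_{a\neq b}\frac{\partial S_{ik}^{(ab)}}{\partial x_k} = \sum_a\frac{\gamma}{|\mathbf r-\mathbf a|}\,\frac{\partial\Phi}{\partial a_i} \tag{82.64}$$

Combination of (82.59) and (82.64) finally gives

$$\frac{\partial U_i}{\partial t} + \frac{\partial S_{ik}}{\partial x_k} = \sum_a\frac{\gamma}{|\mathbf r-\mathbf a|}\left(M_a\ddot a_i + \frac{\partial\Phi}{\partial a_i}\right) \tag{82.65}$$

Now, $\Phi$ is the Newtonian potential energy of the bodies. Therefore

$$M_a\ddot a_i + \frac{\partial\Phi}{\partial a_i} = 0 \tag{82.66}$$

by virtue of Newton’s equations of motion. Hence the entire expression
(82.65) will vanish, in agreement with (82.15). By reversing the argument
(as was done in our paper of 1939 [34]) we could have derived Newton’s equations
of motion (82.66) from (82.15) and (82.65).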
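
The relation for the contracted mixed derivative of log s with respect to a and b, on which the verification above rests, can be checked by finite differences. This is a sketch under stated assumptions: the closed form tested is -(|r-a| + |r-b| - |a-b|) / (2 |r-a| |r-b| |a-b|), the test points are arbitrary, and the function names are hypothetical.

```python
import math

def dist(p, q):
    """Euclidean distance between two points given as 3-sequences."""
    return math.sqrt(sum((pi - qi) ** 2 for pi, qi in zip(p, q)))

def log_s(r, a, b):
    """Logarithm of the perimeter of the triangle with vertices a, b, r."""
    return math.log(dist(r, a) + dist(r, b) + dist(a, b))

def mixed_trace(r, a, b, h=1e-3):
    """Sum over j of d^2(log s)/(da_j db_j), by central differences."""
    total = 0.0
    for j in range(3):
        def shifted(da, db, j=j):
            ap, bp = list(a), list(b)
            ap[j] += da
            bp[j] += db
            return log_s(r, ap, bp)
        total += (shifted(h, h) - shifted(h, -h)
                  - shifted(-h, h) + shifted(-h, -h)) / (4.0 * h**2)
    return total

# Arbitrary test configuration (illustrative values only).
a = (0.3, -0.2, 0.1)
b = (-0.5, 0.4, 0.7)
r = (1.2, 0.9, -0.6)

ra, rb, ab = dist(r, a), dist(r, b), dist(a, b)
closed_form = -(ra + rb - ab) / (2.0 * ra * rb * ab)
numeric = mixed_trace(r, a, b)
print(numeric, closed_form)
```

By the triangle inequality the closed form is never positive, which the numerical value reproduces.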


83. The Gravitational Potentials for Non-Rotating Bodies (Mixed and 
Temporal Components) 

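We now derive expressions for the mixed components $\mathfrak g^{0i}$ of the
gravitational potential to the next approximation beyond that represented by
equations (82.57) and (82.58). We must put $\mu = 0$ and $\nu = i$ in (82.06)
and insert into that equation the value of $N^{0i}$ given by (68.15) and the
value of $T^{0i}$ given by (66.07), but with the simplification that the stress
$p_{ik}$ reduces to an isotropic pressure $p$ and that the velocities
$v_i = \dot a_i$ within each body are constant. We write equation (68.15) for
$N^{0i}$ in the form

$$N^{0i} = -\frac{2}{c^6}\,Q_i \tag{83.01}$$

where

$$Q_i = -3\,\frac{\partial U}{\partial t}\frac{\partial U}{\partial x_i} + 4\left(\frac{\partial U_i}{\partial x_j} - \frac{\partial U_j}{\partial x_i}\right)\frac{\partial U}{\partial x_j} \tag{83.02}$$

Equation (82.06) then appears as

$$\frac{1}{2c}\,\Delta\mathfrak g^{0i} - \frac{1}{2c^3}\frac{\partial^2\mathfrak g^{0i}}{\partial t^2} - \frac{2}{c^6}\,Q_i = -\frac{8\pi\gamma}{c^4}\,(c^2+4U)\,T^{0i} \tag{83.03}$$

where, according to (66.07), we have for isotropic pressure:

$$(c^2+4U)\,T^{0i} = \rho v_i\left\{1 + \frac{1}{c^2}\left(\tfrac12v^2 + \Pi + 3U\right)\right\} + \frac{p\,v_i}{c^2} \tag{83.04}$$

If we insert into (83.03) the expression

$$\mathfrak g^{0i} = \frac{4U_i}{c^3} + \frac{4S_i}{c^5} \tag{83.05}$$

for $\mathfrak g^{0i}$ we can satisfy that equation if we demand that

$$\Delta U_i - \frac{1}{c^2}\frac{\partial^2U_i}{\partial t^2} = -4\pi\gamma\,(c^2+4U)\,T^{0i} \tag{83.06}$$

and

$$\Delta S_i - \frac{1}{c^2}\frac{\partial^2S_i}{\partial t^2} = Q_i \tag{83.07}$$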

Since these expressions involve the speed of light as a parameter the quantities
$U_i$ and $S_i$ will no longer be coefficients in the expansion of
$\mathfrak g^{0i}$ in inverse powers of $c$. However, this inconsistency in
their definition is unimportant because equation (83.06) for $U_i$ coincides to
first approximation with (82.11).

To write down the solution of (83.06) for the region outside the masses we
must know the value of the volume integral of the right-hand side of that
equation taken over the volume of each mass. To evaluate this integral we
recall the relation

$$\rho\Pi - \rho u_a + p = 0 \tag{83.08}$$

to which equation (73.26) reduces in the absence of rotation. According to
(74.07) and (74.24) we have

$$\int_{(a)}\rho u_a\,(dx)^3 = 2\varepsilon_a\,;\qquad \int_{(a)}p\,(dx)^3 = \tfrac13\,\varepsilon_a \tag{83.09}$$

Hence (83.08) gives the relation

$$\int_{(a)}\rho\Pi\,(dx)^3 = \tfrac53\,\varepsilon_a \tag{83.10}$$

Using these results we obtain

$$\int_{(a)}(c^2+4U)\,T^{0i}\,(dx)^3 = M_a\dot a_i\left\{1 + \frac{1}{c^2}\left[\tfrac12\dot a^2 + 3U^{(a)}(\mathbf a)\right]\right\} + \frac{8}{c^2}\,\varepsilon_a\dot a_i \tag{83.11}$$

where $U^{(a)}$ is the external potential introduced in Section 71. Here the
evaluation is much simpler than in Section 76 because we do not consider
rotations.

We note that the quantity (83.10) is the internal (elastic) energy of the body.
Since its gravitational energy is the negative of $\varepsilon_a$ the sum of the
two quantities, $(2/3)\,\varepsilon_a$, is the energy that must be taken in
calculating the effective mass. The latter will be†

$$m_a = M_a + \frac{2\,\varepsilon_a}{3c^2} \tag{83.12}$$

in agreement with (76.18) and (80.18).
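
To get a feeling for the size of the correction in (83.12), the sketch below evaluates the fraction (m_a - M_a)/M_a = 2*epsilon_a/(3 M_a c^2). It rests on assumptions that are not part of the text: the body is modelled as a homogeneous sphere, for which the negative of the Newtonian gravitational energy is (3/5) G M^2 / R, and rounded solar values are used purely for illustration.

```python
# Rounded physical constants and solar data (illustrative assumptions).
G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8          # speed of light, m/s
M_SUN = 1.989e30     # kg
R_SUN = 6.957e8      # m

def self_energy_uniform_sphere(mass, radius):
    """epsilon = (3/5) G M^2 / R: negative of the gravitational energy
    of a homogeneous sphere (a modelling assumption, not from the text)."""
    return 0.6 * G * mass**2 / radius

def effective_mass_fraction(mass, epsilon):
    """Relative mass correction (m - M)/M = 2*epsilon / (3 M c^2)."""
    return 2.0 * epsilon / (3.0 * mass * C**2)

eps = self_energy_uniform_sphere(M_SUN, R_SUN)
fraction = effective_mass_fraction(M_SUN, eps)
print(fraction)
```

For these inputs the correction is a little under one part in a million; a centrally condensed body of the same mass and radius would give a somewhat larger value.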

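Using (83.11) we can write the approximate solution of (83.06) in the form

$$U_i = \sum_a\frac{\gamma}{|\mathbf r-\mathbf a|}\left\{M_a\dot a_i\left[1 + \frac{1}{c^2}\left(\tfrac12\dot a^2 + 3U^{(a)}(\mathbf a)\right)\right] + \frac{8}{c^2}\,\varepsilon_a\dot a_i\right\} + \frac{1}{2c^2}\frac{\partial^2}{\partial t^2}\sum_a\gamma M_a\dot a_i\,|\mathbf r-\mathbf a| \tag{83.13}$$

Here the last term represents a retardation correction.

We now pass to the equation (83.07) for $S_i$. Here the second derivative with
respect to time may be neglected so that the equation takes on the form

$$\Delta S_i = Q_i \tag{83.14}$$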

As in (82.20) the quantity $Q_i$ can be decomposed in the form

$$Q_i = \sum_a Q_i^{(aa)} + \sum_{a\neq b} Q_i^{(ab)} \tag{83.15}$$

where $Q_i^{(aa)}$ is a quadratic function of the first derivatives of the
potential of mass $(a)$ and $Q_i^{(ab)}$ is a bilinear function of first
derivatives of the potentials of two masses $(a)$ and $(b)$. Corresponding to
this decomposition the solution of (83.14) can be written in the form

$$S_i = \sum_a S_i^{(aa)} + \sum_{a\neq b} S_i^{(ab)} \tag{83.16}$$

with the separate terms satisfying

$$\Delta S_i^{(aa)} = Q_i^{(aa)} \tag{83.17}$$

and

$$\Delta S_i^{(ab)} = Q_i^{(ab)} \tag{83.18}$$

For $Q_i^{(aa)}$ we take an expression analogous to (82.21) which is valid even
within the mass in question:

$$Q_i^{(aa)} = \dot a_j\left\{-\frac{\partial u_a}{\partial x_i}\frac{\partial u_a}{\partial x_j} + 4\,\delta_{ij}\,(\operatorname{grad}u_a)^2\right\} \tag{83.19}$$

For $Q_i^{(ab)}$ we confine ourselves to an expression that is correct outside
the masses and is analogous to (82.28), namely

$$Q_i^{(ab)} = \tfrac12\gamma^2M_aM_b\left\{(3\dot a_j - 4\dot b_j)\,\frac{(x_j-a_j)(x_i-b_i)}{|\mathbf r-\mathbf a|^3|\mathbf r-\mathbf b|^3} + (3\dot b_j - 4\dot a_j)\,\frac{(x_i-a_i)(x_j-b_j)}{|\mathbf r-\mathbf a|^3|\mathbf r-\mathbf b|^3} + 4(\dot a_i + \dot b_i)\,\frac{(x_j-a_j)(x_j-b_j)}{|\mathbf r-\mathbf a|^3|\mathbf r-\mathbf b|^3}\right\} \tag{83.20}$$

† It is important to bear in mind the relation (83.12) when comparing the
present equations with those of our 1939 paper [34].

To solve (83.17) we express its right-hand side in the form

$$Q_i^{(aa)} = \dot a_k\,Q_{ik}^{(aa)} + 7\,\dot a_i\,Q_{kk}^{(aa)} \tag{83.21}$$

where the $Q_{ik}^{(aa)}$ are the same as in Section 82. It evidently has the
solution

$$S_i^{(aa)} = \dot a_k\,V_{ik}^{(aa)} + 7\,\dot a_i\,V_{kk}^{(aa)} \tag{83.22}$$

where $V_{ik}^{(aa)}$ is the previously determined solution (82.38) or (82.49)
of equation (82.24).
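
The algebraic step behind (83.21), namely that the single-body source Q_i can be rewritten through the tensor Q_ik and its trace, is easy to test with arbitrary numbers. The sketch assumes, for one body moving with velocity v, that dU/dt = -v.g and U_i = v_i u, where g denotes grad u at some field point; the variable names and numerical values are illustrative only.

```python
def q_vector(v, g):
    """Q_i = -3 (dU/dt)(dU/dx_i) + 4 (dU_i/dx_j - dU_j/dx_i) dU/dx_j
    for one body, using dU/dt = -v.g and dU_i/dx_j = v_i g_j."""
    vg = sum(vi * gi for vi, gi in zip(v, g))
    g2 = sum(gi * gi for gi in g)
    return [3.0 * vg * g[i] + 4.0 * (v[i] * g2 - vg * g[i]) for i in range(3)]

def q_tensor(g):
    """Q_ik = (1/2) delta_ik g^2 - g_i g_k."""
    g2 = sum(gi * gi for gi in g)
    return [[0.5 * g2 * (1.0 if i == k else 0.0) - g[i] * g[k]
             for k in range(3)] for i in range(3)]

def q_vector_from_tensor(v, g):
    """Right-hand side of the identity: v_k Q_ik + 7 v_i Q_kk."""
    Q = q_tensor(g)
    trace = sum(Q[k][k] for k in range(3))
    return [sum(v[k] * Q[i][k] for k in range(3)) + 7.0 * v[i] * trace
            for i in range(3)]

# Arbitrary illustrative velocity and potential gradient.
v = (0.1, -0.3, 0.2)
g = (1.5, 0.4, -0.7)

direct = q_vector(v, g)
via_tensor = q_vector_from_tensor(v, g)
print(direct, via_tensor)
```

Since the identity is purely algebraic, the two lists agree to rounding error for any choice of v and g.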

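Passing on to equation (83.18) we represent its right-hand side, i.e. the
expression (83.20), in the form

$$Q_i^{(ab)} = \tfrac12\gamma^2M_aM_b\left\{(3\dot a_j - 4\dot b_j)\frac{\partial^2}{\partial a_j\,\partial b_i} + (3\dot b_j - 4\dot a_j)\frac{\partial^2}{\partial a_i\,\partial b_j} + 4(\dot a_i + \dot b_i)\frac{\partial^2}{\partial a_j\,\partial b_j}\right\}\frac{1}{|\mathbf r-\mathbf a|\,|\mathbf r-\mathbf b|} \tag{83.23}$$

It follows from (82.34) that (83.18) is solved by

$$S_i^{(ab)} = \tfrac12\gamma^2M_aM_b\left\{(3\dot a_j - 4\dot b_j)\frac{\partial^2\log s}{\partial a_j\,\partial b_i} + (3\dot b_j - 4\dot a_j)\frac{\partial^2\log s}{\partial a_i\,\partial b_j} + 4(\dot a_i + \dot b_i)\frac{\partial^2\log s}{\partial a_j\,\partial b_j}\right\} \tag{83.24}$$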

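We shall also write down the value of $S_i^{(aa)}$ outside the masses. To do
this we must insert the value (82.49) of $V_{ik}^{(aa)}$ into (83.22). The terms
containing $\lambda_a$ rapidly tend to zero at large distances and can be
omitted. In the expression so obtained for $S_i^{(aa)}$ it is convenient to
separate off a harmonic term, writing

$$S_i^{(aa)} = -\frac{22}{3}\,\dot a_i\,\frac{\gamma\varepsilon_a}{|\mathbf r-\mathbf a|} + \bar S_i^{(aa)} \tag{83.25}$$

where

$$\bar S_i^{(aa)} = \frac74\,\frac{\gamma^2M_a^2\,\dot a_i}{|\mathbf r-\mathbf a|^2} + \frac14\,\frac{\gamma^2M_a^2\,\dot a_k(x_k-a_k)(x_i-a_i)}{|\mathbf r-\mathbf a|^4} \tag{83.26}$$

Insertion of these expressions into (83.16) gives the value of $S_i$ which can
be combined with the value (83.13) of $U_i$ to determine $\mathfrak g^{0i}$ from
(83.05). The equation for $\mathfrak g^{0i}$ may be written in the form

$$\mathfrak g^{0i} = \frac{4\bar U_i}{c^3} + \frac{4\bar S_i}{c^5} \tag{83.27}$$

where

$$\bar U_i = \sum_a\frac{\gamma m_a\dot a_i}{|\mathbf r-\mathbf a|}\left\{1 + \frac{1}{c^2}\left[\tfrac12\dot a^2 + 3U^{(a)}(\mathbf a)\right]\right\} + \frac{1}{2c^2}\frac{\partial^2}{\partial t^2}\sum_a\gamma M_a\dot a_i\,|\mathbf r-\mathbf a| \tag{83.28}$$

and

$$\bar S_i = \sum_a\bar S_i^{(aa)} + \sum_{a\neq b}S_i^{(ab)} \tag{83.29}$$

Here the quantities $\bar U_i$ and $\bar S_i$ differ respectively from $U_i$ and
$S_i$ by harmonic terms containing $\varepsilon_a$. We note that
$\varepsilon_a$ enters $\bar U_i$ only through the effective mass (83.12). The
sum $\bar S_i$ is a homogeneous quadratic function of the masses, whereas
$\bar U_i$ depends on them linearly (apart from terms containing
$U^{(a)}(\mathbf a)$).

It remains for us to find an explicit expression for the temporal component
of the gravitational potential, i.e. for the quantity $\mathfrak g^{00}$. In
Section 68 we have already written down the equations determining
$\mathfrak g^{00}$. By (68.43) and (68.41) we have

$$\mathfrak g^{00} = \frac{1}{c} + \frac{4\bar U}{c^3} + \frac{7\bar U^2}{c^5} \tag{83.30}$$

where $\bar U$ is a generalized Newtonian potential satisfying the equation

$$\Delta\bar U - \frac{1}{c^2}\frac{\partial^2\bar U}{\partial t^2} = -4\pi\gamma\,(c^2 + \tfrac12U)\,T^{00} \tag{83.31}$$

This equation has on its right-hand side the expression

$$(c^2 + \tfrac12U)\,T^{00} = \rho + \frac{\rho}{c^2}\left(\tfrac12v^2 + \Pi - \tfrac12U\right) \tag{83.32}$$

which follows from (66.07) and was used in Section 79. The potential $\bar U$
can be evaluated in a manner similar to the procedure followed for $U$ in
Section 77, with the simplification that now the masses are taken to be
spherically symmetric and non-rotating. Using (83.09) and (83.10) we get

$$\int_{(a)}(c^2 + \tfrac12U)\,T^{00}\,(dx)^3 = M_a + \frac{2}{3c^2}\,\varepsilon_a + \frac{M_a}{2c^2}\left\{\dot a^2 - U^{(a)}(\mathbf a)\right\} \tag{83.33}$$

and therefore

$$\bar U = \sum_a\frac{\gamma}{|\mathbf r-\mathbf a|}\left\{m_a + \frac{M_a}{2c^2}\left[\dot a^2 - U^{(a)}(\mathbf a)\right]\right\} + \frac{1}{2c^2}\frac{\partial^2}{\partial t^2}\sum_a\gamma M_a\,|\mathbf r-\mathbf a| \tag{83.34}$$

Insertion of this expression into (83.30) gives $\mathfrak g^{00}$. This
concludes the determination of the gravitational potentials in second
approximation.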

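We must still verify that the expressions obtained for $\mathfrak g^{00}$ and
$\mathfrak g^{0i}$ satisfy the harmonic condition

$$\frac{\partial\mathfrak g^{00}}{\partial t} + \frac{\partial\mathfrak g^{0i}}{\partial x_i} = 0 \tag{83.35}$$

Differentiation of the expressions (83.34) and (83.28) for $\bar U$ and
$\bar U_i$ gives directly

$$\frac{\partial\bar U}{\partial t} + \frac{\partial\bar U_i}{\partial x_i} = \frac{7}{2c^2}\frac{\partial}{\partial x_i}\sum_a\frac{\gamma M_a\dot a_i\,U^{(a)}(\mathbf a)}{|\mathbf r-\mathbf a|} + \frac{1}{c^2}\sum_a\frac{\gamma M_a}{|\mathbf r-\mathbf a|}\left(\dot a_k\ddot a_k - \frac12\frac{dU^{(a)}(\mathbf a)}{dt}\right) \tag{83.36}$$

Further we have by (83.26)

$$\frac{\partial\bar S_i^{(aa)}}{\partial x_i} = -\frac74\,\frac{\partial}{\partial t}\,\frac{\gamma^2M_a^2}{|\mathbf r-\mathbf a|^2} \tag{83.37}$$

In calculating the expression

$$\frac{\partial S_i^{(ab)}}{\partial x_i} = \tfrac12\gamma^2M_aM_b\left\{(3\dot a_j - 4\dot b_j)\frac{\partial^3}{\partial x_i\,\partial a_j\,\partial b_i} + (3\dot b_j - 4\dot a_j)\frac{\partial^3}{\partial x_i\,\partial a_i\,\partial b_j} + 4(\dot a_i + \dot b_i)\frac{\partial^3}{\partial x_i\,\partial a_j\,\partial b_j}\right\}\log s \tag{83.38}$$

we must use equation (82.60) and two further equations obtained from it by
permutation of $a$, $b$ and $x$; in addition we must bear in mind when
transforming (83.38) that $\log s$ and its derivatives depend only on
coordinate differences, so that, for instance

$$\left(\frac{\partial}{\partial x_i} + \frac{\partial}{\partial a_i} + \frac{\partial}{\partial b_i}\right)\log s = 0 \tag{83.39}$$

We can combine all terms in (83.38) which contain
$|\mathbf r-\mathbf a|\,|\mathbf r-\mathbf b|$, then terms containing
$|\mathbf r-\mathbf a|\,|\mathbf a-\mathbf b|$ and finally terms containing
$|\mathbf r-\mathbf b|\,|\mathbf a-\mathbf b|$, getting

$$\frac{\partial S_i^{(ab)}}{\partial x_i} = -\frac74\gamma^2M_aM_b\,\frac{\partial}{\partial t}\,\frac{1}{|\mathbf r-\mathbf a|\,|\mathbf r-\mathbf b|} + \frac14\gamma^2M_aM_b\left\{-7\dot a_i\frac{\partial}{\partial x_i} + (\dot a_i + \dot b_i)\frac{\partial}{\partial b_i}\right\}\frac{1}{|\mathbf r-\mathbf a|\,|\mathbf a-\mathbf b|} + \frac14\gamma^2M_aM_b\left\{-7\dot b_i\frac{\partial}{\partial x_i} + (\dot a_i + \dot b_i)\frac{\partial}{\partial a_i}\right\}\frac{1}{|\mathbf r-\mathbf b|\,|\mathbf a-\mathbf b|} \tag{83.40}$$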

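Summing this expression over $a$ and $b$ (omitting the term $a = b$) and adding
to (83.37) summed over $a$, we get

$$\frac{\partial\bar S_i}{\partial x_i} = -\frac72\,U\frac{\partial U}{\partial t} - \frac72\,\frac{\partial}{\partial x_i}\sum_a\frac{\gamma M_a\dot a_i\,U^{(a)}(\mathbf a)}{|\mathbf r-\mathbf a|} + \sum_a\frac{\gamma M_a}{|\mathbf r-\mathbf a|}\left\{\frac12\frac{dU^{(a)}(\mathbf a)}{dt} - \dot a_i\left(\frac{\partial U^{(a)}(\mathbf r)}{\partial x_i}\right)_a\right\} \tag{83.41}$$

Now if in equation (83.30) $\bar U$ is replaced by $U$ in the correction term
the left-hand side of the harmonic condition (83.35) differs only by the factor
$4/c^3$ from the expression

$$\frac{\partial\bar U}{\partial t} + \frac{\partial\bar U_i}{\partial x_i} + \frac{1}{c^2}\left(\frac72\,U\frac{\partial U}{\partial t} + \frac{\partial\bar S_i}{\partial x_i}\right) = \frac{1}{c^2}\sum_a\frac{\gamma M_a\dot a_i}{|\mathbf r-\mathbf a|}\left\{\ddot a_i - \left(\frac{\partial U^{(a)}(\mathbf r)}{\partial x_i}\right)_a\right\} \tag{83.42}$$

which is obtained by adding to (83.36) the expression (83.41) divided by $c^2$
and transferring one term to the left-hand side.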

But we have

$$M_a\left(\frac{\partial U^{(a)}(\mathbf r)}{\partial x_i}\right)_a = -\frac{\partial\Phi}{\partial a_i} \tag{83.43}$$

where $\Phi$ is the Newtonian potential energy

$$\Phi = -\frac12\sum_{a\neq b}\frac{\gamma M_aM_b}{|\mathbf a-\mathbf b|} \tag{83.44}$$

Therefore, by virtue of Newton’s equations of motion, we have

$$\ddot a_i = \left(\frac{\partial U^{(a)}(\mathbf r)}{\partial x_i}\right)_a \tag{83.45}$$

and the right-hand side of (83.42) vanishes. Thus the harmonic condition
(83.35) is satisfied.

In order to obtain for the gravitational potentials not too complicated
explicit expressions which remain valid in the region of the masses (though
outside the actual masses) we were obliged to limit the discussion of this and
the preceding section rather drastically. We had to assume that the masses
were spherically symmetric and that they did not rotate. In the following
sections, in a discussion of the gravitational potentials at large distances from a
system of bodies, we shall free ourselves from these limitations.


84. Gravitational Potentials at Large Distances from a System of 
Bodies (Spatial Components) 

In this section and the next we shall derive explicit expressions for the 
gravitational potentials which are valid at moderately large distances from a 
system of bodies. We call distances “ moderately large ” if they are large 
compared to the dimensions of the system yet small in comparison with the 
lengths of the waves radiated by the system (see Section 64). Our assumptions 
concerning the internal structure of the system will be as general as those used 
in Section 79 in deriving the integrals of the motion for a system of bodies. 

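We begin with the determination of the spatial components $\mathfrak g^{ik}$.
As in Section 82 we have

$$\mathfrak g^{ik} = -c\,\delta_{ik} + \frac{4}{c^3}\,S_{ik} \tag{84.01}$$

where, as in (82.16) to (82.18),

$$S_{ik} = U_{ik} + V_{ik} \tag{84.02}$$

and the functions $U_{ik}$ and $V_{ik}$ satisfy the equations

$$\Delta U_{ik} = -4\pi\gamma\,(\rho v_iv_k - p_{ik}) \tag{84.03}$$

and

$$\Delta V_{ik} = \tfrac12\,\delta_{ik}\,(\operatorname{grad}U)^2 - \frac{\partial U}{\partial x_i}\frac{\partial U}{\partial x_k} \tag{84.04}$$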

The right-hand side of (84.03) differs from zero only within the masses and,
using mathematical terminology, it has moments of all orders (see Section
70). This means that if it is multiplied by a product of coordinates of any
order and then integrated over all space, the integral is convergent. It is,
therefore, possible to write a solution of (84.03) valid outside the system of
masses as a series of spherical harmonics, i.e. as a multipole expansion. We
shall take as the origin of coordinates some point within the system of masses,
not necessarily their mass centre. We obtain

$$U_{ik} = U_{ik}^0 + U_{ik}^1 + \ldots \tag{84.05}$$

where the first two terms are

$$U_{ik}^0 = \frac{\gamma}{r}\int(\rho v_iv_k - p_{ik})\,(dx)^3 \tag{84.06}$$

and

$$U_{ik}^1 = \frac{\gamma x_j}{r^3}\int(\rho v_iv_k - p_{ik})\,x_j\,(dx)^3 \tag{84.07}$$

On the other hand the right-hand side of (84.04) is different from zero
throughout space and only decreases as the inverse fourth power of distance.
Therefore only a zero order moment exists and the solution of the equation
cannot be represented as a series similar to (84.05).

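To find a solution of (84.04) we introduce into its right-hand side the integral
representation of the Newtonian potential

$$U = \gamma\int\frac{\rho'\,(dx')^3}{|\mathbf r-\mathbf r'|} \tag{84.08}$$

We then obtain a relation which may be written as

$$\Delta V_{ik} = \tfrac12\gamma^2\iint\rho'\rho''\,(dx')^3(dx'')^3\left\{\delta_{ik}\frac{\partial^2}{\partial x'_j\,\partial x''_j} - \frac{\partial^2}{\partial x'_i\,\partial x''_k} - \frac{\partial^2}{\partial x'_k\,\partial x''_i}\right\}\frac{1}{|\mathbf r-\mathbf r'|\,|\mathbf r-\mathbf r''|} \tag{84.09}$$

We know from Section 82 (equations (82.33) and (82.34)) that the function

$$\log s = \log\,(|\mathbf r-\mathbf r'| + |\mathbf r-\mathbf r''| + |\mathbf r'-\mathbf r''|) \tag{84.10}$$

satisfies the equation

$$\Delta\log s = \frac{1}{|\mathbf r-\mathbf r'|\,|\mathbf r-\mathbf r''|} \tag{84.11}$$

Hence it is simple to deduce that the solution of (84.09) that is everywhere
finite and that tends to zero at infinity, may be represented in the form

$$V_{ik} = \tfrac12\gamma^2\iint\rho'\rho''\,(dx')^3(dx'')^3\left\{\delta_{ik}\frac{\partial^2\log s}{\partial x'_j\,\partial x''_j} - \frac{\partial^2\log s}{\partial x'_i\,\partial x''_k} - \frac{\partial^2\log s}{\partial x'_k\,\partial x''_i}\right\} \tag{84.12}$$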

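We are interested in the value of this expression at large distances. To evaluate
it we must find an expansion of $s$ and of $\log s$ in inverse powers of $r$
which is valid for large $r$ and finite $r'$ and $r''$. We have

$$s = 2r + |\mathbf r'-\mathbf r''| - \frac{x_j(x'_j + x''_j)}{r} + \frac{1}{2r}\left(r'^2 + r''^2 - \frac{(x_jx'_j)^2 + (x_jx''_j)^2}{r^2}\right) + \ldots \tag{84.13}$$

whence

$$\log s = \log 2r + \frac{1}{2r}\left(|\mathbf r'-\mathbf r''| - \frac{x_j(x'_j + x''_j)}{r}\right) + \frac{1}{8r^2}\,|\mathbf r'+\mathbf r''|^2 + \frac{x_j(x'_j + x''_j)}{4r^3}\,|\mathbf r'-\mathbf r''| - \frac{3(x_jx'_j)^2 + 3(x_jx''_j)^2 + 2(x_jx'_j)(x_kx''_k)}{8r^4} + \ldots \tag{84.14}$$

By differentiation we obtain the symmetrized and antisymmetrized second
derivatives with respect to $x'_i$ and $x''_k$:

$$\frac12\left(\frac{\partial^2\log s}{\partial x'_i\,\partial x''_k} + \frac{\partial^2\log s}{\partial x'_k\,\partial x''_i}\right) = \frac{1}{4r^2}\left(\delta_{ik} - \frac{x_ix_k}{r^2}\right) - \left(\frac{1}{2r} + \frac{x_j(x'_j + x''_j)}{4r^3}\right)\left[\frac{\delta_{ik}}{|\mathbf r'-\mathbf r''|} - \frac{(x'_i - x''_i)(x'_k - x''_k)}{|\mathbf r'-\mathbf r''|^3}\right] \tag{84.15}$$

and

$$\frac12\left(\frac{\partial^2\log s}{\partial x'_i\,\partial x''_k} - \frac{\partial^2\log s}{\partial x'_k\,\partial x''_i}\right) = \frac{x_k(x'_i - x''_i)}{4r^3\,|\mathbf r'-\mathbf r''|} - \frac{x_i(x'_k - x''_k)}{4r^3\,|\mathbf r'-\mathbf r''|} \tag{84.16}$$

Here we have discarded terms that decrease more rapidly than $1/r^2$. We put
$i = l$ and $k = l$ in (84.15) and sum over $l$. This gives

$$\frac{\partial^2\log s}{\partial x'_l\,\partial x''_l} = \frac{1}{2r^2} - \left(\frac{1}{r} + \frac{x_j(x'_j + x''_j)}{2r^3}\right)\frac{1}{|\mathbf r'-\mathbf r''|} \tag{84.17}$$

in agreement with the exact relation

$$\frac{\partial^2\log s}{\partial x'_l\,\partial x''_l} = \frac{1}{2\,|\mathbf r-\mathbf r'|\,|\mathbf r-\mathbf r''|} - \frac12\left(\frac{1}{|\mathbf r-\mathbf r'|} + \frac{1}{|\mathbf r-\mathbf r''|}\right)\frac{1}{|\mathbf r'-\mathbf r''|} \tag{84.18}$$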
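
The leading terms of the large-distance expansion of the perimeter s, namely s ≈ 2r + |r' - r''| - n.(r' + r'') with n = r/|r|, can be compared with the exact value for increasing r. The sketch below uses arbitrary source points and an arbitrary unit direction (both assumptions made for illustration); the neglected remainder should fall off like 1/r.

```python
import math

def dist(p, q):
    """Euclidean distance between two points given as 3-sequences."""
    return math.sqrt(sum((pi - qi) ** 2 for pi, qi in zip(p, q)))

def perimeter(r, r1, r2):
    """Exact perimeter of the triangle with vertices r, r1, r2."""
    return dist(r, r1) + dist(r, r2) + dist(r1, r2)

def perimeter_far(r, r1, r2):
    """Leading far-field terms: 2|r| + |r1 - r2| - n.(r1 + r2)."""
    big_r = math.sqrt(sum(x * x for x in r))
    n_dot_sum = sum(x * (p + q) for x, p, q in zip(r, r1, r2)) / big_r
    return 2.0 * big_r + dist(r1, r2) - n_dot_sum

# Arbitrary source points inside the system and a fixed unit direction n.
r1 = (0.3, -0.2, 0.5)
r2 = (-0.4, 0.1, 0.2)
n = (2.0 / 3.0, -1.0 / 3.0, 2.0 / 3.0)

errors = []
for big_r in (10.0, 100.0, 1000.0):
    r = tuple(big_r * c for c in n)
    errors.append(abs(perimeter(r, r1, r2) - perimeter_far(r, r1, r2)))
print(errors)
```

Each tenfold increase of the distance reduces the discrepancy by roughly a factor of ten, consistent with the first neglected term being of order 1/r.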

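We insert (84.15) and (84.17) into (84.12) and consider first the terms that
decrease as $1/r$. Denoting the corresponding terms in $V_{ik}$ by $V_{ik}^0$
we obtain

$$V_{ik}^0 = -\frac{\gamma^2}{2r}\iint\rho'\rho''\,(dx')^3(dx'')^3\,\frac{(x'_i - x''_i)(x'_k - x''_k)}{|\mathbf r'-\mathbf r''|^3} \tag{84.19}$$

Only the part of the integrand symmetric in $x'$ and $x''$ is important. We
therefore replace the factor $x'_i - x''_i$ by $2x'_i$ and perform the
integration over $x''$. This gives

$$V_{ik}^0 = \frac{\gamma}{r}\int\rho\,x_i\,\frac{\partial U}{\partial x_k}\,(dx)^3 \tag{84.20}$$

or, since this expression is known to be symmetric in $i$ and $k$

$$V_{ik}^0 = \frac{\gamma}{2r}\int\rho\left(x_i\frac{\partial U}{\partial x_k} + x_k\frac{\partial U}{\partial x_i}\right)(dx)^3 \tag{84.21}$$

(see also the lemma in Section 79). We note that this expression is also equal to

$$V_{ik}^0 = -\frac{1}{4\pi r}\int\left\{\tfrac12\,\delta_{ik}\,(\operatorname{grad}U)^2 - \frac{\partial U}{\partial x_i}\frac{\partial U}{\partial x_k}\right\}(dx)^3 \tag{84.22}$$

as was to be expected because $V_{ik}$ satisfies (84.04) and the volume integral
of the right-hand side of that equation is finite.

We now examine the terms in (84.15) and (84.17) which decrease as $1/r^2$.
Among them are some that are independent of $x'$ and $x''$. These contribute to
$V_{ik}$ the quantity

$$\frac{\gamma^2M^2\,x_ix_k}{4r^4} \tag{84.23}$$

where $M$ is the total mass of the system.

The remaining terms of order $1/r^2$ are of dipole character. They are

$$V_{ik}^1 = -\frac{\gamma^2x_j}{4r^3}\iint\rho'\rho''\,(dx')^3(dx'')^3\,\frac{(x'_j + x''_j)(x'_i - x''_i)(x'_k - x''_k)}{|\mathbf r'-\mathbf r''|^3} \tag{84.24}$$

But we have the identity

$$\left[x'_ix'_j(x'_k - x''_k) + x'_kx'_j(x'_i - x''_i) - x'_ix'_k(x'_j - x''_j)\right] + \left[x''_ix''_j(x''_k - x'_k) + x''_kx''_j(x''_i - x'_i) - x''_ix''_k(x''_j - x'_j)\right] = (x'_j + x''_j)(x'_i - x''_i)(x'_k - x''_k) \tag{84.25}$$

in which the two expressions in square brackets on the left are related by the
interchange of $x'$ and $x''$. When performing the integration in (84.24) we can
therefore replace the factor (84.25) by twice one of the square brackets. We
then perform the integration over the variable that enters the square bracket
linearly, so obtaining

$$V_{ik}^1 = \frac{\gamma x_j}{2r^3}\int\rho\left(x_ix_j\frac{\partial U}{\partial x_k} + x_kx_j\frac{\partial U}{\partial x_i} - x_ix_k\frac{\partial U}{\partial x_j}\right)(dx)^3 \tag{84.26}$$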

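We now combine those terms in $U_{ik}$ and $V_{ik}$ that have the same multipole
character. Putting

$$S_{ik}^0 = U_{ik}^0 + V_{ik}^0 \tag{84.27}$$

and adding (84.06) and (84.21) we get

$$S_{ik}^0 = \frac{\gamma}{2r}\int\left\{2\rho v_iv_k - 2p_{ik} + \rho\left(x_i\frac{\partial U}{\partial x_k} + x_k\frac{\partial U}{\partial x_i}\right)\right\}(dx)^3 \tag{84.28}$$

Writing the term involving $p_{ik}$ in the form

$$-2\int p_{ik}\,(dx)^3 = \int\left(x_i\frac{\partial p_{jk}}{\partial x_j} + x_k\frac{\partial p_{ji}}{\partial x_j}\right)(dx)^3 \tag{84.29}$$

and using the equations of motion for the internal problem

$$\rho w_i = \rho\,\frac{\partial U}{\partial x_i} + \frac{\partial p_{ji}}{\partial x_j} \tag{84.30}$$

we obtain from (84.28)

$$S_{ik}^0 = \frac{\gamma}{2r}\int\rho\,(x_iw_k + x_kw_i + 2v_iv_k)\,(dx)^3 \tag{84.31}$$

or

$$S_{ik}^0 = \frac{\gamma}{2r}\frac{d^2}{dt^2}\int\rho\,x_ix_k\,(dx)^3 \tag{84.32}$$

Thus, the quantity $S_{ik}^0$ has been finally transformed to a form involving
the second time derivative of the corresponding moment of inertia.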
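
The structure of this result, a leading term proportional to the second time derivative of the second moment of the mass distribution, can be illustrated for a toy system. The sketch below replaces the integral over the density by a sum over two equal point masses on a circular orbit; this discretisation, the parameter values, the choice gamma = 1 and all function names are assumptions made for illustration only.

```python
import math

# Illustrative parameters: mass of each body, orbital radius, angular velocity.
M, R, OMEGA = 1.0, 1.0, 2.0

def positions(t):
    """Two equal point masses at opposite ends of a rotating diameter."""
    c, s = math.cos(OMEGA * t), math.sin(OMEGA * t)
    return [(R * c, R * s, 0.0), (-R * c, -R * s, 0.0)]

def second_moment(t):
    """D_ik = sum over bodies of m x_i x_k (discrete stand-in for the integral)."""
    D = [[0.0] * 3 for _ in range(3)]
    for x in positions(t):
        for i in range(3):
            for k in range(3):
                D[i][k] += M * x[i] * x[k]
    return D

def second_time_derivative(t, h=1e-3):
    """Central-difference d^2 D_ik / dt^2."""
    Dp, D0, Dm = second_moment(t + h), second_moment(t), second_moment(t - h)
    return [[(Dp[i][k] - 2.0 * D0[i][k] + Dm[i][k]) / h**2
             for k in range(3)] for i in range(3)]

def s0(t, r, gamma=1.0):
    """Leading spatial term gamma/(2r) * d^2 D_ik / dt^2 at distance r."""
    dd = second_time_derivative(t)
    return [[gamma / (2.0 * r) * dd[i][k] for k in range(3)] for i in range(3)]

t = 0.3
dd = second_time_derivative(t)
# Closed forms for this orbit: D_xx = M R^2 (1 + cos 2wt), D_xy = M R^2 sin 2wt.
analytic_xx = -4.0 * OMEGA**2 * M * R**2 * math.cos(2.0 * OMEGA * t)
analytic_xy = -4.0 * OMEGA**2 * M * R**2 * math.sin(2.0 * OMEGA * t)
print(dd[0][0], analytic_xx, dd[0][1], analytic_xy)
```

The term oscillates at twice the orbital frequency, as expected for a quantity built from products of two coordinates; `s0(t, r)` then gives the corresponding 1/r contribution at distance r.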

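We now transform the quantity

$$S_{ik}^1 = U_{ik}^1 + V_{ik}^1 \tag{84.33}$$

From (84.07) and (84.26) we get

$$S_{ik}^1 = \frac{\gamma x_j}{2r^3}\int\left\{2\rho v_iv_kx_j - 2p_{ik}x_j + \rho\left[x_ix_j\frac{\partial U}{\partial x_k} + x_kx_j\frac{\partial U}{\partial x_i} - x_ix_k\frac{\partial U}{\partial x_j}\right]\right\}(dx)^3 \tag{84.34}$$

Using the identity

$$\frac{\partial}{\partial x_s}\,(x_jx_ip_{ks} + x_jx_kp_{is} - x_ix_kp_{js}) = 2x_jp_{ik} + x_jx_i\frac{\partial p_{ks}}{\partial x_s} + x_jx_k\frac{\partial p_{is}}{\partial x_s} - x_ix_k\frac{\partial p_{js}}{\partial x_s} \tag{84.35}$$

we represent the integral involving $p_{ik}$ as

$$-2\int p_{ik}\,x_j\,(dx)^3 = \int\left(x_jx_i\frac{\partial p_{ks}}{\partial x_s} + x_jx_k\frac{\partial p_{is}}{\partial x_s} - x_ix_k\frac{\partial p_{js}}{\partial x_s}\right)(dx)^3 \tag{84.36}$$

With the aid of the equations of motion (84.30) we can then write

$$S_{ik}^1 = \frac{\gamma x_j}{2r^3}\int\rho\,(2x_jv_iv_k + x_jx_iw_k + x_jx_kw_i - x_ix_kw_j)\,(dx)^3 \tag{84.37}$$

or

$$S_{ik}^1 = \frac{\gamma x_j}{2r^3}\frac{d}{dt}\int\rho\,(x_jx_iv_k + x_jx_kv_i - x_ix_kv_j)\,(dx)^3 \tag{84.38}$$

Hence, the equation

$$S_{ik} = S_{ik}^0 + S_{ik}^1 + \frac{\gamma^2M^2\,x_ix_k}{4r^4} \tag{84.39}$$

gives us $S_{ik}$ and if we insert this value of $S_{ik}$ into (84.01) we obtain
the following final expression for $\mathfrak g^{ik}$:

$$\mathfrak g^{ik} = -c\,\delta_{ik} + \frac{2\gamma}{c^3r}\frac{d^2}{dt^2}\int\rho\,x_ix_k\,(dx)^3 + \frac{2\gamma x_j}{c^3r^3}\frac{d}{dt}\int\rho\,(x_jx_iv_k + x_jx_kv_i - x_ix_kv_j)\,(dx)^3 + \frac{\gamma^2M^2\,x_ix_k}{c^3r^4} \tag{84.40}$$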

We have obtained explicit expressions for the spatial components of the
gravitational potential which are valid at large distances (in the sense defined
above) from the system of bodies. Among the terms containing $c^3$ in the
denominator those have been omitted which decrease more rapidly than $1/r^2$.

We note that for a single concentrated mass the expression (84.40) reduces 
to that obtained by rigorous solution of the gravitational equations (equation 
(58.12)). 


85. Gravitational Potentials at Large Distances from a System of 
Bodies (Mixed and Temporal Components) 

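To determine the mixed components at large distances we return to the
equations given at the beginning of Section 83. We have

$$\frac{1}{2c}\,\Delta\mathfrak g^{0i} - \frac{1}{2c^3}\frac{\partial^2\mathfrak g^{0i}}{\partial t^2} - \frac{2}{c^6}\,Q_i = -\frac{8\pi\gamma}{c^4}\,(c^2+4U)\,T^{0i} \tag{85.01}$$

where

$$Q_i = -3\,\frac{\partial U}{\partial t}\frac{\partial U}{\partial x_i} + 4\left(\frac{\partial U_i}{\partial x_j} - \frac{\partial U_j}{\partial x_i}\right)\frac{\partial U}{\partial x_j} \tag{85.02}$$

and, according to (66.07)

$$(c^2+4U)\,T^{0i} = \rho v_i\left\{1 + \frac{1}{c^2}\left(\tfrac12v^2 + \Pi + 3U\right)\right\} - \frac{1}{c^2}\,p_{ik}v_k \tag{85.03}$$

As before, we put

$$\mathfrak g^{0i} = \frac{4U_i}{c^3} + \frac{4S_i}{c^5} \tag{85.04}$$

and subject $U_i$ and $S_i$ to the equations

$$\Delta U_i - \frac{1}{c^2}\frac{\partial^2U_i}{\partial t^2} = -4\pi\gamma\,(c^2+4U)\,T^{0i} \tag{85.05}$$

and

$$\Delta S_i - \frac{1}{c^2}\frac{\partial^2S_i}{\partial t^2} = Q_i \tag{85.06}$$

As in (75.18) we introduce the quantity

$$W = \tfrac12\gamma\int\rho'\,|\mathbf r-\mathbf r'|\,(dx')^3 \tag{85.07}$$

and also

$$W_i = \tfrac12\gamma\int(\rho v_i)'\,|\mathbf r-\mathbf r'|\,(dx')^3 \tag{85.08}$$

We can then write

$$U_i = \gamma\int\frac{\left[(c^2+4U)\,T^{0i}\right]'}{|\mathbf r-\mathbf r'|}\,(dx')^3 + \frac{1}{c^2}\frac{\partial^2W_i}{\partial t^2} \tag{85.09}$$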

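Here the last term is a correction for retardation. We now perform the
expansion of the integral into multipoles and confine ourselves to the first
two terms and an approximate value of the third term. This gives

$$U_i = \frac{\gamma}{r}\int(c^2+4U)\,T^{0i}\,(dx)^3 + \frac{\gamma x_j}{r^3}\int(c^2+4U)\,T^{0i}\,x_j\,(dx)^3 + \frac{\partial^2}{\partial x_j\,\partial x_k}\,\frac{\gamma}{2r}\int\rho v_i\,x_jx_k\,(dx)^3 + \frac{1}{c^2}\frac{\partial^2W_i}{\partial t^2} \tag{85.10}$$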


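In this expression terms containing the momentum and the angular momentum of
the system can be separated off. According to (79.19) and (79.34)
those constants of the motion can be expressed in terms of the function $G_i$
which, according to (79.18), is related to the quantity in (85.03) by

$$(c^2+4U)\,T^{0i} = G_i + \frac{\rho}{c^2}\left(4U_i + \frac{\partial^2W}{\partial x_i\,\partial t}\right) \tag{85.11}$$

We then have

$$P_i = \int G_i\,(dx)^3 \tag{85.12}$$

and

$$M_{ik} = \int(x_iG_k - x_kG_i)\,(dx)^3 \tag{85.13}$$

We shall perform the transformation indicated after we have evaluated $S_i$
because a number of terms in this quantity can be combined with terms of $U_i$.

In the equation (85.06) for $S_i$ we again neglect the second time derivative
and write

$$\Delta S_i = Q_i \tag{85.14}$$

Just as was done in (84.09), $Q_i$ can be written as a twofold volume integral

$$Q_i = \gamma^2\iint(dx')^3(dx'')^3\,(\rho v_k)'\rho''\left\{-\frac12\left(\frac{\partial^2}{\partial x'_i\,\partial x''_k} + \frac{\partial^2}{\partial x'_k\,\partial x''_i}\right) + 4\,\delta_{ik}\frac{\partial^2}{\partial x'_j\,\partial x''_j} - \frac72\left(\frac{\partial^2}{\partial x'_i\,\partial x''_k} - \frac{\partial^2}{\partial x'_k\,\partial x''_i}\right)\right\}\frac{1}{|\mathbf r-\mathbf r'|\,|\mathbf r-\mathbf r''|} \tag{85.15}$$

whence we immediately obtain the solution of (85.14) as

$$S_i = \gamma^2\iint(dx')^3(dx'')^3\,(\rho v_k)'\rho''\left\{-\frac12\left(\frac{\partial^2}{\partial x'_i\,\partial x''_k} + \frac{\partial^2}{\partial x'_k\,\partial x''_i}\right) + 4\,\delta_{ik}\frac{\partial^2}{\partial x'_j\,\partial x''_j} - \frac72\left(\frac{\partial^2}{\partial x'_i\,\partial x''_k} - \frac{\partial^2}{\partial x'_k\,\partial x''_i}\right)\right\}\log s \tag{85.16}$$

Here we must insert the expressions (84.15) and (84.16) and integrate over
$x'$ and $x''$.

We first split off the term that decreases as $1/r$. It can be written as

$$S_i^0 = \frac{\gamma}{r}\left\{-4\int\rho\,U_i\,(dx)^3 + \int\rho v_k\,\frac{\partial^2W}{\partial x_i\,\partial x_k}\,(dx)^3\right\} \tag{85.17}$$

On the other hand, if the obvious relation

$$\int\rho\,\frac{\partial W}{\partial x_i}\,(dx)^3 = 0 \tag{85.18}$$

is differentiated with respect to time, it gives

$$\int\rho\left(\frac{\partial^2W}{\partial x_i\,\partial t} + v_k\frac{\partial^2W}{\partial x_i\,\partial x_k}\right)(dx)^3 = 0 \tag{85.19}$$

and this enables us to write

$$S_i^0 = -\frac{4\gamma}{r}\int\rho\,U_i\,(dx)^3 - \frac{\gamma}{r}\int\rho\,\frac{\partial^2W}{\partial x_i\,\partial t}\,(dx)^3 \tag{85.20}$$

We now consider the terms in (85.16) which decrease as $1/r^2$. Among these
there are some in which the integrand does not depend on $x'$ or $x''$. These
give

$$\frac{7\gamma^2MP_i}{4r^2} + \frac{\gamma^2MP_k\,x_ix_k}{4r^4} \tag{85.21}$$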


The remaining terms of order $1/r^2$ are of dipole character. Denoting them by
$S_i^1$, we have


J t' 3 J| (p»,)' p’ firf (<&') W 

1 rf, jj (pwlV + (&T(t-). 

l T‘ 3 8« J J <P»*)V (*0 W I 


-Iv 2 ^ 


(85.22) 


We write this expression in the form

$$S_i^1 = \frac{\gamma x_j}{r^3}\,(A_{ji} + B_{ji}) \tag{85.23}$$

where $A_{ji}$ and $B_{ji}$ are, respectively, the antisymmetric and the
symmetric parts of the coefficient.

$$A_{ij} = -A_{ji}\,;\qquad B_{ij} = B_{ji} \tag{85.24}$$

We then have 

A]i = ~ i' Y /J {pV{) ' P " \r'-'F\ 

+ \ Y J J (P^)'P" (dx'Y(dxy 

-|t/J (pv>)Y (*0W* (85.25) 

and 

(p^)'p" |p 4 ?m ( dx ') 3 ( dx y 

- \ Y / J (f**)V y^A^ {dx’) 3 (dx") s 
-jJJ( p«*)V ^ **W )W ) 8 

+ I yS« J J (p^)'p" (dx'ndx'T (85.26) 

After some manipulation we can express Aji as 

f If/ 3 2 W 0 8 W \ 

■4„ = -2j PW-*,!/,)(*)■- 5 J p 

(85.27) 

Here we have used the relation

$$\frac{\partial W}{\partial t} + \frac{\partial W_k}{\partial x_k} = 0 \tag{85.28}$$

which follows from the definitions of $W$ and $W_i$ in (85.07) and (85.08)
together with the equation of continuity.

For the symmetric part Bji of the dipole coefficient we get 

7 r i r / 3TJ dU\ 

Bji = - - J p (X]Vi + x ( Vj)U (dx) 3 + -J px { Xj\VfT 7 —^ - — j(da!) 3 

7 r 8W 

+ 2 §W J p ~dT (dx)3 {85 - 29) 

Here we have used the relation (85.28) and the analogous relation for the 
potentials U and U k . Jt remains to form the sum TJt + (l/c 2 )*3< and to combine 
similar terms. Denoting by U 3 the first term in (85.10) and using (85.20) we 
obtain 

=;J>’ + 4D > r “ * M i0 ' + S)K (86 ' 30) 

But, by virtue of (85.11) and (85.12), this can be expressed as 

1 yPi 

* pL 6 r 


(85.31) 
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where P, i the total momentum of the system, including corrections of the 
order 1 /•}*. We now consider those parts of the dipole terms which have anti- 
symmetn 5 coefficients. It follows from (85.27), (85.11) and (85.13) that they 
are equal 

g{* J (c* + 417)(*,T« - x t T°l)(dx) 3 +1 Ajt j = g M, t (85.32) 

where Mjt is the total angular momentum of the system, including relativistic 
corrections. 

The complete set of dipole terms can be written in the form 

^ 1 ’+^ 1, =g &»+*„) (85-33) 

where f)n is the symmetric part of the coefficient. The use of a dotted symbol 
for it anticipates the result that this quantity can be represented as a time 
derivative of a certain quantity Djt which has a simple physical meaning. 
Comparing the last equation with (85.10) and (85.23) we can express t>n as 

1) ]( = J (c 2 + iU^T^x, + T°h t ){dxf + J B i( (85.34) 


If we insert the value of $T^{0i}$ given by (85.03) and the value of $B_{ji}$ given by
(85.29) we get


Dji = J p(viX) + VjXi){dxf + -^ J p + n 

— 1 U)(vtXj + vjXi)(dx) 3 


If 1 

-- ~ 2 J {paVkX) + p]kVkXi){dxf + 2 ^ 

r i su sm,, „ 

1 



7 r 8W , 

(85.35) 


+ P S »J 

Adding to this the expression 



i r / 8U 

7t )xw(w-P Wk - 

i£h’=° 

(85.36) 


which vanishes by virtue of the internal equations of motion, 7 and using the 
relation 



(85.37) 


as well as equation (79.04) for the total time derivative of the elastic energy $\Pi$,
we can write $\dot{D}_{ji}$ in the form

$$\dot{D}_{ji} = \frac{dD_{ji}}{dt} \tag{85.38}$$


where 


Dn — 



lU)}(dx) 3 + j 9 W(dx)* 

(85.39) 


The first term in this expression is the moment of inertia of the system of 
bodies, calculated with inclusion of the masses of the kinetic and potential 
energies. 
Now we can write down the complete expression for $U_i + (1/c^2) S_i$. According
to (85.10), (85.21), (85.31) and (85.33) we have

77 i 1 o t p dI) n 

c 2 r 2r 3 2r® cm 


8 2 


8xj dxk 2 r 


J* pviXjX/c(dx) 


3 7y 2 MP i yHlPicXjXk 0 2 iy< 
4cV 2 4c 2 ? -4 c 2 Si 2 


and therefore 


(85.40) 


4y 2va:7 a 2 2y _ 

9 01 = - 5 - + -yry ~ v—xr -j- 

c 3 r c s r® axj dt c 3 r 


8 2 2y 


a*; 0a:* c®r 


pViX}Xic(dx) 


3 7y 2 MP ( y 2 MPicXiXk 

• * 


4 d 2 Wi 


c°r‘ 


c°r* 


c 6 dt 2 
(85.41) 

It remains to find the temporal component $\mathfrak{g}^{00}$, which by equation (83.30)
is expressible in terms of the generalized Newtonian potential $\bar{U}$. By (83.31)
and (83.32)

s 2 y D 

Pjk 


V = y^- + — MX] +1 

r r 3 2 8xj 8a :* r 


8 3 


Here M is the total mass 


1 


6 8x] 8xjc 8x1 


Y f 1 8 2 W 

- J p x } x k xi{dx) 3 (85.42) 


$$M = \int \rho \left[ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - \tfrac{1}{2} U \right) \right] (dx)^3 \tag{85.43}$$


which is constant by virtue of (79.45), and the $X_j$ are the mass centre coordinates

$$M X_j = \int \rho\, x_j \left[ 1 + \frac{1}{c^2}\left( \tfrac{1}{2} v^2 + \Pi - \tfrac{1}{2} U \right) \right] (dx)^3 \tag{85.44}$$

which, by (79.59), are linear functions of the time. The quantities $D_{jk}$ are defined
by (85.39). The last term in that equation, which is proportional to $\delta_{jk}$, drops
out of the expression (85.42). In the octopole moments we have discarded
relativistic corrections. The last term in (85.42) is a retardation correction.

Inserting the expression obtained for $\bar{U}$ into the equation

$$\mathfrak{g}^{00} = \frac{1}{c} + \frac{4\bar{U}}{c^3} + \frac{7\bar{U}^2}{c^5} \tag{85.45}$$


we obtain $\mathfrak{g}^{00}$ as

„„ 1 4yM 

9°° = —I— 
c ch 


4y xj 


c?r 3 


MX] + 


0 2 2y 
dx] dx/c c 3 r 


Djk — 


8 3 


2y 


8x1 8xj 8x k 3 c 3 r 


J pXiX]Xk(dx) 3 


7y 2 M 2 Uy 2 M 2 X,X] 4 8 2 W 

c 5 r 2 cV 4 c 5 dt 2 


(85.46) 


It is easy to verify by direct insertion that this expression, together with the
expression (85.41) for $\mathfrak{g}^{0i}$, satisfies the relation

$$\frac{\partial \mathfrak{g}^{00}}{\partial t} + \frac{\partial \mathfrak{g}^{0i}}{\partial x_i} = 0 \tag{85.47}$$

exactly. In the equation (84.40) for the spatial components, which may be
written as


a 2 2y r 

9 lk = - cS ifc + — — ox iXk (dx) 3 
ot 2 c 3 r J 

d 2 2v f y 2 M 2 XiX] C 

- — — — p(xjXtv k + — x t x k vj)(dx ) 3 + -——— (85.48) 

O'Xif Ot C T J 0 T 

terms of order 1/c 3 (more precisely g^/c 3 ) have been retained, but higher terms 
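have been dropped. If the quantity $\mathfrak{g}^{0i}$ is taken to the same accuracy it is easy
to verify that the relation

$$\frac{\partial \mathfrak{g}^{0i}}{\partial t} + \frac{\partial \mathfrak{g}^{ik}}{\partial x_k} = 0 \tag{85.49}$$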


is also satisfied. 


86. Solution of the Wave Equation in the Wave Zone 

The expressions derived above for the gravitational potentials are valid at
“moderately large” distances from a system of bodies. As was explained at
the beginning of Section 84, this term is understood to mean distances that are
large compared to the dimensions of the system, but small compared to the
length of the waves radiated by the system. If, on the other hand, the distances
are large compared to these wavelengths one is concerned with the so-called
“wave-zone”. In the wave-zone the term in the wave equation involving the
second time derivative may no longer be treated as a correction and the method
of solving the equation must be different.

In the case of the Solar System “moderately large” distances extend
as far as the nearest stars, i.e. into a region of space in which the system
may no longer be treated as isolated. In practice, therefore, the study of the
Solar System does not take one beyond moderately large distances. However,
in some theoretical questions, such as that of the emission of gravitational
waves, or that of the uniqueness of the solution of the gravitational equations,
one must consider distances which belong to the wave-zone, and are arbitrarily
great in a mathematical sense.

Before going on to the solution of Einstein’s equations for arbitrarily large
distances we shall clarify the notions of “moderately large distances” and of
the “wave-zone” with the aid of the example of a simple wave equation with
an inhomogeneous term

$$\Delta\psi - \frac{1}{c^2}\frac{\partial^2 \psi}{\partial t^2} = -4\pi\sigma \tag{86.01}$$

The quantity corresponding to $\psi$ in gravitational theory will be the difference
between $\mathfrak{g}^{\mu\nu}$ and its limiting value at infinity.

If the “density” $\sigma$ is taken to be known, the solution of the wave equation
which is of interest to us may be stated as the retarded potential

$$\psi = \int \frac{[\sigma]}{|\mathbf{r} - \mathbf{r}'|}\,(dx')^3 \tag{86.02}$$

where

$$[\sigma] = \sigma(t', \mathbf{r}') \tag{86.03}$$

with

$$t' = t - \frac{1}{c}\,|\mathbf{r} - \mathbf{r}'| \tag{86.04}$$
The fact that we choose this solution and not any other corresponds to our
conception that the system is isolated and that the only sources of waves shall
be the bodies that make up the system. The exact form of initial conditions 
is of no importance; it is sufficient to assume that the initial disturbance was 
concentrated in a finite region around the system and that we are dealing with 
such times and distances at which the initial disturbance has dispersed. 

We assume that the density $\sigma$ possesses both time derivatives of various
orders and “moments” of various orders, for which we introduce the notation

$$\mu_0(\tau) = \int \sigma(\tau, \mathbf{r}')\,(dx')^3$$

$$\mu_i(\tau) = \int x_i'\,\sigma(\tau, \mathbf{r}')\,(dx')^3 \tag{86.05}$$

$$\mu_{ik}(\tau) = \int x_i' x_k'\,\sigma(\tau, \mathbf{r}')\,(dx')^3$$

where $\tau$ is a quantity independent of $\mathbf{r}'$.

The technique of solving the wave equation used in the preceding sections
amounted to an expansion of the argument $t'$ of $\sigma$ and of $\sigma$ itself in inverse
powers of $c$. Since such an expansion leads to the appearance of increasing
positive powers of $|\mathbf{r} - \mathbf{r}'|$ in the integrands, this method clearly gives a well
converging series only for “moderately large” distances $r$. (The region of
“moderately large” distances can, in fact, be more precisely defined as the
region in which the series converges rapidly.) Each term of the series can in
turn be expanded in inverse powers of $r$, and then the coefficients of the
expansion will involve the “moments” (86.05) taken at $\tau = t$.

But we can instead split off the quantity

$$\tau = t - \frac{r}{c} \tag{86.06}$$

in the expression (86.04) for $t'$, which then appears in the form

$$t' = \tau + \frac{1}{c}\,\bigl(r - |\mathbf{r} - \mathbf{r}'|\bigr) \tag{86.07}$$

If then $t'$ and $\sigma$ are expanded in powers of $1/c$, only in so far as this quantity is
not involved in $\tau$, we obtain a series of a different form, and one that converges
for arbitrarily large values of $r$.

If in this series we take all terms which have the slowest decrease with
increasing $r$, we obtain an expression for $\psi$ which is valid in the wave-zone, i.e.
for very large values of $r$. The retention of none but the most slowly decreasing
terms corresponds to the replacement of the quantity (86.07) by

$$t' = \tau + \frac{\mathbf{n}\cdot\mathbf{r}'}{c} \tag{86.08}$$

where

$$\mathbf{n} = \frac{\mathbf{r}}{r}; \qquad n_i = \frac{x_i}{r} \tag{86.09}$$

is a unit vector in the direction of $\mathbf{r}$. In this way we obtain for $\psi$ an expression
of the form

$$\psi = \frac{1}{r}\,\mu(\tau, \mathbf{n}) \tag{86.10}$$

where the function

$$\mu(\tau, \mathbf{n}) = \int \sigma\!\left(\tau + \frac{\mathbf{n}\cdot\mathbf{r}'}{c},\ \mathbf{r}'\right)(dx')^3 \tag{86.11}$$

depends on three arguments, namely the quantity $\tau$ and the two angles that
determine $\mathbf{n}$. We note that if $\mu$ is an arbitrary function of these three arguments,
$\psi$ is an approximate solution of the homogeneous wave equation.
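
The replacement of (86.07) by (86.08) can be checked numerically. The sketch below is only an illustration under stated assumptions: units with $c = 1$, a single source point $\mathbf{r}'$, and the hypothetical helper names `retarded_time_exact`, `retarded_time_wave_zone` and `wave_zone_error`, none of which come from the text. It compares the exact retarded time (86.04) with the wave-zone form (86.08); the difference should fall off roughly like $r'^2/(2rc)$ when $\mathbf{r}'$ is perpendicular to $\mathbf{n}$.

```python
import math


def retarded_time_exact(t, r_vec, rp_vec, c=1.0):
    """t' = t - |r - r'|/c, equation (86.04)."""
    return t - math.dist(r_vec, rp_vec) / c


def retarded_time_wave_zone(t, r_vec, rp_vec, c=1.0):
    """t' = tau + (n . r')/c with tau = t - r/c, equations (86.06) and (86.08)."""
    r = math.hypot(*r_vec)
    n = [x / r for x in r_vec]          # unit vector (86.09)
    tau = t - r / c
    return tau + sum(ni * xi for ni, xi in zip(n, rp_vec)) / c


def wave_zone_error(r, rp_vec, c=1.0):
    """Absolute difference between the two retarded times for a field point on the x axis."""
    r_vec = (r, 0.0, 0.0)
    exact = retarded_time_exact(0.0, r_vec, rp_vec, c)
    approx = retarded_time_wave_zone(0.0, r_vec, rp_vec, c)
    return abs(exact - approx)


if __name__ == "__main__":
    source_point = (0.0, 1.0, 0.0)      # assumed source offset, perpendicular to n
    for r in (1e2, 1e3, 1e4):
        print(r, wave_zone_error(r, source_point))
```

The error shrinks in proportion to $1/r$, which is why only the most slowly decreasing terms need be kept in the wave-zone.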

If the moments (86.05) exist we can expand $\mu$ as follows:

$$\mu(\tau, \mathbf{n}) = \mu_0 + \frac{n_i}{c}\frac{\partial \mu_i}{\partial \tau} + \frac{n_i n_k}{2c^2}\frac{\partial^2 \mu_{ik}}{\partial \tau^2} + \dots \tag{86.12}$$

or

$$\mu(\tau, \mathbf{n}) = \mu_0 + \frac{n_i}{c}\,\dot{\mu}_i + \frac{n_i n_k}{2c^2}\,\ddot{\mu}_{ik} + \dots \tag{86.13}$$

where the dots denote differentiation of the quantities (86.05) with respect to
their argument $\tau$, or, equivalently, with respect to time.

The region in which the solution of the wave equation for $\psi$ can accurately
be given in the form (86.10) is known as the wave-zone.

When the dimensions of the system are small compared to the length of
the waves emitted by it, the expansion (86.12) converges rapidly, and its first
non-vanishing term will be the main one. If this term is $\mu_0$ the function $\psi$ will
be spherically symmetric.
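
For the spherically symmetric case one can check by finite differences that $\psi = \mu_0(\tau)/r$ with $\tau = t - r/c$ satisfies the homogeneous wave equation. This is a minimal sketch under assumptions not in the text: units with $c = 1$, the arbitrary choice $\mu_0(\tau) = \sin\omega\tau$, and the hypothetical names `psi` and `wave_operator_residual`.

```python
import math


def psi(t, r, c=1.0, omega=2.0):
    """Outgoing spherical wave psi = mu_0(tau)/r, tau = t - r/c, with mu_0 = sin(omega*tau)."""
    return math.sin(omega * (t - r / c)) / r


def wave_operator_residual(t, r, c=1.0, h=1e-3):
    """Finite-difference value of psi_rr + (2/r) psi_r - psi_tt/c^2 at the point (t, r).

    For a spherically symmetric function the Laplacian reduces to the first two terms.
    """
    d2r = (psi(t, r + h, c) - 2.0 * psi(t, r, c) + psi(t, r - h, c)) / h**2
    d1r = (psi(t, r + h, c) - psi(t, r - h, c)) / (2.0 * h)
    d2t = (psi(t + h, r, c) - 2.0 * psi(t, r, c) + psi(t - h, r, c)) / h**2
    return d2r + 2.0 * d1r / r - d2t / c**2


if __name__ == "__main__":
    # The residual should be tiny compared with the individual terms (of order omega^2 / r).
    print(wave_operator_residual(0.3, 10.0))
```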


87. The Gravitational Potentials in the Wave Zone 

We now turn to Einstein’s equations as stated in Section 68. As in (68.13) 
we introduce the quantities 

= (~) {II*. “3 n* e - \yy + \g^L} (87.01) 


where L is the Lagrangian 
£ = 


— + Ivy 


2 V(-5) aP 

and the other quantities involved are defined as follows : 

2a \ 


r-Sr + +qr-<r%) 




2 g V 8x p ' ° dx. 

a log v'(-S'). 


8x 


r = 


Then Einstein’s equations in harmonic coordinates appear as 


(87.02) 


(87.03) 

(87.04) 

(87.05) 


( ~)(R^ - y^R) = - JL g«3 a - 2 - 8 '". + T** (87.06) 

\ c 2 ) K 2 2c 2 a 8x a dXp c i v ’ 

We must investigate the asymptotic form assumed at large distances by the 
solutions of these equations. To do this we consider first the wave equation 


V(- 9) '□ + - 9 a3 = 0 (87.07) 

VJj rx 

and replace in it the coefficients $\mathfrak{g}^{\alpha\beta}$ by their “static” values; for simplicity we
shall assume that the origin of coordinates is at the mass centre of the system.
We can here use the expressions (85.41), (85.46) and (85.48) in which, however,
we discard all terms that decrease more rapidly than $1/r$ and of the static
terms we retain (for the time being) only those of order $1/r$. We introduce the
gravitational radius of a mass by the relation

$$\alpha = \frac{\gamma M}{c^2} \tag{87.08}$$


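and go over to the spherical polar coordinates which are connected with the
harmonic coordinates by the usual relations (57.03). Then we can write the
wave equation (87.07) in the form

$$\frac{1}{c^2}\left(1 + \frac{4\alpha}{r}\right)\frac{\partial^2\psi}{\partial t^2} - \left(\frac{\partial^2\psi}{\partial r^2} + \frac{2}{r}\frac{\partial\psi}{\partial r} + \frac{1}{r^2}\Delta^*\psi\right) = 0 \tag{87.09}$$

Here $\Delta^*\psi$ is the ordinary Laplacian on a sphere (see equation (57.06)). We are
interested in solutions of the nature of an outgoing wave. For such solutions
the term $\Delta^*\psi/r^2$ will decrease at infinity more rapidly than the remaining terms
in the equation and we can discard it. Then equation (87.09) becomes

$$\frac{1}{c^2}\left(1 + \frac{4\alpha}{r}\right)\frac{\partial^2\psi}{\partial t^2} - \left(\frac{\partial^2\psi}{\partial r^2} + \frac{2}{r}\frac{\partial\psi}{\partial r}\right) = 0 \tag{87.10}$$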


Here only $r$ and $t$ are independent variables; the angles $\vartheta$ and $\varphi$ enter merely
as parameters.

We make the substitution

$$r\psi = f \tag{87.11}$$

and we introduce in place of $r$ the variable

$$r^* = r + 2\alpha(\log r - \log r_0) \tag{87.12}$$

where $r_0$ is some constant. Then, apart from small quantities, we get

$$\frac{1}{c^2}\frac{\partial^2 f}{\partial t^2} - \frac{\partial^2 f}{\partial r^{*2}} = 0 \tag{87.13}$$

A solution of the nature of an outgoing wave will be

$$f = f(\tau, \mathbf{n}) \tag{87.14}$$

where, as before, $\mathbf{n}$ is the unit vector (86.09) and $\tau$ now has the value

$$\tau = t - \frac{1}{c}\,r^* \tag{87.15}$$

or

$$\tau = t - \frac{1}{c}\left(r + 2\alpha\log\frac{r}{r_0}\right) \tag{87.16}$$

Thus the solution of (87.09) which is of the form of an outgoing wave has the
asymptotic form

$$\psi = \frac{1}{r}\,f(\tau, \mathbf{n}) \tag{87.17}$$

Here $\tau$ is assumed to be finite while $r$ may be arbitrarily great.
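
The phrase “apart from small quantities” can be made concrete: with $r^*$ from (87.12) one has $dr^*/dr = 1 + 2\alpha/r$, whose square differs from the coefficient $1 + 4\alpha/r$ of (87.10) only by $4\alpha^2/r^2$. The following sketch is an illustration only; the function names are hypothetical, and units with $c = 1$ and $r_0 = 1$ are assumed.

```python
import math


def r_star(r, alpha, r0=1.0):
    """r* = r + 2*alpha*(log r - log r0), equation (87.12)."""
    return r + 2.0 * alpha * (math.log(r) - math.log(r0))


def retarded_argument(t, r, alpha, c=1.0, r0=1.0):
    """tau = t - r*/c, equation (87.15)."""
    return t - r_star(r, alpha, r0) / c


def coefficient_mismatch(r, alpha):
    """(dr*/dr)^2 minus the coefficient 1 + 4*alpha/r appearing in (87.10).

    Analytically this equals 4*alpha^2/r^2, the small quantity that is neglected.
    """
    dr_star_dr = 1.0 + 2.0 * alpha / r
    return dr_star_dr**2 - (1.0 + 4.0 * alpha / r)


if __name__ == "__main__":
    for r in (10.0, 100.0, 1000.0):
        print(r, coefficient_mismatch(r, alpha=1.0))
```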
Under these conditions the asymptotic values of the derivatives of $\psi$ with
respect to the space and time coordinates can be evaluated as if $\psi$ depended
on them only through the quantity $\tau$. Putting

$$k_\alpha = \frac{\partial \tau}{\partial x_\alpha} \tag{87.18}$$

we have in the approximation in question

$$\frac{\partial \psi}{\partial x_\alpha} = k_\alpha\,\frac{\partial \psi}{\partial \tau} \tag{87.19}$$

Neglecting terms of order $\alpha/r$ in comparison with unity we can take the $k_\alpha$
to be given by

$$k_0 = 1; \qquad k_i = -\frac{n_i}{c} \tag{87.20}$$

In agreement with this we may put

$$k^0 = \frac{1}{c^2}; \qquad k^i = \frac{n_i}{c} \tag{87.21}$$

whence

$$k_\alpha k^\alpha = 0 \tag{87.22}$$

The quantities $k_\alpha$ are proportional to the components of a four-dimensional
null vector (in the Galilean space corresponding to the limiting values of the
$g^{\mu\nu}$).

We now add to the static values of the $\mathfrak{g}^{\mu\nu}$ in the coefficients of the wave
equation (87.07) their wave part $b^{\mu\nu}$, putting

$$\mathfrak{g}^{00} = \frac{1}{c} + \frac{4\alpha}{cr} + b^{00}$$

$$\mathfrak{g}^{0i} = b^{0i} \tag{87.23}$$

$$\mathfrak{g}^{ik} = -c\,\delta_{ik} + b^{ik}$$

We shall assume of the $b^{\mu\nu}$ that in the wave zone they either have the form
(87.17) or, at any rate, that they satisfy the condition (87.19). Therefore we
can calculate the derivatives of the $b^{\mu\nu}$ according to the rule

$$\frac{\partial b^{\mu\nu}}{\partial x_\alpha} = k_\alpha\,\dot{b}^{\mu\nu} \tag{87.24}$$

Since the derivatives of the static terms in $\mathfrak{g}^{\mu\nu}$ will decrease as $1/r^2$ and may be
discarded, we also have

$$\frac{\partial \mathfrak{g}^{\mu\nu}}{\partial x_\alpha} = k_\alpha\,\dot{b}^{\mu\nu} \tag{87.25}$$

The harmonic conditions for the $\mathfrak{g}^{\mu\nu}$ appear as

$$k_\mu\,\dot{b}^{\mu\nu} = 0 \tag{87.26}$$

and they may be integrated with respect to $\tau$. Since we have already split
off the static part of the $\mathfrak{g}^{\mu\nu}$ the constants of integration may be set equal to
zero and we get

$$k_\mu\,b^{\mu\nu} = 0 \tag{87.27}$$
Hence the mixed and temporal components among the $b^{\mu\nu}$ can be expressed in
terms of the spatial components as follows:

$$b^{0i} = \frac{1}{c}\,n_k b^{ik}; \qquad b^{00} = \frac{1}{c^2}\,n_i n_k b^{ik} \tag{87.28}$$
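
That (87.28) really solves the conditions (87.27) with the $k_\alpha$ of (87.20) can be confirmed with a few lines of arithmetic. The sketch below is an illustration under assumptions: an arbitrary symmetric $3\times 3$ array stands in for $b^{ik}$, and the function names `wave_components` and `harmonic_residuals` are hypothetical.

```python
def wave_components(b_spatial, n, c=1.0):
    """Return (b00, [b01, b02, b03]) built from the spatial b^{ik} by equation (87.28)."""
    b0 = [sum(n[k] * b_spatial[i][k] for k in range(3)) / c for i in range(3)]
    b00 = sum(n[i] * n[k] * b_spatial[i][k]
              for i in range(3) for k in range(3)) / c**2
    return b00, b0


def harmonic_residuals(b_spatial, n, c=1.0):
    """Values of k_mu b^{mu nu} for nu = 0, 1, 2, 3, using k_0 = 1, k_i = -n_i/c (87.20)."""
    b00, b0 = wave_components(b_spatial, n, c)
    res_time = b00 - sum(n[i] * b0[i] for i in range(3)) / c
    res_space = [b0[i] - sum(n[k] * b_spatial[k][i] for k in range(3)) / c
                 for i in range(3)]
    return [res_time] + res_space


if __name__ == "__main__":
    b = [[1.0, 2.0, 3.0],
         [2.0, -1.0, 0.5],
         [3.0, 0.5, 4.0]]            # arbitrary symmetric test values
    n = (0.6, 0.0, 0.8)              # unit vector
    print(harmonic_residuals(b, n, c=3.0))
```

All four residuals vanish to rounding error, so only the six spatial components $b^{ik}$ remain independent.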


The more precise form of V(~9)l c times the d’Alembertian which may be 
obtained by inserting the values (87.23) for the g 11 * into (87.07) now appears as 

= + <87 - 29) 

But if $\psi$ is of the nature of a diverging wave which depends on the space and
time coordinates mainly through t and which satisfies (87.19), we have 

as a result of (87.27). Therefore the additional term in the d’Alembertian will 
vanish separately and we can discuss the outgoing wave solution as if the 
coefficients of the d’Alembertian in the wave equation (87.07) and in Einstein’s 
equations (87.06) involved only their static parts. By this we have, so to speak, 
freed ourselves from the non-linear parts of those terms in Einstein’s equations 
which contain second derivatives. 

We now pass on to examine the non-linear terms containing first derivatives.
We cannot discard them straightaway because in the wave zone they decrease
no more rapidly than $1/r^2$, so that they may strongly influence the asymptotic
form of the solution. But when evaluating them we can introduce the
simplifications that follow from equation (87.25) and from the fact that outside the
differentiation signs the components of the metric tensor may be replaced by
their limiting values. Raising and lowering indices by means of these limiting
values we can write for instance


k* = ,7°%; b* = gjbf* etc. 
just as if we were dealing with tensors. Using the rigorous relation 


we can then write 


1 

tVi-g) 9 *' ar a 



(87.31) 

(87.32) 

(87.33) 


The quantities (87.03) and (87.04) will then be given by 

= -1 (H>^ + wb* - irb#) (87.34) 

and ( k J>l + hk ~ (87.35) 

It follows from (87.22) and (87.26) that 

Wj = °; yy = 0 (87.36) 

Therefore, in the wave zone the Lagrangian has the value zero 

L = 0 (87.37) 

just as for electromagnetic waves. 
If we form products of the three terms in (87.34) with the three terms of
(87.35) we get nine sums, of which, however, only one differs from zero, namely 

n^II^ = ^ (87.38) 

Inserting the expressions obtained into (87.01) and replacing the factor $(-g/c^2)$
by unity, we get

w* = i ( 87 - 39 ) 

Here the following properties of are important. In the first place it is 
proportional to the product ky-k' 1 and when it is inserted into Einstein’s equation 
(87.06) it can give rise only to such terms in g>“ and g liV , and also in and 
6 UV , which are also proportional to this product. But if one augments 6 !iV by 
terms proportional to kW, i.e. if one makes the substitution 

b°» + Xi W (87.40) 

then, by virtue of the harmonic conditions (87.26) and of (87.22), there is no
change in $N^{\mu\nu}$. As a consequence the problem of determining the $\mathfrak{g}^{\mu\nu}$, including
the effects of non-linear terms, reduces to the solution of linear equations.

We put 

- - ^ - m) < 8 ™> 

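As will be seen below, this quantity plays the role of the energy density of
gravitational waves. Equation (87.39) then appears as

$$N^{\mu\nu} = \frac{8\pi\gamma}{c^2}\,\sigma_g\,k^\mu k^\nu \tag{87.42}$$

Inserting this equation into Einstein’s equations (87.06) we obtain

$$\frac{1}{2c}\,\Box\,\mathfrak{g}^{\mu\nu} = \frac{8\pi\gamma}{c^2}\left(T^{\mu\nu} + \sigma_g\,k^\mu k^\nu\right) \tag{87.43}$$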

In forming the mass tensor in earlier sections we entirely neglected
electromagnetic energy. But we may now take into account the energy of the
electromagnetic radiation of a system of bodies. Denoting its density by $\sigma_{em}$, so that

$$\sigma_{em} = \frac{1}{8\pi}\,(E^2 + H^2) \tag{87.44}$$

we get for the electromagnetic energy tensor in the wave zone the expression

$$T^{\mu\nu} = \sigma_{em}\,k^\mu k^\nu \tag{87.45}$$

(We are retaining the symbol $T^{\mu\nu}$ for this tensor because in the wave zone the
entire mass tensor reduces to (87.45), the part corresponding to the material
medium being zero.)

Thus we can write Einstein’s equations for the wave zone in the form

$$\frac{1}{2c}\,\Box\,\mathfrak{g}^{\mu\nu} = \frac{8\pi\gamma}{c^2}\,\sigma\,k^\mu k^\nu \tag{87.46}$$
where $\sigma$ can be taken as the sum of $\sigma_{em}$ and $\sigma_g$ if we include electromagnetic
radiation, and simply as $\sigma_g$ if we deal with the purely gravitational problem.

To solve (87.46) we split off the term in $b^{\mu\nu}$ which is proportional to $k^\mu k^\nu$,
writing $b^{\mu\nu}$ as

$$b^{\mu\nu} = h^{\mu\nu} + h\,k^\mu k^\nu \tag{87.47}$$

As has already been noted, the expression (87.41) for $\sigma_g$ does not change if
$b^{\mu\nu}$ is replaced by $h^{\mu\nu}$, so that we have

*= - * w (8L48) 

The relations (87.27) and (87.28) also remain valid; we may write

$$k_\mu h^{\mu\nu} = 0 \tag{87.49}$$

The quantities $h^{\mu\nu}$ and $h$ appearing in (87.47) can be subjected to the
conditions

$$\Box\,h^{\mu\nu} = 0 \tag{87.50}$$

and

$$\Box\,h = \frac{16\pi\gamma}{c}\,\sigma \tag{87.51}$$

in which, according to the remark made above, we may replace the complete
wave operator $\Box$ by the “static” wave operator $\Box^0$ (equation (87.09)).

Then the $h^{\mu\nu}$ satisfy a linear wave equation and by (87.17) we can require
that they have the asymptotic form

$$h^{\mu\nu} = \frac{2\gamma}{c^3 r}\,f^{\mu\nu}(\tau, \mathbf{n}) \tag{87.52}$$

The coefficient $2\gamma/c^3$ has been introduced for convenience in estimating the
order of magnitude of the $f^{\mu\nu}$; the spatial components $f^{ik}$ will be of the order
of the kinetic energy of the system.

Inserting these values for $h^{\mu\nu}$ into (87.48) we obtain an expression for $\sigma_g$
that is inversely proportional to $r^2$. The $r$-dependence of the electromagnetic
part of the energy density will be of the same form. Therefore the total energy
density $\sigma$ in (87.51) will also be of the form

$$\sigma = \frac{\sigma_0(\tau, \mathbf{n})}{r^2} \tag{87.53}$$


The gravitational part $\sigma_{0g}$ of the function $\sigma_0$ can be expressed in terms of the
quantities $f^{\mu\nu}$ by a relation analogous to (87.48), namely


% = - ihm 


(87.54) 


Inserting the expression (87.53) for $\sigma$ into (87.51) we get for $h$ the equation

$$\Box\,h = \frac{16\pi\gamma}{c r^2}\,\sigma_0(\tau, \mathbf{n}) \tag{87.55}$$
The right-hand side of this equation can be considered to be known. Thus not
only the $h^{\mu\nu}$ but also $h$ satisfies a linear equation. The asymptotic form of the
solution of this equation will be

$$h = \frac{8\pi\gamma \log r}{r}\int_{\tau_0}^{\tau} \sigma_0(\tau, \mathbf{n})\,d\tau + \frac{1}{r}\,h_0(\tau, \mathbf{n}) \tag{87.56}$$

Let us put

$$4\pi c\int_{\tau_0}^{\tau} \sigma_0(\tau, \mathbf{n})\,d\tau = \Delta\mathcal{E}(\tau, \mathbf{n}) \tag{87.57}$$

If this expression is multiplied by $d\Omega/4\pi$, where $d\Omega$ is an element of solid angle,
it gives the flow of energy in the time $\tau - \tau_0$ into such a solid angle situated
in the direction $\mathbf{n}$. The result of inserting (87.57) into the previous equation
may be stated in the form

$$h = \frac{2\gamma}{c r}\,\bigl[\log r\,\Delta\mathcal{E}(\tau, \mathbf{n}) + \varepsilon(\tau, \mathbf{n})\bigr] \tag{87.58}$$

where the quantity $\varepsilon(\tau, \mathbf{n})$ may be taken as of the same order as $\Delta\mathcal{E}(\tau, \mathbf{n})$ or
alternatively it may be omitted, the corresponding terms in the equations for
$\mathfrak{g}^{\mu\nu}$ being included in $f^{\mu\nu}$.

Inserting the values (87.52) and (87.58) for $h^{\mu\nu}$ and $h$ first into equation
(87.47) for $b^{\mu\nu}$ and then into equation (87.23) for $\mathfrak{g}^{\mu\nu}$, we obtain the following
asymptotic expressions for the $\mathfrak{g}^{\mu\nu}$ in the wave zone.


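$$\mathfrak{g}^{00} = \frac{1}{c} + \frac{4\gamma M}{c^3 r} + \frac{2\gamma}{c^3 r}\,f^{00} + \frac{2\gamma}{c^5 r}\,(\log r\,\Delta\mathcal{E} + \varepsilon)$$

$$\mathfrak{g}^{0i} = \frac{2\gamma}{c^3 r}\,f^{0i} + \frac{2\gamma}{c^4 r}\,n_i\,(\log r\,\Delta\mathcal{E} + \varepsilon) \tag{87.59}$$

$$\mathfrak{g}^{ik} = -c\,\delta_{ik} + \frac{2\gamma}{c^3 r}\,f^{ik} + \frac{2\gamma}{c^3 r}\,n_i n_k\,(\log r\,\Delta\mathcal{E} + \varepsilon)$$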


Here the quantities $f^{0i}$ and $f^{00}$ are expressible in terms of the $f^{ik}$ by equations
analogous to (87.28), namely

$$f^{0i} = \frac{1}{c}\,n_k f^{ik}; \qquad f^{00} = \frac{1}{c^2}\,n_i n_k f^{ik} \tag{87.60}$$

Let us compare the expression for the $\mathfrak{g}^{\mu\nu}$ in the wave zone with the
corresponding expressions for moderately large distances as obtained in Section 85.
In doing this we can first of all neglect terms involving $\log r$ and arising from
the outflow of energy. As for the remaining terms, we can write them in such
a form that they go over into corresponding terms in equations (85.41), (85.46)
and (85.48). To do this we put

$$D_{ik}(t) = \int \rho\,x_i x_k\,(dx)^3 \tag{87.61}$$
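and then replace $t$ by $\tau$ and relate $f^{ik}$ to $D_{ik}(\tau)$ by

$$f^{ik} = \frac{d^2 D_{ik}(\tau)}{d\tau^2} \tag{87.62}$$

Then the equations

$$\mathfrak{g}^{00} = \frac{1}{c} + \frac{4\gamma M}{c^3 r} + \frac{2\gamma}{c^3}\frac{\partial^2}{\partial x_i\,\partial x_k}\frac{D_{ik}(\tau)}{r}$$

$$\mathfrak{g}^{0i} = -\frac{2\gamma}{c^3}\frac{\partial^2}{\partial x_k\,\partial t}\frac{D_{ik}(\tau)}{r} \tag{87.63}$$

$$\mathfrak{g}^{ik} = -c\,\delta_{ik} + \frac{2\gamma}{c^3}\frac{\partial^2}{\partial t^2}\frac{D_{ik}(\tau)}{r}$$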


go over in the wave zone into (87.59), apart from terms arising from the
outflow of energy, while for moderately large distances they give the main terms
of equations (85.41), (85.46) and (85.48). At moderately large distances the
logarithmic term in the expression (87.15) for $\tau$ may be omitted, i.e. $r^*$ may
be replaced by $r$.


88. Some General Remarks on the Conservation Laws 

Gravitational energy plays a very special role in the theory of gravitation,
which is quite different from the role of all other forms of energy. It does not
enter explicitly into the energy tensor but is accounted for indirectly through
the gravitational potentials. The presence of a gravitational field and of the
energy related to it reveals itself, as we know, in a change of the properties of
space and time. Gravitational energy can be separated out in the form of
additional terms in the energy tensor only in an artificial manner by fixing
the coordinate system and reformulating the problem in such a way that the
gravitational field is taken to be superimposed on a space-time of fixed
properties, just as is done in Newtonian theory. The additional terms in the
energy tensor that correspond to gravitational energy do not possess the
property of covariance (i.e. they do not form a tensor). This is quite
understandable because of the lack of uniqueness in the choice of the space of fixed
properties into which the gravitational field is embedded. On the other hand,
the difficulties that arise in the question of determining gravitational energy
uniquely can be related to the fact that it cannot be localized. This property
of gravitational energy shows itself physically in the fact that a gravitational
field cannot be screened. The only way to avoid the action of a gravitational
field is to remove oneself further from the masses producing it. This can be
done with an isolated system of masses. The gravitational field produced by
such a system can be regarded as a local non-uniformity in infinite Euclidean
space (or in Galilean space-time). With this form of description it is possible
(neglecting radiation, of which more will be said below) to form the ten integrals
of Einstein’s equations which correspond to the ten classical integrals of the
equations of mechanics. Four of these, the integrals of energy and momentum,
form a four-dimensional vector in the Galilean space-time in which the system
of masses with its gravitational field is embedded. The remaining six integrals
form an anti-symmetric tensor in the same space-time; they are the integrals of
angular momentum and of the motion of the mass centre of the system.

It is important to note that the existence of the ten integrals of motion is
related to the uniformity and isotropy of the Galilean space-time in which
the system is embedded. If one gives up the requirement of uniformity and of
Euclidean character at infinity this brings about a violation of all or some of
the conservation laws of which the ten integrals of motion are the expression.
From a physical point of view this is perfectly natural, for the isotropy and the
Euclidean character at infinity express the fact that the system is isolated,
and one can expect the conservation laws to hold only if the system is isolated.

One should note another reason why a system of moving masses can never
be completely isolated in an active sense (i.e. in the sense of giving out energy
rather than receiving it). This reason is provided by the radiation of
electromagnetic and of gravitational waves by the system and, possibly, also of other
kinds of waves. However, for a system like the Solar System the loss of energy
by radiation, even over geological periods, is very small in comparison with the
available store of energy. This is so in spite of the fact that in absolute measure
the loss appears an impressive one (for the Sun the radiative power corresponds
to a transmutation of four million tons of matter into radiation per second).
As for the radiation of gravitational waves, it is completely negligible: a rough
estimate using equations based on the results of the preceding section shows
that the power radiated by the Solar System in the form of gravitational
waves is $10^{23}$ or $10^{24}$ times less than that radiated electromagnetically and is,
all in all, about a single kilowatt. (Such an estimate will be made in Section 90.)
Therefore it is permissible to neglect the action of gravitational waves in all
but, possibly, purely theoretical considerations. This result shows in particular
that the problem studied in Chapter VI of the motion of systems of gravitating
masses may be considered as a problem in mechanics, disregarding radiation,
not only in the approximation in which it was solved there but also in further
approximations, which otherwise present no practical interest.
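
The orders of magnitude quoted above are easy to check. The sketch below is only a consistency check under stated assumptions: the solar luminosity value `L_SUN` is a nominal modern figure that is not given in the text, the gravitational-wave power is taken as the “single kilowatt” quoted above, and the function names are hypothetical.

```python
C = 299_792_458.0      # speed of light in m/s
L_SUN = 3.828e26       # ASSUMED nominal solar luminosity in W (not from the text)
P_GRAV = 1.0e3         # gravitational-wave power quoted in the text: about one kilowatt


def mass_loss_rate(luminosity, c=C):
    """Mass converted into radiation per second, dm/dt = L / c^2, in kg/s."""
    return luminosity / c**2


def power_ratio(luminosity, p_grav=P_GRAV):
    """Ratio of electromagnetic to gravitational radiated power."""
    return luminosity / p_grav


if __name__ == "__main__":
    print("mass loss, million tonnes per second:", mass_loss_rate(L_SUN) / 1e9)
    print("electromagnetic / gravitational power:", power_ratio(L_SUN))
```

With these assumed inputs the mass loss comes out at a little over four million tonnes per second and the power ratio lies between $10^{23}$ and $10^{24}$, in line with the figures in the text.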

89. Formulation of the Conservation Laws 

In a theory working with Galilean space-time the classical conservation
laws can be stated in differential form, namely in the form of relations (31.10).
The generalization of these relations is

$$\nabla_\nu T^{\mu\nu} = 0 \tag{89.01}$$

and, according to (41.25), this can also be written as

$$\frac{1}{\sqrt{-g}}\frac{\partial\left(\sqrt{-g}\,T^{\mu\nu}\right)}{\partial x_\nu} + \Gamma^{\mu}_{\alpha\beta}\,T^{\alpha\beta} = 0 \tag{89.02}$$

However the relations (89.01) do not by themselves lead to conservation laws.
The mathematical reason for this is the presence of the second term, standing
outside the sign of differentiation, owing to which one cannot, in general,
conclude that any quantities of the form of volume integrals are constant.
(See Section 49.) The physical reason is the fact that the gravitational field
itself possesses energy which does not enter $T^{\mu\nu}$ explicitly, but which must,
nevertheless, be included in the general balance.
To obtain a relation in which gravitational energy is explicitly included we
consider Einstein’s gravitational equations as stated at the beginning of 
Section 87. 

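We have

$$\frac{-g}{c^2}\left(R^{\mu\nu} - \tfrac{1}{2}\,g^{\mu\nu}R\right) = -\frac{1}{2c^2}\,\mathfrak{g}^{\alpha\beta}\frac{\partial^2 \mathfrak{g}^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} + N^{\mu\nu} \tag{89.03}$$

where $N^{\mu\nu}$ is determined by equations (87.01) to (87.05). On the other hand
we have, in a harmonic coordinate system,

$$\frac{\partial^2}{\partial x_\alpha\,\partial x_\beta}\left(\mathfrak{g}^{\alpha\beta}\mathfrak{g}^{\mu\nu} - \mathfrak{g}^{\alpha\mu}\mathfrak{g}^{\beta\nu}\right) = \mathfrak{g}^{\alpha\beta}\frac{\partial^2 \mathfrak{g}^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} - \frac{\partial \mathfrak{g}^{\alpha\mu}}{\partial x_\beta}\frac{\partial \mathfrak{g}^{\beta\nu}}{\partial x_\alpha} \tag{89.04}$$

Multiplying this equation by $1/2c^2$ and adding it to the previous one we get

$$\frac{-g}{c^2}\left(R^{\mu\nu} - \tfrac{1}{2}\,g^{\mu\nu}R\right) + \frac{1}{2c^2}\frac{\partial^2}{\partial x_\alpha\,\partial x_\beta}\left(\mathfrak{g}^{\alpha\beta}\mathfrak{g}^{\mu\nu} - \mathfrak{g}^{\alpha\mu}\mathfrak{g}^{\beta\nu}\right) = L^{\mu\nu} \tag{89.05}$$

where

$$L^{\mu\nu} = N^{\mu\nu} - \frac{1}{2c^2}\frac{\partial \mathfrak{g}^{\alpha\mu}}{\partial x_\beta}\frac{\partial \mathfrak{g}^{\beta\nu}}{\partial x_\alpha} \tag{89.06}$$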

Equation (89.05) would be an identity in an arbitrary coordinate system if
$L^{\mu\nu}$ was understood to mean


r„v _ yu» I 1 / g 9 atl 8 9 pv gg ap ^ eg* 1 *^ 

2 c 2 \ 8x„ 8x 8x R 8x„ 


8x a 8x &/ 


(89.07) 


This relation goes over into the preceding one if the coordinate system is
harmonic. To prove equation (89.05) one has to use the complete expression
(D.87) for the Einstein tensor which is derived in Appendix D.

Using Einstein’s equations

$$R^{\mu\nu} - \tfrac{1}{2}\,g^{\mu\nu}R = -\frac{8\pi\gamma}{c^2}\,T^{\mu\nu} \tag{89.08}$$

and putting

$$U^{\mu\nu} = \frac{-g}{c^2}\,T^{\mu\nu} + \frac{c^2}{8\pi\gamma}\,L^{\mu\nu} \tag{89.09}$$

we can write equation (89.05) in the form

$$\frac{\partial^2}{\partial x_\alpha\,\partial x_\beta}\left(\mathfrak{g}^{\alpha\beta}\mathfrak{g}^{\mu\nu} - \mathfrak{g}^{\alpha\mu}\mathfrak{g}^{\beta\nu}\right) = 16\pi\gamma\,U^{\mu\nu} \tag{89.10}$$

Here we have on the left an expression of the form of the Krutkov tensor
discussed in Section 31; the sum of the derivatives with respect to $x_\nu$ of the
left-hand side of (89.10) is identically zero. We therefore have

$$\frac{\partial U^{\mu\nu}}{\partial x_\nu} = 0 \tag{89.11}$$
The set of quantities $U^{\mu\nu}$ is not a generally covariant tensor. It is a tensor
only with respect to linear transformations; in particular $U^{\mu\nu}$ is a tensor in a
harmonic coordinate system. Adopting the somewhat artificial point of view
that was described at the beginning of the preceding section, the second term
in (89.09) multiplied by $c^2$ can be interpreted as the energy tensor of the
gravitational field and the first term multiplied by $c^2$ as the energy tensor of the material
media and all fields other than gravitational. If other than harmonic systems
of coordinates are also admitted such an interpretation will not be completely
unique if the region of the mass system is considered. But at large distances
from the masses, where space-time is nearly pseudo-Euclidean and the
coordinates are, by choice, Galilean, the physical meaning of the $U^{\mu\nu}$ becomes
unique in any case.

The conservation laws in integral form which follow from (89.11) are also
obtained uniquely and do not depend on the arbitrariness connected with the
deviation of the coordinate system from the harmonic within the region of
masses. As we shall see, this is due to the fact that the volume integrals
expressing the energy, the momentum and other quantities may be
transformed into integrals over surfaces surrounding the masses.

We now go on to derive the integral form of the conservation laws. To do
this we multiply the left-hand side of (89.11) by the Euclidean volume element

$$dx_1\,dx_2\,dx_3 = (dx)^3 \tag{89.12}$$

and integrate over some sufficiently large volume, including the system of
masses. For the time being we leave the dimensions of the region of integration
undetermined.

Using the Gauss-Ostrogradsky theorem we obtain equations of the form

$$\frac{d}{dt}\int U^{00}\,(dx)^3 = -\int n_i\,U^{0i}\,dS \tag{89.13}$$

and

$$\frac{d}{dt}\int U^{0i}\,(dx)^3 = -\int n_k\,U^{ik}\,dS \tag{89.14}$$

where $n_i$ is the unit vector normal to the surface. If the boundary of the volume
of integration is a sphere, we may put

$$n_i = \frac{x_i}{r}; \qquad dS = r^2\,d\omega \tag{89.15}$$

where $d\omega$ is the element of solid angle.

As a consequence of the symmetry of the $U^{\mu\nu}$ in the indices $\mu$ and $\nu$, equations
(89.11) also give rise to relations analogous to (31.06) and (31.07) which give,
after integration,

$$\frac{d}{dt}\int (x_i U^{0k} - x_k U^{0i})\,(dx)^3 = -\int n_j\,(x_i U^{jk} - x_k U^{ji})\,dS \tag{89.16}$$

and

$$\frac{d}{dt}\int (x_i U^{00} - t\,U^{0i})\,(dx)^3 = -\int n_j\,(x_i U^{j0} - t\,U^{ji})\,dS \tag{89.17}$$
We discuss the physical significance of the volume integrals on the
left-hand sides of these equations. We put†

$$\overset{*}{M} = c^2 \int U^{00}\,(dx)^3 \tag{89.18}$$

$$\overset{*}{P}{}^{i} = c^2 \int U^{0i}\,(dx)^3 \tag{89.19}$$

$$\overset{*}{M}{}^{ik} = c^2 \int (x_i U^{0k} - x_k U^{0i})\,(dx)^3 \tag{89.20}$$

and

$$\overset{*}{M}{}^{i0} = c^2 \int (x_i U^{00} - t\,U^{0i})\,(dx)^3 \tag{89.21}$$

(We have written asterisks above the quantities (89.18) to (89.21) so that they
should not be confused with the constants introduced when solving the equations
of mechanics.) The quantity $\overset{*}{M}$ is the total mass of the system, including
the mass pertaining to the field enclosed within the volume chosen. The $\overset{*}{P}{}^{i}$
represent the momentum and the $\overset{*}{M}{}^{ik}$ the angular momentum of the system.
The $\overset{*}{M}{}^{i0}$ can be written in the form

$$\overset{*}{M}{}^{i0} = \overset{*}{M}\,\overset{*}{X}{}^{i} - \overset{*}{P}{}^{i}\,t \tag{89.22}$$

where the $\overset{*}{X}{}^{i}$ are the mass centre coordinates of the system. It is therefore
clear that the $\overset{*}{M}{}^{i0}$ are the quantities involved in the law of mass centre motion.

The value of the volume integrals (89.18) to (89.21) may change somewhat
with the size of the region of integration. This is so because the field also
possesses energy, momentum, etc. However it will be made clear in Section 90
that with an appropriate choice of the region of integration the resulting
indeterminateness in the value of the integrals is negligible in comparison with
their total value.

We show now that not only the time derivatives of the quantities (89.18) to 
(89.21), but the quantities themselves may be represented as surface integrals. 
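To demonstrate this we consider the structure of the expression (89.10) for
$U^{\mu\nu}$. Putting $\mu = \nu = 0$ we get

$$16\pi\gamma\,U^{00} = \frac{\partial^2}{\partial x_\alpha\,\partial x_\beta}\left(\mathfrak{g}^{\alpha\beta}\mathfrak{g}^{00} - \mathfrak{g}^{\alpha 0}\mathfrak{g}^{\beta 0}\right) \tag{89.23}$$

For $\alpha = \beta = 0$ and also for $\alpha = 0$, $\beta = i$ (where $i = 1, 2, 3$) the expression on
the right becomes zero. Therefore the indices $\alpha$ and $\beta$ in (89.23) actually take
on only spatial values and we can write

$$16\pi\gamma\,U^{00} = \frac{\partial^2}{\partial x_i\,\partial x_k}\left(\mathfrak{g}^{ik}\mathfrak{g}^{00} - \mathfrak{g}^{i0}\mathfrak{g}^{k0}\right) \tag{89.24}$$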

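whence

$$\overset{*}{M} = \frac{c^2}{16\pi\gamma}\int n_i\,\frac{\partial}{\partial x_k}\left(\mathfrak{g}^{ik}\mathfrak{g}^{00} - \mathfrak{g}^{i0}\mathfrak{g}^{k0}\right) dS \tag{89.25}$$

Similarly we have

$$16\pi\gamma\,U^{0i} = \frac{\partial^2}{\partial x_j\,\partial x_\alpha}\left(\mathfrak{g}^{\alpha j}\mathfrak{g}^{0i} - \mathfrak{g}^{\alpha 0}\mathfrak{g}^{ji}\right) \tag{89.26}$$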





† We write $M^{ik}$, $P^{i}$ and $X^{i}$ with upper indices.
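where $j$ takes on only spatial values, but the values of $\alpha$ include zero. (We
note that the value $\alpha = i$ actually drops out.) Insertion of (89.26) into the
volume integral (89.19) gives

$$\overset{*}{P}{}^{i} = \frac{c^2}{16\pi\gamma}\int n_j\,\frac{\partial}{\partial x_\alpha}\left(\mathfrak{g}^{\alpha j}\mathfrak{g}^{0i} - \mathfrak{g}^{\alpha 0}\mathfrak{g}^{ji}\right) dS \tag{89.27}$$

From (89.26) it is easy to derive the relation

$$16\pi\gamma\,(x_i U^{0k} - x_k U^{0i}) = \frac{\partial}{\partial x_j}\left\{ x_i\,\frac{\partial}{\partial x_\alpha}\left(\mathfrak{g}^{\alpha j}\mathfrak{g}^{0k} - \mathfrak{g}^{\alpha 0}\mathfrak{g}^{jk}\right) - x_k\,\frac{\partial}{\partial x_\alpha}\left(\mathfrak{g}^{\alpha j}\mathfrak{g}^{0i} - \mathfrak{g}^{\alpha 0}\mathfrak{g}^{ji}\right) + \mathfrak{g}^{jk}\mathfrak{g}^{0i} - \mathfrak{g}^{ji}\mathfrak{g}^{0k} \right\} \tag{89.28}$$

the right-hand side of which represents a sum of derivatives with respect to
space coordinates. Inserting (89.28) into (89.20) and using the Gauss-Ostrogradsky
theorem we obtain an expression for the angular momentum in the
form of a surface integral:

$$\overset{*}{M}{}^{ik} = \frac{c^2}{16\pi\gamma}\int n_j\left\{ x_i\,\frac{\partial}{\partial x_\alpha}\left(\mathfrak{g}^{\alpha j}\mathfrak{g}^{0k} - \mathfrak{g}^{\alpha 0}\mathfrak{g}^{jk}\right) - x_k\,\frac{\partial}{\partial x_\alpha}\left(\mathfrak{g}^{\alpha j}\mathfrak{g}^{0i} - \mathfrak{g}^{\alpha 0}\mathfrak{g}^{ji}\right) + \mathfrak{g}^{jk}\mathfrak{g}^{0i} - \mathfrak{g}^{ji}\mathfrak{g}^{0k} \right\} dS \tag{89.29}$$

Finally, the first term in the expression (89.22) for $\overset{*}{M}{}^{i0}$ may be written as

$$\overset{*}{M}\,\overset{*}{X}{}^{i} = \frac{c^2}{16\pi\gamma}\int n_j\left\{ x_i\,\frac{\partial}{\partial x_k}\left(\mathfrak{g}^{jk}\mathfrak{g}^{00} - \mathfrak{g}^{j0}\mathfrak{g}^{k0}\right) + \mathfrak{g}^{j0}\mathfrak{g}^{i0} - \mathfrak{g}^{ji}\mathfrak{g}^{00} \right\} dS \tag{89.30}$$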

Thus all the quantities (89.18) to (89.21) have been represented as surface 
integrals. The value of these integrals depends only on the behaviour of the 
gravitational potentials at large distances. 

When writing down the conservation laws in differential form we have 
used the symmetry of the set† of quantities $U^{\mu\nu}$ defined by (89.09); as we have
indicated, these quantities represent an analogue of a contravariant symmetric
tensor. In the literature it is, however, more usual, since the earliest papers
of Einstein, to employ a different set of quantities which are the analogue of a
mixed (non-symmetrical) tensor and which are defined as follows.‡

At the end of Section 60 we derived an equation for the variation of the
action integral, according to which

$$\delta\int L\sqrt{-g}\,(dx) = \int \left(R^{\mu\nu} - \tfrac12 g^{\mu\nu}R\right)\delta g_{\mu\nu}\,\sqrt{-g}\,(dx) \tag{89.31}$$


† A different, but also symmetric, set of quantities is used in the book by Landau and
Lifshitz, The Classical Theory of Fields [23].

‡ The reader not specially interested in comparing our form of the conservation laws
with Einstein’s may skip the rather tedious calculations from here to the end of Section 89.
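Here $L$ is the Lagrangian (60.23) and $(dx)$ stands for the product of four
differentials

$$(dx) = dx_0\,dx_1\,dx_2\,dx_3 \tag{89.32}$$

Since

$$\delta g_{\mu\nu} = -g_{\mu\alpha}\,g_{\nu\beta}\,\delta g^{\alpha\beta} \tag{89.33}$$

equation (89.31) may also be written as

$$\delta\int L\sqrt{-g}\,(dx) = -\int \left(R_{\mu\nu} - \tfrac12 g_{\mu\nu}R\right)\delta g^{\mu\nu}\,\sqrt{-g}\,(dx) \tag{89.34}$$

Putting

$$g^{\mu\nu}_{\sigma} = \frac{\partial g^{\mu\nu}}{\partial x_\sigma} \tag{89.35}$$

we can define the “ partial derivative ” with respect to $g^{\mu\nu}$ and to $g^{\mu\nu}_{\sigma}$ by the
equation

$$\delta[L\sqrt{-g}] = \frac{\partial[L\sqrt{-g}]}{\partial g^{\mu\nu}}\,\delta g^{\mu\nu} + \frac{\partial[L\sqrt{-g}]}{\partial g^{\mu\nu}_{\sigma}}\,\delta g^{\mu\nu}_{\sigma} \tag{89.36}$$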

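Insertion of (89.36) into (89.34) and integration by parts shows that the
conservative tensor

$$G_{\mu\nu} = R_{\mu\nu} - \tfrac12 g_{\mu\nu}R \tag{89.37}$$

multiplied by $\sqrt{-g}$ may be written as

$$\sqrt{-g}\cdot G_{\mu\nu} = \frac{\partial}{\partial x_\sigma}\frac{\partial[L\sqrt{-g}]}{\partial g^{\mu\nu}_{\sigma}} - \frac{\partial[L\sqrt{-g}]}{\partial g^{\mu\nu}} \tag{89.38}$$

We now introduce a set of quantities $w^{\sigma}_{\rho}$ defined by the equation

$$2\sqrt{-g}\cdot w^{\sigma}_{\rho} = g^{\mu\nu}_{\rho}\,\frac{\partial[L\sqrt{-g}]}{\partial g^{\mu\nu}_{\sigma}} - \delta^{\sigma}_{\rho}\,[L\sqrt{-g}] \tag{89.39}$$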


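Forming the sum of derivatives of (89.39) with respect to the $x_\sigma$ and using
(89.38) we get

$$\frac{1}{\sqrt{-g}}\,\frac{\partial[\sqrt{-g}\cdot w^{\sigma}_{\rho}]}{\partial x_\sigma} = \frac12\,\frac{\partial g^{\mu\nu}}{\partial x_\rho}\,G_{\mu\nu} \tag{89.40}$$

But for any symmetric tensor we have

$$\frac12\,\frac{\partial g^{\mu\nu}}{\partial x_\rho}\,G_{\mu\nu} = -\frac12\,\frac{\partial g_{\mu\nu}}{\partial x_\rho}\,G^{\mu\nu} = -\Gamma^{\nu}_{\rho\sigma}\,G^{\sigma}_{\nu} \tag{89.41}$$

Therefore the preceding equation may be written as

$$\frac{1}{\sqrt{-g}}\,\frac{\partial[\sqrt{-g}\cdot w^{\sigma}_{\rho}]}{\partial x_\sigma} = -\Gamma^{\nu}_{\rho\sigma}\,G^{\sigma}_{\nu} \tag{89.42}$$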

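On the other hand, the divergence of the conservative tensor (which vanishes
identically) has the form

$$\nabla_\sigma G^{\sigma}_{\rho} \equiv \frac{1}{\sqrt{-g}}\,\frac{\partial[\sqrt{-g}\cdot G^{\sigma}_{\rho}]}{\partial x_\sigma} - \Gamma^{\nu}_{\rho\sigma}\,G^{\sigma}_{\nu} = 0 \tag{89.43}$$

Here the second term coincides with the right-hand side of (89.42). Eliminating
it from (89.42) and (89.43) we get

$$\frac{\partial}{\partial x_\sigma}\left[\sqrt{-g}\cdot G^{\sigma}_{\rho} + \sqrt{-g}\cdot w^{\sigma}_{\rho}\right] = 0 \tag{89.44}$$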
We now express $G^{\sigma}_{\rho}$ in terms of $T^{\sigma}_{\rho}$ by the gravitational equations and put

$$-w^{\sigma}_{\rho} = \frac{8\pi\gamma}{c^2}\,t^{\sigma}_{\rho} \tag{89.45}$$

Then we can write the preceding equation as

$$\frac{\partial}{\partial x_\sigma}\left[\sqrt{-g}\cdot T^{\sigma}_{\rho} + \sqrt{-g}\cdot t^{\sigma}_{\rho}\right] = 0 \tag{89.46}$$

This equation represents the differential form of the conservation law as it is
most frequently stated in the literature. The quantity

$$\overset{*}{U}{}^{\sigma}_{\rho} = \frac{\sqrt{-g}}{c}\left(T^{\sigma}_{\rho} + t^{\sigma}_{\rho}\right) \tag{89.47}$$

corresponds to our $U^{\mu\nu}$, but it is not symmetric in its indices. Therefore it is
not possible to derive from the relation

$$\frac{\partial \overset{*}{U}{}^{\sigma}_{\rho}}{\partial x_\sigma} = 0 \tag{89.48}$$

equations corresponding to the conservation laws for angular momentum and
for mass centre motion. As for the conservation laws for energy and momentum,
both formulations, based on (89.11) and on (89.48) respectively, give equivalent
results. We shall demonstrate this, with the omission of some fairly complicated
manipulation.

It follows from the definition (89.39) of the $w^{\sigma}_{\rho}$ that

0<7 Ip 

2Wp = - r « p ~ir ~ SpL + y «4r + {y ° ~ ro) ^ (89 - 49) 

p p 

Hence, using the equations of Appendix 1), we obtain

V( 9)-(G° p + «?) = (tTo TP - 9 aT fl a0 )l ( 89 - 5 °) 

^3 Lvl — 9) 

and after multiplication by $-c$:

* d co d 

16*y U° p = -- jfSL- — (g OT a“ p - !Tg Tfi ) (89.51) 

8 *$W(-9) 8x * 

This equation is analogous to (89.10). We obtain from it 

| h dx ) 3 = ^ J ^ (0^9 #T - Q'WS (89.52) 

Since the integration is over a distant surface we can take the limiting value 
of <7 /v / ( —</) outside the integral. Comparing the expression so obtained with 
(89.52) and (89.27) we may write 

(89 - 53) 

where $\overset{*}{P}{}^{0} = \overset{*}{M}$ and $\overset{*}{P}{}^{i}$ has the value (89.27). The last equation can also be
written in the form




(89.54) 
These relations confirm the fact that in spite of the difference in the differential
forms of the conservation laws, the corresponding integral laws are equivalent.
In addition, the presence of the quantities $(g_{\rho\nu})_{\infty}$ in the relations (89.54) shows
clearly that the total energy and total momentum of the system form a
four-dimensional vector in the Galilean space-time in which the system is embedded.


90. The Emission of Gravitational Waves and its Role in the Energy 
Balance 


In Section 89 we stated the conservation laws for energy and other quantities 
as equations expressing the balance of the quantity in question, i.e. expressing 
the fact that a change in the total amount of the quantity enclosed in some 
volume occurs only on account of the flux of the quantity across the surface 
bounding the volume. 

We now examine the question to what extent the flux through the surface 
may be neglected and the quantity in question may be considered constant. 
In other words, we enquire to what extent one can speak of conservation laws 
in the narrower sense. We shall confine these considerations to the conservation 
laws for energy and momentum. 

We use the definitions (89.18) and (89.19) for mass and momentum and
take the surface of integration to be spherical. Then, according to (89.13) to
(89.15), we can write:

$$\frac{d\overset{*}{M}}{dt} = -c^2\int n_k\,U^{0k}\,r^2\,d\omega \tag{90.01}$$

and

$$\frac{d\overset{*}{P}{}^{i}}{dt} = -c^2\int n_k\,U^{ik}\,r^2\,d\omega \tag{90.02}$$


the integration being over solid angle. We can take the surface of integration
to be so distant that it lies entirely in the wave zone. Using the results of
Section 87 it is simple to see that there the quantities $U^{\mu\nu}$ defined by (89.06)
or (89.07) reduce to the $N^{\mu\nu}$. Taking for the $N^{\mu\nu}$ the values (87.42) and for the
$T^{\mu\nu}$ the values (87.45) corresponding to electromagnetic radiation, we may put

$$U^{\mu\nu} = \sigma\,k^{\mu}k^{\nu} \tag{90.03}$$

where $\sigma$ is the (electromagnetic and gravitational) energy density introduced
in Section 87. Using equation (87.53) which gives the dependence of the energy
density $\sigma$ on the distance $r$, we can also write

$$U^{\mu\nu} = \frac{\sigma_0(\tau, \mathbf{n})}{r^2}\,k^{\mu}k^{\nu} \tag{90.04}$$

Taking the value of $k_{\nu}$ from (87.21) we have

$$U^{0i} = \frac{\sigma_0(\tau, \mathbf{n})}{c^3 r^2}\,n_i\,; \qquad U^{ik} = \frac{\sigma_0(\tau, \mathbf{n})}{c^2 r^2}\,n_i n_k \tag{90.05}$$

and therefore

$$n_k U^{0k} = \frac{\sigma_0(\tau, \mathbf{n})}{c^3 r^2}\,; \qquad n_k U^{ik} = \frac{\sigma_0(\tau, \mathbf{n})}{c^2 r^2}\,n_i \tag{90.06}$$
Inserting these values into (90.01) and (90.02) we get

$$\frac{d\overset{*}{M}}{dt} = -\frac{1}{c}\int \sigma_0(\tau, \mathbf{n})\,d\omega \tag{90.07}$$

and

$$\frac{d\overset{*}{P}{}^{i}}{dt} = -\int n_i\,\sigma_0(\tau, \mathbf{n})\,d\omega \tag{90.08}$$

But the density $\sigma_0$ is an even function of the $n_i$, both for the electromagnetic
and the gravitational field. Therefore equation (90.08) gives

$$\frac{d\overset{*}{P}{}^{i}}{dt} = 0 \tag{90.09}$$

As for equation (90.07), the integral in it will not be zero because $\sigma_0$ is a positive
quantity and it does not tend to zero for increasing $r$. Therefore an outward
flux of mass will always take place.

We consider first that part of this flow of mass which arises from the emission
of gravitational waves. To do this we must replace $\sigma_0$ by the expression $\sigma_{0g}$
given in (87.54), into which we must insert the values of $f_{\alpha\beta}$ from (87.60) and
(87.62). For brevity we put

$$A_{ik} = \frac{d^3}{d\tau^3}\,D_{ik}(\tau) \tag{90.10}$$

Equation (87.54) then gives

$$\sigma_{0g} = \frac{\gamma}{8\pi c^6}\left\{A^2 - 2A_iA_i + A_{ik}A_{ik} - \tfrac12 (A - A_{jj})^2\right\} \tag{90.11}$$

where we have introduced the short notation

$$A_i = A_{ik}n_k\,; \qquad A = A_i n_i = A_{ik}n_i n_k \tag{90.12}$$

It may be shown that $\sigma_{0g}$ depends not on all six $A_{ik}$ but only on five combinations
of them, namely on the quantities

$$B_{ik} = A_{ik} - a\,\delta_{ik}\,; \qquad a = \tfrac13 A_{jj} \tag{90.13}$$

which are subject to the relation

$$B_{11} + B_{22} + B_{33} = 0 \tag{90.14}$$

The $B_{ik}$ are the third derivatives of the quadrupole moments

$$\int \rho\,(x_i x_k - \tfrac13\delta_{ik}r^2)\,(dx)^3 \tag{90.15}$$

for $t = \tau$. Expressing the $A_{ik}$ in terms of the $B_{ik}$ and inserting first into (90.12)
and then into (90.11) we can verify that the quantity $a$ drops out of these
equations and that

$$\sigma_{0g} = \frac{\gamma}{8\pi c^6}\left\{B_{ik}B_{ik} - 2B_iB_i + \tfrac12 B^2\right\} \tag{90.16}$$

where the quantities $B_i$ and $B$ should be expressed in terms of the $B_{ik}$ by means
of formulae similar to (90.12). We check that the quantity $\sigma_{0g}$ is always positive.
Since it is invariant under three-dimensional rotations, it is sufficient to
make the check for any fixed direction of the vector $\mathbf{n}$ that enters the
expressions for $B_i$ and $B$. Putting

$$n_1 = 1\,; \qquad n_2 = 0\,; \qquad n_3 = 0 \tag{90.17}$$

and using (90.14) we get from (90.16)

$$\sigma_{0g} = \frac{\gamma}{8\pi c^6}\left\{\tfrac12 (B_{22} - B_{33})^2 + 2B_{23}^2\right\} \tag{90.18}$$

which is indeed positive.
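
The reduction to a sum of squares can also be checked numerically. The sketch below assumes that, up to a positive overall factor, the angular dependence of the gravitational energy density is the quadratic form B_ik B_ik - 2 B_i B_i + B^2/2, with B_i = B_ik n_k and B = B_i n_i; the function names and the random sampling are illustrative only and do not come from the text.

```python
import math
import random


def radiation_form(B, n):
    """Quadratic form B_ik B_ik - 2 B_i B_i + B^2 / 2 (assumed normalization).

    B is a symmetric traceless 3x3 matrix, n a unit vector,
    B_i = B_ik n_k and B = B_i n_i.
    """
    Bi = [sum(B[i][k] * n[k] for k in range(3)) for i in range(3)]
    Bn = sum(Bi[i] * n[i] for i in range(3))
    BB = sum(B[i][k] ** 2 for i in range(3) for k in range(3))
    return BB - 2.0 * sum(b * b for b in Bi) + 0.5 * Bn ** 2


def reduced_form(B):
    """The same form for n = (1, 0, 0), after using B_11 + B_22 + B_33 = 0."""
    return 0.5 * (B[1][1] - B[2][2]) ** 2 + 2.0 * B[1][2] ** 2


def random_traceless_symmetric(rng):
    """Random symmetric 3x3 matrix with its trace removed."""
    a = [[rng.uniform(-1.0, 1.0) for _ in range(3)] for _ in range(3)]
    B = [[0.5 * (a[i][k] + a[k][i]) for k in range(3)] for i in range(3)]
    trace = B[0][0] + B[1][1] + B[2][2]
    for i in range(3):
        B[i][i] -= trace / 3.0
    return B


def random_unit_vector(rng):
    """Random direction, obtained by normalizing a Gaussian vector."""
    while True:
        v = [rng.gauss(0.0, 1.0) for _ in range(3)]
        norm = math.sqrt(sum(x * x for x in v))
        if norm > 1e-12:
            return [x / norm for x in v]


if __name__ == "__main__":
    rng = random.Random(0)
    B = random_traceless_symmetric(rng)
    # For n along the first axis the general form and the sum of squares agree.
    print(radiation_form(B, [1.0, 0.0, 0.0]), reduced_form(B))
    # For arbitrary directions the form stays non-negative.
    print(min(radiation_form(B, random_unit_vector(rng)) for _ in range(1000)))
```

Because the form is invariant under rotations, agreement along one axis together with non-negativity of the sum of squares is all the argument requires; the random directions are only a sanity check.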

Equation (90.07) involves the integral of the expression (90.16) taken over
all solid angles. To evaluate it we can use the relations

$$\frac{1}{4\pi}\int n_i n_k\,d\omega = \tfrac13\,\delta_{ik} \tag{90.19}$$

and

$$\frac{1}{4\pi}\int n_i n_k n_l n_m\,d\omega = \tfrac{1}{15}\left(\delta_{ik}\delta_{lm} + \delta_{il}\delta_{km} + \delta_{im}\delta_{kl}\right) \tag{90.20}$$

which are most simply obtained by use of the identity

$$\frac{1}{4\pi}\int (a_i n_i)^{2p}\,d\omega = \frac{1}{2p+1}\left(a_1^2 + a_2^2 + a_3^2\right)^p \tag{90.21}$$

Taking first $p = 1$ and then $p = 2$ and examining the coefficients of the powers
and products of the $a_i$ we immediately get the required results. The integration
in (90.21) is readily performed using a polar coordinate system with its axis
in the direction of the vector $\mathbf{a}$.
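
These angular averages are easy to confirm numerically. The following sketch uses a simple midpoint rule in cos θ and φ (an illustrative choice of quadrature, with hypothetical function names) to compare both sides of the identity (1/4π)∫(a·n)^{2p} dω = |a|^{2p}/(2p+1) and a few components of the second and fourth moments of n.

```python
import math


def sphere_average(f, n_mu=400, n_phi=32):
    """Approximate (1/4 pi) * integral of f(n) over the unit sphere.

    Midpoint rule on a uniform grid in mu = cos(theta) and in phi.
    """
    total = 0.0
    for a in range(n_mu):
        mu = -1.0 + (a + 0.5) * 2.0 / n_mu
        s = math.sqrt(1.0 - mu * mu)
        for b in range(n_phi):
            phi = (b + 0.5) * 2.0 * math.pi / n_phi
            total += f((s * math.cos(phi), s * math.sin(phi), mu))
    return total / (n_mu * n_phi)


def moment_identity(a, p):
    """Return (numerical average of (a.n)^(2p), |a|^(2p) / (2p + 1))."""
    lhs = sphere_average(lambda n: sum(ai * ni for ai, ni in zip(a, n)) ** (2 * p))
    rhs = sum(ai * ai for ai in a) ** p / (2 * p + 1)
    return lhs, rhs


if __name__ == "__main__":
    print(moment_identity((1.0, 2.0, -0.5), 1))
    print(moment_identity((1.0, 2.0, -0.5), 2))
    # Components of the second and fourth moments of n:
    print(sphere_average(lambda n: n[0] * n[0]))                # expected near 1/3
    print(sphere_average(lambda n: n[0] ** 2 * n[1] ** 2))      # expected near 1/15
    print(sphere_average(lambda n: n[2] ** 4))                  # expected near 3/15
```

The fourth-moment values 1/15 and 3/15 are what the symmetric combination of Kronecker deltas gives for two distinct pairs of indices and for four equal indices respectively.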

Using the relations (90.19) and (90.20) we get

$$\frac{1}{4\pi}\int \left(B_{ik}B_{ik} - 2B_iB_i + \tfrac12 B^2\right) d\omega = \tfrac25\,B_{ik}B_{ik} \tag{90.22}$$

Inserting (90.16) into (90.07) and using (90.22) we get

$$\frac{d\overset{*}{M}}{dt} = -\frac{\gamma}{5c^7}\,B_{ik}B_{ik} \tag{90.23}$$

This expression gives us the rate at which mass flows out of the system as a
result of gravitational radiation. The corresponding equation for the rate of
energy loss is obtained by multiplying (90.23) by $c^2$ and has the form

$$\frac{d\overset{*}{W}}{dt} = -\frac{\gamma}{5c^5}\,B_{ik}B_{ik} \tag{90.24}$$

This loss of mass and of energy is completely negligible owing to the enormous
value of the constant

$$\frac{5c^3}{\gamma} = 2 \times 10^{39}\ \text{g/sec} \tag{90.25}$$

If $B$ characterizes the order of magnitude of the $B_{ik}$, then taking the system
formed by the Sun and Jupiter we can put, in round figures

$$\frac{B}{c^2} = 10^{14}\ \text{g/sec} \tag{90.26}$$
since the mass of Jupiter, its angular velocity of revolution around the Sun
and the ratio of the square of its orbital speed to $c^2$ have the values

$$m_J = 2 \cdot 10^{30}\ \text{g}\,; \qquad \omega_J = 2 \cdot 10^{-8}\ \text{sec}^{-1}\,; \qquad \frac{v_J^2}{c^2} = 2 \cdot 10^{-9} \tag{90.27}$$

Dividing the square of the number (90.26) by the value of the constant (90.25)
we get a loss of mass of $5 \cdot 10^{-12}$ g/sec. Translated into energy units this
corresponds to the ridiculously small power of 450 W. For comparison we mention
that the power of Solar electromagnetic radiation is about $4 \cdot 10^{12}$ g/sec, which
is about $10^{24}$ times greater.
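
The round figures in this estimate can be reproduced with a few lines of arithmetic. The sketch below works in CGS units and assumes that the third derivative of the quadrupole moment is of order B ~ m v^2 ω, with Jupiter's mass taken as roughly 2 x 10^30 g; these order-of-magnitude inputs, the rounded constants and the function names are assumptions made for illustration.

```python
C_LIGHT = 3.0e10   # speed of light in cm/s (rounded)
GAMMA = 6.67e-8    # Newtonian constant in cm^3 g^-1 s^-2 (rounded)


def radiation_constant():
    """The constant 5 c^3 / gamma, in g/sec."""
    return 5.0 * C_LIGHT ** 3 / GAMMA


def quadrupole_rate(m=2.0e30, omega=2.0e-8, v2_over_c2=2.0e-9):
    """Order of magnitude of B / c^2 in g/sec, assuming B ~ m v^2 omega."""
    return m * omega * v2_over_c2


def mass_loss(b_over_c2):
    """Mass loss rate (B/c^2)^2 / (5 c^3 / gamma), in g/sec."""
    return b_over_c2 ** 2 / radiation_constant()


def power_in_watts(mass_rate):
    """Convert a mass rate in g/sec to watts (1 erg/sec = 1e-7 W)."""
    return mass_rate * C_LIGHT ** 2 * 1.0e-7


if __name__ == "__main__":
    print(radiation_constant())            # about 2e39 g/sec
    print(quadrupole_rate())               # about 8e13, i.e. 1e14 in round figures
    rate = mass_loss(1.0e14)               # using the round figure
    print(rate, power_in_watts(rate))      # a few 1e-12 g/sec, a few hundred watts
```

Using the unrounded product 8 x 10^13 g/sec instead of 10^14 lowers the result by a factor of about 0.6, which does not affect the conclusion.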

This estimate completely confirms the conclusion stated at the end of Section
88 that in the problem of the gravitational interaction of heavy bodies
gravitational waves play no part at all.

Let us further examine the question with what accuracy a system of gravitating
masses may be considered conservative if electromagnetic radiation is
neglected.

When we solved the problem of mechanics in Chapter VI we deduced the
equations of motion to an accuracy that allowed us to find corrections to the
energy of order $Mq^4/c^2$ where $q$ was some characteristic speed. Equation
(90.24) shows that apart from electromagnetic radiation we could have gone
further, up to terms of order $Mq^6/c^4$. This is the accuracy to which the many
body problem can be formulated as a problem of mechanics, with its ten
classical integrals. We recall that in the problem of interacting charges (Sections
26 to 28) the greatest attainable accuracy corresponds to the order $Mq^4/c^2$,
because there radiation plays a much more important part.

91. The Connection between the Conservation Laws for the Field and 
the Integrals of Mechanics 

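In Section 89 we derived equations for the time derivatives of the quantities
$\overset{*}{M}$, $\overset{*}{P}{}^{i}$, $\overset{*}{M}{}^{ik}$, $\overset{*}{M}{}^{i0}$ and found a representation of the quantities in the form
of surface integrals. The expressions found for the total mass and for the
momentum have the form

$$\overset{*}{M} = \frac{c^2}{16\pi\gamma}\int n_i\,\frac{\partial}{\partial x_k}\left(\mathfrak{g}^{ik}\mathfrak{g}^{00} - \mathfrak{g}^{i0}\mathfrak{g}^{k0}\right) dS \tag{91.01}$$

and

$$\overset{*}{P}{}^{i} = \frac{c^2}{16\pi\gamma}\int n_j\,\frac{\partial}{\partial x_\alpha}\left(\mathfrak{g}^{\alpha j}\mathfrak{g}^{0i} - \mathfrak{g}^{\alpha 0}\mathfrak{g}^{ji}\right) dS \tag{91.02}$$

The angular momentum is given by

$$\overset{*}{M}{}^{ik} = \frac{c^2}{16\pi\gamma}\int n_j\left\{ x_i\,\frac{\partial}{\partial x_\alpha}\left(\mathfrak{g}^{\alpha j}\mathfrak{g}^{0k} - \mathfrak{g}^{\alpha 0}\mathfrak{g}^{jk}\right) - x_k\,\frac{\partial}{\partial x_\alpha}\left(\mathfrak{g}^{\alpha j}\mathfrak{g}^{0i} - \mathfrak{g}^{\alpha 0}\mathfrak{g}^{ji}\right) + \mathfrak{g}^{jk}\mathfrak{g}^{0i} - \mathfrak{g}^{ji}\mathfrak{g}^{0k} \right\} dS \tag{91.03}$$

The quantity $\overset{*}{M}{}^{i0}$ involved in the equation of mass centre motion is given by

$$\overset{*}{M}{}^{i0} = \overset{*}{M}\,\overset{*}{X}{}^{i} - \overset{*}{P}{}^{i}\,t \tag{91.04}$$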
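where $\overset{*}{P}{}^{i}$ has the value (91.02) and the first term is given by

$$\overset{*}{M}\,\overset{*}{X}{}^{i} = \frac{c^2}{16\pi\gamma}\int n_j\left\{ x_i\,\frac{\partial}{\partial x_k}\left(\mathfrak{g}^{jk}\mathfrak{g}^{00} - \mathfrak{g}^{j0}\mathfrak{g}^{k0}\right) + \mathfrak{g}^{j0}\mathfrak{g}^{i0} - \mathfrak{g}^{ji}\mathfrak{g}^{00} \right\} dS \tag{91.05}$$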


According to how distant the surface of integration $S$ is chosen to be, the
values of these integrals for the total mass and the other quantities will be
somewhat different because they will include a greater or lesser fraction of the
mass, etc., pertaining to the “ pure ” gravitational and electromagnetic radiation.
As we have demonstrated in the preceding section, the energy of gravitational
radiation is completely negligible. If we also neglect electromagnetic
radiation we can place the surface of integration not in the wave zone, but
instead at “ moderately large ” distances from the system of masses; by
doing this we, so to speak, cut off the mass and energy of the material media
and the static fields from the mass and energy of pure radiation.

In Sections 84 and 85 we derived approximate expressions for the gravitational 
potentials at moderately large distances from the bodies; these we shall now 
use. According to (85.46) the expression for g°° has the form 


1 4yM 
c c 3 r 


4van d 2 2y d 3 2y f . 7 . 

' ' + 8^1 Mr D}k ~ dx i dx j 8x k ‘5^rj P¥ '"‘ W 

7y 2 M 2 14y 2 iW 2 Z^. 4 8 2 W 


+ 


c°r‘ 


C°T* 


+ 


dt 2 


(91.06) 


The mixed components are given by (85.41) as 


G 0< = ^P iJ r 

C 3 7 " 


2y xjMji 


8 2 2y 


c,r 


dxj dt c 3 r 


8 2 2y f 

Da + -—— — p ViXjx k {dx) 3 

dxj ox]c c 3 r J 


| 7y 2 MP* | y 2 MP k XiXjc 4 d 2 W i 

c 5 r 2 c 5 r* c 5 dt 2 


(91.07) 


Finally, (85.48) gives the spatial components as 
d 2 2v 

Q ik = ~ c$ik + — ® ik 

dt 2 c 3 r 


d^Tdi £ J p(w * + X}XkVi ~ X(XkV M dx ? + Y M c ^ Xk 


(91.08) 


In the equations (91.06) to (91.08) we have written $X^i$ and $P^i$ instead of $X_i$ and
$P_i$ in order to stress the vector character of these quantities. The value of the
quantity $D_{ji}$ is given by (85.39); it is


D }i (t) = J pa*tyjl + ~(^v 2 + II - |C7)j (da:) 3 + J ?W(dx) 3 

(91.09) 

When inserting $D_{ji}$ into equation (91.08) for $\mathfrak{g}^{ik}$ it is sufficient to retain the
main terms which are just the usual moments of inertia. The argument of
$D_{ji}$ is the time $t$, and not $t - r/c$ as in the wave zone.

The expressions for $\mathfrak{g}^{00}$ and $\mathfrak{g}^{0i}$ also contain terms involving $W$ and $W_i$ which
represent retardation corrections. These terms are approximately given by


and 


4 8 2 W _ y !8 jk _ x } x k \d 2 D )k 
c 2 dt 2 c 5 \ r r 3 / dt 2 


y d 3 r d 2 C 

******* 

(91.10) 


4 d 2 Wi 

"W 


y X) d 3 D j( y /8 jk 
c 6 r dt 3 c 5 \ r 


XjxA d 2 f 

h^JW 2 ) 


pV(XjX k (dx) 3 


(91.11) 


The expressions for the $\mathfrak{g}^{\mu\nu}$ just stated contain the quantities $M$, $P^i$, $M^{ik}$
and $MX^i - P^i t$, the constancy of which is the expression of the mechanical
conservation laws. We may therefore expect these quantities to coincide with
the values of the integrals $\overset{*}{M}$, $\overset{*}{P}{}^{i}$, $\overset{*}{M}{}^{ik}$ and $\overset{*}{M}{}^{i0}$ which represent the total mass,
momentum, etc., of the whole system, including the field. We now verify this.

We have to insert the expressions (91.06) to (91.08) for the $\mathfrak{g}^{\mu\nu}$ into the surface
integrals (91.01) to (91.05) and to evaluate them approximately. In this calculation
we shall retain terms of order $q^2/c^2$ relative to the main terms when
calculating $\overset{*}{M}$ and $\overset{*}{P}{}^{i}$, but for simplicity we shall deal only with the main terms in
the evaluation of $\overset{*}{M}{}^{ik}$ and $\overset{*}{M}{}^{i0}$.

Leaving out the retardation term, we see that in all first derivatives of the
$\mathfrak{g}^{\mu\nu}$ the terms that decrease most slowly with distance are those of order $1/r^2$
which are all odd in the coordinates $x_1$, $x_2$, $x_3$, except for the derivative $\partial\mathfrak{g}^{ik}/\partial t$
which decreases as $1/r$ and is even in the coordinates. In general the retardation
corrections decrease more slowly but the correction in $\partial\mathfrak{g}^{00}/\partial x_k$ also decreases
as $1/r^2$ and is also odd in $x_1$, $x_2$, $x_3$.

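Bearing in mind these remarks we form the expression

$$\frac{\partial}{\partial x_k}\left(\mathfrak{g}^{ik}\mathfrak{g}^{00} - \mathfrak{g}^{i0}\mathfrak{g}^{k0}\right) = \mathfrak{g}^{i\alpha}\,\frac{\partial\mathfrak{g}^{00}}{\partial x_\alpha} - \mathfrak{g}^{0\alpha}\,\frac{\partial\mathfrak{g}^{i0}}{\partial x_\alpha} \tag{91.12}$$

that enters equation (91.01) for $\overset{*}{M}$. The right-hand side is obtained from
the left by replacing the summation over $k$ by summation over $\alpha$, including the
value 0, and by using the harmonic conditions. Since it is sufficient for the
evaluation to know the odd terms of order $1/r^2$ we may replace by their limiting
values all the $\mathfrak{g}^{\mu\nu}$ in (91.12) that are not differentiated. Doing this we get

$$\frac{\partial}{\partial x_k}\left(\mathfrak{g}^{ik}\mathfrak{g}^{00} - \mathfrak{g}^{i0}\mathfrak{g}^{k0}\right) = -c\,\frac{\partial\mathfrak{g}^{00}}{\partial x_i} - \frac{1}{c}\,\frac{\partial\mathfrak{g}^{i0}}{\partial t} \tag{91.13}$$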


This approximation is still valid if the retardation corrections are included, 
bearing in mind their order of magnitude in terms of q/c. 

Using equations (91.06) and (91.07) for $\mathfrak{g}^{00}$ and $\mathfrak{g}^{i0}$, multiplying (91.13) by
$n_i = x_i/r$ and summing over $i$, we get the following expression for the integrand
in (91.01)


da 00 rii eg <0 

cn ( —- 

8xi c dt 


4y M 
cV 2 



(8y — 3>q?ty) D tj 


(91.14) 
It is easy to see that the second term on the right is proportional to a second
order spherical harmonic so that it gives zero on integration over the sphere.
There remains only the first term, and the integral for $\overset{*}{M}$ becomes

$$\overset{*}{M} = M \tag{91.15}$$

Thus, in this approximation, the total mass $\overset{*}{M}$ of the system, including the
field, is equal to the mechanical mass $M$.
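
The leading term can be checked in a few lines. As a sketch, assume that at this order only the static monopole part of the potentials matters, so that $\mathfrak{g}^{00} \approx 1/c + 4\gamma M/(c^3 r)$, $\mathfrak{g}^{ik} \approx -c\,\delta_{ik}$ and $\mathfrak{g}^{i0} \approx 0$ (these truncations are assumptions made for the illustration, not the full expressions of the text). On a sphere of radius $r$, with $dS = r^2\,d\omega$, the surface integral for the total mass then reduces to

```latex
\begin{aligned}
n_i\,\frac{\partial}{\partial x_k}\bigl(\mathfrak{g}^{ik}\mathfrak{g}^{00} - \mathfrak{g}^{i0}\mathfrak{g}^{k0}\bigr)
  &\approx -c\,n_i\,\frac{\partial}{\partial x_i}\Bigl(\frac{4\gamma M}{c^3 r}\Bigr)
   = \frac{4\gamma M}{c^2 r^2}, \\
\overset{*}{M}
  &\approx \frac{c^2}{16\pi\gamma}\int \frac{4\gamma M}{c^2 r^2}\,r^2\,d\omega
   = \frac{c^2}{16\pi\gamma}\cdot\frac{16\pi\gamma M}{c^2} = M .
\end{aligned}
```

The result does not depend on the radius of the sphere, as expected for a term that falls off exactly as $1/r^2$.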

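Similarly we can calculate the quantity $\overset{*}{P}{}^{i}$. We have approximately

$$\frac{\partial}{\partial x_\alpha}\left(\mathfrak{g}^{\alpha j}\mathfrak{g}^{0i} - \mathfrak{g}^{\alpha 0}\mathfrak{g}^{ji}\right) = -c\,\frac{\partial\mathfrak{g}^{0i}}{\partial x_j} - \frac{1}{c}\,\frac{\partial\mathfrak{g}^{ji}}{\partial t} \tag{91.16}$$

and further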


dQ oi 1 

— cn< — - ni 

dxj c 


8tf* 

■ 8t 


= £ p ~ s i ~W + h <**■ - & J pwI*)" 


(91.17) 


Here terms of the order q 2 /c 2 as compared with the main terms have been 
retained and in them the expansion in inverse powers of r has been taken up 
to terms in 1/r 2 . In forming the expression (91.17) both terms of the retardation 
correction were included. 

On the right of (91.17) the second and third terms are proportional to spherical
harmonics and give zero on integration. The first term gives

$$\overset{*}{P}{}^{i} = P^{i} \tag{91.18}$$

Thus the total momentum also is equal to the mechanical momentum.

We now pass on to the evaluation of $\overset{*}{M}{}^{ik}$. In the approximation stated we
have

$$n_j\left(\mathfrak{g}^{jk}\mathfrak{g}^{0i} - \mathfrak{g}^{ji}\mathfrak{g}^{0k}\right) = -c\left(n_k\,\mathfrak{g}^{0i} - n_i\,\mathfrak{g}^{0k}\right) \tag{91.19}$$

We use equation (91.16) and the obvious identity

$$n_i x_j = n_j x_i \tag{91.20}$$

and can then write the integral (91.03) for $\overset{*}{M}{}^{ik}$ as

A “=iH 9 “ - * % - 

-*('»“ -*> a s-y-w)} ds < 9i2i > 


To evaluate the integrals containing $\partial\mathfrak{g}^{ik}/\partial t$ we would need to know $\mathfrak{g}^{ik}$,
including terms decreasing as $1/r^3$ (we have in mind terms in $\mathfrak{g}^{ik}$ containing
$c^3$ in the denominator). But in Section 84 we have determined $\mathfrak{g}^{ik}$ only up to
terms decreasing as $1/r^2$. Although the evaluation of the missing terms does
not present any difficulties in principle, we can avoid it, if in (91.21) we confine
ourselves to the main terms and observe that the integrals containing $\partial\mathfrak{g}^{ik}/\partial t$
contribute only corrections of order $q^2/c^2$. The integrand in (91.21) then
becomes


crii 




Xj 


W 

dXj) 


= 4 + 

c l r l c z r 


nkPi) + ~ (*« 


‘in k n])(M]i + f)ji) 


(8„ - 3 n inj )(M }k + f) ]k ) (91.22) 


Here the right-hand side is expressed in terms of spherical harmonics and
involves harmonics of zero, first and second orders†. On integration only the
first term remains and we get

$$\overset{*}{M}{}^{ik} = M^{ik} \tag{91.23}$$

as was to be expected.

It remains to evaluate (91.05). Using (91.13), we can write this integral as 
* * c 2 f ( / da 00 1 dqJ°\ 1 ) 

MX ' ~ l6rty J H 8 '* ~ *< S? ~ 7' X ’-W) ~ ^ ” ,(8 “ + ' 

(91.24) 

Since $\partial\mathfrak{g}^{i0}/\partial t$ can be expressed in terms of the $\partial\mathfrak{g}^{ik}/\partial x_k$ by use of the harmonic
conditions and the latter quantities may be calculated from (91.08) up to terms
decreasing as $1/r^3$ we could evaluate all terms in the integral. But to simplify
calculations which are of no great interest we restrict ourselves again to the
main terms. Then we can write


cn< 


( 8 “-^H 


4yMX* 

C 2 } -2 


+ nil + W - il (8« - 3 n inj )MXi 


c 2 r ) cV 2 


(91.25) 


Hence, arguing as before, we obtain

$$\overset{*}{M}\,\overset{*}{X}{}^{i} = M X^{i} \tag{91.26}$$

and since we have already verified that $\overset{*}{P}{}^{i} = P^{i}$, we have

$$\overset{*}{M}{}^{i0} = M X^{i} - P^{i}\,t \tag{91.27}$$

as it should be.

The calculations just performed can be thought of as a verification of the 
correctness of the approximate solution of Einstein’s equations given by the 
equations (91.06) to (91.08). At the same time the results of these calculations 
illustrate clearly the relation between the conservation laws stated in general 
form and the integrals of mechanics. 


† If terms of relative order $q^2/c^2$ were taken into account the integrand in (91.21) would
involve harmonics of up to fourth order.
92. The Uniqueness Theorem for the Wave Equation

In the next section we shall deal with the problem whether for an isolated
system harmonic coordinates are uniquely determined. In that investigation
we shall need a theorem which establishes the condition of uniqueness for the
solution of the wave equation

$$\Box\psi = 0 \tag{92.01}$$

We shall now prove this theorem for the wave equation with constant coefficients
for which

$$\Box\psi = \frac{1}{c^2}\frac{\partial^2\psi}{\partial t^2} - \left(\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2} + \frac{\partial^2\psi}{\partial z^2}\right) \tag{92.02}$$

We subject the function

$$\psi = \psi(x, y, z, t) \tag{92.03}$$

to the following conditions:

(a) The condition of boundedness: for all $x$, $y$, $z$ and $t$ the inequality

$$|\psi| < M_0 \tag{92.04}$$

shall hold so that the function is uniformly bounded;

(b) The condition of decrease at infinity: if the distance from the origin,
$r = \sqrt{x^2 + y^2 + z^2}$, increases without limit the solution shall satisfy the
inequality

$$|\psi| < \frac{M}{r} \tag{92.05}$$

for all $t$ and its first derivatives shall satisfy

$$|\operatorname{grad}\psi| < \frac{M_1}{r}\,; \qquad \left|\frac{1}{c}\frac{\partial\psi}{\partial t}\right| < \frac{M_1}{r} \tag{92.06}$$

Thus the function $\psi$ and its first derivatives are to decrease at infinity in inverse
proportion to $r$, or more rapidly. The quantities $M_0$, $M$ and $M_1$ are positive
constants.

(c) The condition of outward radiation: for $r \to \infty$ and all values of
$t'_0 = t + r/c$ in an arbitrary fixed interval the limiting condition

$$\lim_{r\to\infty}\left\{\frac{\partial(r\psi)}{\partial r} + \frac{1}{c}\frac{\partial(r\psi)}{\partial t}\right\} = 0 \tag{92.07}$$



shall be satisfied. It expresses the fact that there are outgoing waves only, waves 
coming from outside being absent. 
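
A simple numerical illustration may help here. The sketch below takes two spherically symmetric solutions of the wave equation, an outgoing wave f(t - r/c)/r and an incoming wave f(t + r/c)/r, with a Gaussian profile f and units in which c = 1 (both are arbitrary choices made for the illustration), and evaluates the combination d(rψ)/dr + (1/c) d(rψ)/dt by central differences. The function names are hypothetical.

```python
import math

C = 1.0  # wave speed in arbitrary units (assumption for the illustration)


def profile(s):
    """Arbitrary smooth wave profile f(s)."""
    return math.exp(-s * s)


def psi_out(r, t):
    """Outgoing spherical wave f(t - r/c) / r."""
    return profile(t - r / C) / r


def psi_in(r, t):
    """Incoming spherical wave f(t + r/c) / r."""
    return profile(t + r / C) / r


def radiation_residual(psi, r, t, h=1.0e-4):
    """Central-difference estimate of d(r psi)/dr + (1/c) d(r psi)/dt."""
    d_r = ((r + h) * psi(r + h, t) - (r - h) * psi(r - h, t)) / (2.0 * h)
    d_t = (r * psi(r, t + h) - r * psi(r, t - h)) / (2.0 * h)
    return d_r + d_t / C


if __name__ == "__main__":
    r = 50.0
    # Outgoing wave sampled where its profile is non-negligible: residual ~ 0.
    print(radiation_residual(psi_out, r, r / C + 0.3))
    # Incoming wave at fixed t + r/c = 0.3: residual stays finite, equal to 2 f'(0.3)/c.
    print(radiation_residual(psi_in, r, 0.3 - r / C))
```

For the incoming wave the residual at fixed t + r/c does not decrease with r, so such a solution is excluded by condition (c), whereas the outgoing wave satisfies the condition identically.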

We can now formulate the following uniqueness theorem. 

The solution of the homogeneous wave equation $\Box\psi = 0$ which satisfies the
conditions (a), (b) and (c) is identically zero.

In the proof of this theorem we start by using Kirchhoff’s formula for the
solution of the wave equation.† Kirchhoff’s formula expresses the values of


† For Kirchhoff’s formula and its generalizations, S. L. Sobolev’s formulae, see V. I.
Smirnov, A Course of Higher Mathematics, Vol. II, Sec. 202, 1957, and Vol. IV, Sec. 148, 1958.
the function $\psi$ at a point in space $\mathbf{r}_0$ at time $t_0$ in terms of the values of $\psi$ and
its derivatives on a surface $S$ surrounding this point, the value of $\psi$ on each
point $\mathbf{r}$ of the surface being taken not at time $t_0$ but at the earlier time

$$t = t_0 - \frac{1}{c}\,|\mathbf{r} - \mathbf{r}_0| \tag{92.08}$$

Kirchhoff’s formula reads as follows:

$$\psi(\mathbf{r}_0, t_0) = \frac{1}{4\pi}\iint\left\{\frac{1}{R}\left[\frac{\partial\psi}{\partial\nu}\right] - [\psi]\,\frac{\partial}{\partial\nu}\frac{1}{R} + \frac{1}{cR}\,\frac{\partial R}{\partial\nu}\left[\frac{\partial\psi}{\partial t}\right]\right\} dS \tag{92.09}$$

Here $R$ denotes the distance

$$R = |\mathbf{r} - \mathbf{r}_0| \tag{92.10}$$

and $\partial\psi/\partial\nu$ is the derivative in the direction of the outward normal to $S$:

$$\frac{\partial\psi}{\partial\nu} = \frac{\partial\psi}{\partial x}\cos(nx) + \frac{\partial\psi}{\partial y}\cos(ny) + \frac{\partial\psi}{\partial z}\cos(nz) \tag{92.11}$$

The bracketed expression $[\psi]$ signifies that the argument $t$ must be replaced
by its value (92.08), this replacement being made in the derivatives $\partial\psi/\partial\nu$
and $\partial\psi/\partial t$ after the differentiation.

We take for $S$ the surface of a sphere of radius $R$ centred at $\mathbf{r}_0$. Then we
can put

$$dS = R^2\,d\omega \tag{92.12}$$

where $d\omega$ is the element of solid angle. Further

$$\cos(nx) = \frac{x - x_0}{R}\,; \qquad \cos(ny) = \frac{y - y_0}{R}\,; \qquad \cos(nz) = \frac{z - z_0}{R} \tag{92.13}$$


and since 



(92.14) 


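we can also write

$$\frac{\partial\psi}{\partial\nu} = \frac{\partial\psi}{\partial R} \tag{92.15}$$

Then Kirchhoff’s formula becomes

$$\psi(\mathbf{r}_0, t_0) = \frac{1}{4\pi}\iint\left\{\frac{\partial[R\psi]}{\partial R} + \frac{1}{c}\frac{\partial[R\psi]}{\partial t}\right\} d\omega \tag{92.16}$$

The expression in curly brackets under the integral sign is given by

$$\frac{\partial[R\psi]}{\partial R} + \frac{1}{c}\frac{\partial[R\psi]}{\partial t} = \psi(\mathbf{r}, t) + (x - x_0)\frac{\partial\psi}{\partial x} + (y - y_0)\frac{\partial\psi}{\partial y} + (z - z_0)\frac{\partial\psi}{\partial z} + \frac{R}{c}\frac{\partial\psi}{\partial t} \tag{92.17}$$

where $t$ has the meaning (92.08).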
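According to (92.16) the value of the function $\psi$ at $\mathbf{r}_0$, $t_0$ is the mean over
solid angles of (92.17). In order that $\psi$ should be zero at $\mathbf{r}_0$, $t_0$ it is evidently
sufficient if for increasing $R$ the expression (92.17) tends to zero:

$$\lim_{R\to\infty}\left\{\frac{\partial[R\psi]}{\partial R} + \frac{1}{c}\frac{\partial[R\psi]}{\partial t}\right\}_{t = t_0 - (R/c)} = 0 \tag{92.18}$$

For any point $\mathbf{r}_0$ remaining at a finite distance from the origin this requirement
will certainly be satisfied if conditions (a), (b) and (c) hold, for (92.17) can be
written in the form

$$\frac{\partial[R\psi]}{\partial R} + \frac{1}{c}\frac{\partial[R\psi]}{\partial t} = \varphi + \varphi_1 + \varphi_2 \tag{92.19}$$

where

$$\varphi = \psi(\mathbf{r}, t) + x\frac{\partial\psi}{\partial x} + y\frac{\partial\psi}{\partial y} + z\frac{\partial\psi}{\partial z} + \frac{r}{c}\frac{\partial\psi}{\partial t} \tag{92.20}$$

$$\varphi_1 = -\left(x_0\frac{\partial\psi}{\partial x} + y_0\frac{\partial\psi}{\partial y} + z_0\frac{\partial\psi}{\partial z}\right) \tag{92.21}$$

$$\varphi_2 = \frac{R - r}{c}\frac{\partial\psi}{\partial t} \tag{92.22}$$

Since $|R - r| \le r_0$, the equations (92.06) of condition (b) give

$$|\varphi_1| < \frac{r_0 M_1}{r} \tag{92.23}$$

$$|\varphi_2| < \frac{r_0 M_1}{r} \tag{92.24}$$

If $r_0$ is fixed and $r \to \infty$ these expressions tend to zero. Therefore we have
separately

$$\varphi_1 \to 0\,; \qquad \varphi_2 \to 0 \qquad (\text{for } r \to \infty) \tag{92.25}$$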

and for (92.18) to hold it is sufficient that

$$\varphi \to 0 \qquad (\text{for } r \to \infty) \tag{92.26}$$

or

$$\lim_{r\to\infty}\left\{\frac{\partial(r\psi)}{\partial r} + \frac{1}{c}\frac{\partial(r\psi)}{\partial t}\right\} = 0 \tag{92.27}$$

This condition, as well as (92.18), is to be satisfied for values of $r$ and $t$ such
that when $r \to \infty$, the quantity $t_0 = t + R/c$ remains fixed. Now let us introduce
the quantity $t'_0 = t + r/c$ that enters the formulation of condition (c). We have
$|t_0 - t'_0| \le r_0/c$, and since $r_0$ is also fixed, the quantities $t_0$ and $t'_0$ will both
belong to some arbitrary fixed intervals if one of them belongs to such an
interval. One can therefore replace $t_0$ by $t'_0$ in condition (92.27) which then
becomes the condition of outward radiation (92.07). This latter is a sufficient
condition for the limiting equation (92.18) to hold, whence $\psi = 0$ follows
directly (see [43]).

We have proved that the function $\psi$ is zero at the point $\mathbf{r}_0$, but since this
point may be chosen arbitrarily this proves that $\psi$ vanishes everywhere. Therefore
the uniqueness theorem formulated above is proved.
We now consider the solution of the inhomogeneous wave equation

$$\Box\psi = 4\pi\rho \tag{92.28}$$

expressed in the familiar form of a retarded potential

$$\psi(\mathbf{r}_0, t_0) = \int \frac{[\rho]}{|\mathbf{r} - \mathbf{r}_0|}\,(dx)^3 \tag{92.29}$$

where

$$[\rho] = \rho(\mathbf{r}, t)\,; \qquad t = t_0 - \frac{1}{c}\,|\mathbf{r} - \mathbf{r}_0| \tag{92.30}$$

The integration in (92.29) is over the coordinates $(x, y, z)$ and extends over
all that volume where the “ density ” $\rho$ is non-zero, i.e. over the region of the
masses. If this region lies entirely within the sphere $r \le a$ and if the time
$t_0$ satisfies the inequality $ct_0 > r_0 + a$ the calculation of the integral (92.29)
requires the values of $\rho$ only for positive values of the argument $t$.

At large distances from the masses the function $\psi$, which satisfies the
homogeneous equation outside the masses, has the asymptotic form

$$\psi(\mathbf{r}, t) = \frac{1}{r}\,\mu\!\left(t - \frac{r}{c},\ \mathbf{n}\right) \tag{92.31}$$

where $\mathbf{n}$ is the unit vector in the direction of the radius vector $\mathbf{r}$. The function
$\mu$ is expressible in terms of $\rho$ as follows

$$\mu(\tau, \mathbf{n}) = \int \rho\!\left(\mathbf{r}',\ \tau + \frac{\mathbf{n}\cdot\mathbf{r}'}{c}\right)(dx')^3 \tag{92.32}$$

These equations show that the retarded potential satisfies the conditions
(a), (b) and (c) formulated above. By virtue of the theorem just proved it is
therefore the only solution of the inhomogeneous wave equation (92.28) that
satisfies these conditions. Thus, the theorem gives a mathematical justification
for the use of the retarded potential; usually it is derived from physical
considerations.

The uniqueness theorem we have formulated has only been proved for the
wave equation with constant coefficients. One should expect it to remain true
also for such equations with variable coefficients $g^{\mu\nu}$ which have the asymptotic
behaviour defined in Section 87. The proof of the theorem for the general
case presents considerable difficulties and is an unsolved mathematical problem.
It is, however, much simpler to prove the theorem for the case that one can
restrict oneself to the “ stationary ” approximation for the $g^{\mu\nu}$, and may
write the wave equation in the form

$$\frac{n^2}{c^2}\frac{\partial^2\psi}{\partial t^2} - \Delta\psi = 0 \tag{92.33}$$

where $\Delta$ is the Euclidean Laplace operator and the “ refractive index ” $n$ has
the value

$$n = 1 + \frac{2U}{c^2} \tag{92.34}$$

Here $U$ is the Newtonian potential, which for this problem may be considered
to be independent of time. One could construct the proof either with the aid of
S. L. Sobolev’s formula† which is a generalization of Kirchhoff’s for equations
of the form (92.33) or one could expand $\psi$, considered as a function of $t$, in a
Fourier integral and use the well-known Sommerfeld radiation conditions, which
in this case are consequences of our condition (92.27).

In conclusion we remark that the statement of the problem adopted in this 
section differs from the more usual statement in terms of Cauchy’s problem in 
that we do not explicitly introduce initial conditions, but instead study such 
solutions of the wave equation which do not depend on the initial conditions 
within the region of space and the interval of time in question. This formulation 
is dictated by the physical nature of the problem. In it the solution is made 
unique and independent of the initial conditions by limiting the space-time
region: any initial disturbance that may have occurred in this region is supposed
to have dispersed long ago and further disturbances cannot enter the
region because of the condition of outward radiation; in the case of the
inhomogeneous equation they may be generated within the region, but then they are
determined not by initial conditions but by the “density” $\rho$.

93. On the Uniqueness of the Harmonic Coordinate System 

When solving Einstein’s equations for an isolated system of masses we used
harmonic coordinates and in this way obtained a perfectly unambiguous
solution. We found unique results not only for finite and “moderately large”
distances from the masses, when the wave-like, i.e. hyperbolic, character of
Einstein’s equations was not essential and was accounted for by the introduction
of retardation corrections, but also for the “wave zone”. This would
lead us to expect that the harmonic conditions together with the requirement 
of Euclidean behaviour at infinity and with the condition ensuring uniqueness 
of a wave type solution should determine the coordinate system uniquely 
apart from a Lorentz transformation. 

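We now consider this question in detail. If $\Box\psi$ is the invariant d’Alembertian

$$\Box\psi = \frac{1}{\sqrt{-g}}\,\frac{\partial}{\partial x_\mu}\left(\sqrt{-g}\,g^{\mu\nu}\,\frac{\partial\psi}{\partial x_\nu}\right) \tag{93.01}$$

the harmonic conditions

$$\frac{\partial\left(\sqrt{-g}\,g^{\mu\nu}\right)}{\partial x_\mu} = 0 \tag{93.02}$$

may be written in the form

$$\frac{1}{\sqrt{-g}}\,\frac{\partial\left(\sqrt{-g}\,g^{\mu\nu}\right)}{\partial x_\mu} = \Box x_\nu = 0 \tag{93.03}$$

so that each of the functions

$$\psi = x_0; \quad \psi = x_1; \quad \psi = x_2; \quad \psi = x_3 \tag{93.04}$$

is a solution of d’Alembert’s equation

$$\Box\psi = 0 \tag{93.05}$$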


Evidently this equation is also satisfied by any constant and any linear function
of the harmonic coordinates.

† Sobolev’s formula differs from Kirchhoff’s formula (92.09) essentially in that it involves
not only a surface integral but also a volume integral. Reasoning as described above, we
can take the integral over the infinitely distant surface to vanish by virtue of the
conditions on $\psi$. This then gives not directly $\psi = 0$, but instead a homogeneous integral
equation which has zero as its only solution. The quantity $t_0 = t + r/c$ in the statement
of condition (c) of the theorem is to be replaced by $t_0^* = t + r^*/c$ with $r^*$ from (87.12).

We first consider the case that the metric of space-time is Galilean. In 
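Galilean coordinates

$$x_0 = ct; \quad x_1 = x; \quad x_2 = y; \quad x_3 = z \tag{93.06}$$

the components of the metric tensor are†

$$g^{\mu\nu} = e_\mu\,\delta_{\mu\nu} \tag{93.07}$$

and the d’Alembertian has the form

$$\Box\psi = \sum_{\mu=0}^{3} e_\mu\,\frac{\partial^2\psi}{\partial x_\mu^2} \tag{93.08}$$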


Evidently each of the Galilean coordinates is harmonic. 

We seek the most general form for harmonic coordinates. Let us assume 
transformation equations to new coordinates of the form

$$x'_\alpha = f^\alpha(x_0, x_1, x_2, x_3) \tag{93.09}$$

The new coordinates, like the old, must satisfy d’Alembert’s equation, they
must be Galilean at large distances and must lead to values of the $g^{\mu\nu}$ with the
correct asymptotic behaviour. Remembering these requirements we shall
write the functions $f^\alpha$ in the form

$$f^\alpha = a_\alpha + e_\beta\,a_{\alpha\beta}\,x_\beta + \eta^\alpha(x_0, x_1, x_2, x_3) \tag{93.10}$$

Here the $a_\alpha$ and the $a_{\alpha\beta}$ are the coefficients of a Lorentz transformation and so
satisfy the relations

$$e_\alpha\,a_{\alpha\mu}\,a_{\alpha\nu} = e_\mu\,\delta_{\mu\nu}; \qquad e_\alpha\,a_{\mu\alpha}\,a_{\nu\alpha} = e_\mu\,\delta_{\mu\nu} \tag{93.11}$$
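
As a numerical illustration of the relations (93.11), the sketch below builds the coefficients $a_{\alpha\beta}$ for a boost along $x_1$ and evaluates both relations. It assumes the convention $x'_\alpha = a_\alpha + e_\beta a_{\alpha\beta} x_\beta$ with $e_0 = 1$, $e_1 = e_2 = e_3 = -1$; the function names and the sample velocity are hypothetical choices made for this example.

```python
import math

E = (1.0, -1.0, -1.0, -1.0)  # signs e_0, e_1, e_2, e_3 (assumed convention)


def boost_coefficients(beta):
    """Coefficients a[alpha][b] of a boost along x_1 with velocity beta * c.

    The boost is x'_0 = g (x_0 - beta x_1), x'_1 = g (x_1 - beta x_0).
    Writing it as x'_alpha = sum_b e_b a[alpha][b] x_b means that
    a[alpha][b] equals e_b times the ordinary matrix element.
    """
    g = 1.0 / math.sqrt(1.0 - beta * beta)
    matrix = [
        [g, -g * beta, 0.0, 0.0],
        [-g * beta, g, 0.0, 0.0],
        [0.0, 0.0, 1.0, 0.0],
        [0.0, 0.0, 0.0, 1.0],
    ]
    return [[E[b] * matrix[al][b] for b in range(4)] for al in range(4)]


def max_violation(a):
    """Largest deviation from either of the two relations (93.11)."""
    worst = 0.0
    for m in range(4):
        for n in range(4):
            target = E[m] if m == n else 0.0
            first = sum(E[al] * a[al][m] * a[al][n] for al in range(4))
            second = sum(E[al] * a[m][al] * a[n][al] for al in range(4))
            worst = max(worst, abs(first - target), abs(second - target))
    return worst


# Deviation for a boost with beta = 0.6; it should be at rounding level.
print(max_violation(boost_coefficients(0.6)))
```

A set of coefficients that is not a Lorentz transformation (for instance the same array with one entry rescaled) gives a deviation of order unity, which is a quick way to see that the relations are restrictive.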

Since the functions $f^\alpha$ must satisfy the equation

$$\Box f^\alpha = 0 \tag{93.12}$$

and the linear terms satisfy this equation separately, we must have

$$\Box\eta^\alpha = 0 \tag{93.13}$$

This equation must hold throughout space and at all $t$ (i.e. all $x_0$). Since the
equations (93.13) involve second derivatives it is natural to require that the
functions $\eta^\alpha$ themselves and their first derivatives should be bounded throughout
space and at all $t$. This requirement is all the more necessary because the first
derivatives of the new coordinates with respect to the old enter the transformation
equations for any tensor. As for the conditions at infinity, since there
the new coordinates are to be Galilean, the functions $\eta^\alpha$, which represent additions
to the Lorentz transformation, must tend to zero together with all their first
derivatives. In addition, the requirements resulting from the asymptotic
behaviour of the $g^{\mu\nu}$ must also be satisfied at infinity. We consider these
requirements in more detail. Without restricting generality we can assume that the
Lorentz transformation occurring in (93.10) is the identity, so that that equation
can be written as

$$f^\alpha = x_\alpha + \eta^\alpha(x_0, x_1, x_2, x_3) \tag{93.14}$$

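† When using the summation convention for indices occurring twice the index of
$e_\mu$ ($e_0 = 1$; $e_1 = e_2 = e_3 = -1$) is not to be counted.

Calculating the components of the metric tensor in the new coordinate system
by the general rule

$$g'^{\alpha\beta} = g^{\mu\nu}\,\frac{\partial x'_\alpha}{\partial x_\mu}\,\frac{\partial x'_\beta}{\partial x_\nu} \tag{93.15}$$

we get

$$g'^{\alpha\beta} = e_\alpha\,\delta_{\alpha\beta} + e_\alpha\,\frac{\partial\eta^\beta}{\partial x_\alpha} + e_\beta\,\frac{\partial\eta^\alpha}{\partial x_\beta} + e_\mu\,\frac{\partial\eta^\alpha}{\partial x_\mu}\,\frac{\partial\eta^\beta}{\partial x_\mu} \tag{93.16}$$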


By the condition at infinity the quantities $\partial\eta^\alpha/\partial x_\beta$ tend to zero. Therefore
the asymptotic behaviour of the $g'^{\alpha\beta}$ is characterized by the terms linear in these
quantities. But this asymptotic behaviour must be such that the difference
between $g^{\alpha\beta}$ and its limiting value at infinity is an outgoing wave.
(Corresponding relations for the $\mathfrak{g}^{\alpha\beta}$ were introduced in Section 87, and similar relations
for the $g^{\alpha\beta}$ follow from them.) Thus we must have at infinity

$$g^{\alpha\beta} - (g^{\alpha\beta})_\infty = \text{outgoing wave} \tag{93.17}$$

This requirement must be satisfied in any harmonic coordinate system, so that
the $g'^{\alpha\beta}$ of equation (93.16) must also satisfy it. It follows therefore that

$$e_\alpha\,\frac{\partial\eta^\beta}{\partial x_\alpha} + e_\beta\,\frac{\partial\eta^\alpha}{\partial x_\beta} = \text{outgoing wave} \tag{93.18}$$

If we put

$$\eta_\alpha = e_\alpha\,\eta^\alpha \tag{93.19}$$

and divide through by $e_\alpha e_\beta$ we can write this condition in the form

$$\frac{\partial\eta_\beta}{\partial x_\alpha} + \frac{\partial\eta_\alpha}{\partial x_\beta} = \text{outgoing wave} \tag{93.20}$$

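Using the identity

$$\frac{\partial}{\partial x_\beta}\left(\frac{\partial\eta_\nu}{\partial x_\alpha} + \frac{\partial\eta_\alpha}{\partial x_\nu}\right) + \frac{\partial}{\partial x_\alpha}\left(\frac{\partial\eta_\nu}{\partial x_\beta} + \frac{\partial\eta_\beta}{\partial x_\nu}\right) - \frac{\partial}{\partial x_\nu}\left(\frac{\partial\eta_\beta}{\partial x_\alpha} + \frac{\partial\eta_\alpha}{\partial x_\beta}\right) = 2\,\frac{\partial^2\eta_\nu}{\partial x_\alpha\,\partial x_\beta} \tag{93.21}$$

and passing from the $\eta_\nu$ to the $\eta^\nu$ we conclude that

$$\frac{\partial^2\eta^\nu}{\partial x_\alpha\,\partial x_\beta} = \text{outgoing wave} \tag{93.22}$$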
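
The identity used in this step can be verified by expanding the three brackets; the second derivatives of $\eta_\alpha$ and $\eta_\beta$ cancel in pairs. The check below relies only on the commutativity of partial derivatives, and the shorthand $\partial_\alpha = \partial/\partial x_\alpha$ is introduced here for brevity.

```latex
% Term-by-term check of the identity leading to (93.22).
\begin{align*}
\partial_\beta\left(\partial_\alpha\eta_\nu + \partial_\nu\eta_\alpha\right)
  &= \partial_\alpha\partial_\beta\eta_\nu + \partial_\beta\partial_\nu\eta_\alpha, \\
\partial_\alpha\left(\partial_\beta\eta_\nu + \partial_\nu\eta_\beta\right)
  &= \partial_\alpha\partial_\beta\eta_\nu + \partial_\alpha\partial_\nu\eta_\beta, \\
-\,\partial_\nu\left(\partial_\alpha\eta_\beta + \partial_\beta\eta_\alpha\right)
  &= -\,\partial_\alpha\partial_\nu\eta_\beta - \partial_\beta\partial_\nu\eta_\alpha, \\
\text{sum} &= 2\,\partial_\alpha\partial_\beta\eta_\nu .
\end{align*}
```

Each bracket on the left is of the form appearing in (93.20). If one accepts that differentiating an outgoing wave again gives an outgoing wave, the right-hand side inherits that character, which is the content of (93.22).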

Hence one can deduce that $\eta^\nu$ is the sum of a linear function and of a function
which at infinity represents an outgoing wave. The same conclusion evidently
applies not only to the functions $\eta^\nu$ entering (93.14) but also to the $\eta^\nu$ entering
(93.10). Since in (93.10) the linear function has already been split off and since
we have the condition at infinity $\eta^\nu \to 0$, we arrive at the conclusion that the
function $\eta^\nu$ itself represents an outgoing wave:

$$\eta^\nu = \text{outgoing wave} \tag{93.23}$$

The conditions satisfied by the additional function $\eta^\nu$ in (93.10) can now be
made more precise. This additional function satisfies the wave equation (93.13)
and remains bounded everywhere together with its first derivatives. At infinity
both $\eta^\nu$ and its first derivatives decrease as $1/r$. Finally, from (93.23) we can
infer that at infinity $\eta^\nu$ satisfies a condition of outward radiation of the form
(92.27). All these conditions must be satisfied for all values of $t$. By virtue of
the uniqueness theorem proved in Section 92 a quantity $\eta^\nu$ satisfying these
conditions vanishes identically. Therefore the function $f^\alpha$ that enters (93.09)
and (93.10) reduces to a linear function and the most general form of
transformation from one system of harmonic coordinates to another is

$$x'_\alpha = a_\alpha + e_\beta\,a_{\alpha\beta}\,x_\beta \tag{93.24}$$

i.e. it amounts to a Lorentz transformation.

We have proved this result for the case of Galilean space-time with the 
metric (93.07), because the uniqueness theorem formulated in Section 92 was 
proved for that case. However, it appears certain that this theorem can also 
be proved for the general case of an Einstein space-time whose metric tensor
has the asymptotic behaviour established in Section 87. If we accept this we 
reach the conclusion that in the general case also the harmonic coordinate 
system is uniquely defined apart from a Lorentz transformation. Indeed, 
the reasoning of this section remains valid in the general case. In a harmonic 
coordinate system the d’Alembertian has the form

$$\Box\psi = g^{\mu\nu}\,\frac{\partial^2\psi}{\partial x_\mu\,\partial x_\nu} \tag{93.25}$$

and does not involve any first derivatives. Consequently any linear function 
of the coordinates satisfies the wave equation. Therefore, if we write down a 
transformation of the coordinates in the form (93.09), where $f^\alpha$ has the value
(93.10) and if we require that the new coordinates should also be harmonic
(equation (93.12)) we shall again obtain for the additional function the wave
equation (93.13), but with $\Box$ now denoting the operator in (93.25). The
additions $\eta^\alpha$ and their first derivatives must evidently be bounded throughout
space. As for the remaining conditions imposed on the $\eta^\alpha$, they all refer to
the infinitely distant regions of space, where the metric differs little from
the Euclidean. Therefore these conditions may be safely taken over from the
previous case. In particular, the left-hand side of (93.18) will represent the
difference between the asymptotic expressions for the $g^{\alpha\beta}$ in initial and
transformed coordinates and must, therefore, correspond to an outgoing wave. Hence
we conclude as before that the relations (93.22) and (93.23) are valid and then
the uniqueness theorem leads to the conclusion that the additions $\eta^\alpha$ vanish.

Thus in the general case also, the harmonic coordinate system is uniquely 
determined apart from a Lorentz transformation. 

We should note that we have not explicitly introduced any initial conditions 
for the functions $f^\alpha$ that define the coordinate transformation. This statement
of the problem corresponds to the statement of the problem of determining the
gravitational potentials $g^{\mu\nu}$. In that case there is sense in introducing initial
conditions only for the distribution of the gravitating masses, but not for the 
gravitational waves and not for the gravitational potentials themselves. The 
problem of determining the gravitational potentials is not Cauchy’s problem, 
i.e. not a problem with initial conditions, but rather the problem of finding 
a steady state which sets in when all gravitational waves have dispersed, other 
than those produced by the motion of the masses under consideration. (See 
above, end of Section 92.) 

Our discussion shows that the conditions ensuring the uniqueness of the 
coordinate system arise directly from the formulation of the physical problem.
Since this is not Cauchy’s problem as regards the determination of the
gravitational potentials it will not be Cauchy’s problem in relation to the determination
of the coordinate system. The determinateness in the formulation of the
problem of the gravitational potentials is achieved by excluding “transient”
gravitational waves by the condition of outward radiation. But by the same
means we exclude the corresponding wave terms in the equations for the
coordinate transformations, and thus in turn we achieve the uniqueness of the
coordinate system.

The question of the uniqueness of the harmonic coordinate system in the
non-stationary case may also be approached from another direction, namely from
the approximate method for solving the gravitational equations used in 
Chapter VI. If one follows the calculations performed there one finds that at 
all stages the solution, and therefore the coordinate system also, are obtained 
uniquely. Thus the uniqueness of the harmonic coordinate system can also be 
taken as proved in all cases in which the method of Chapter VI is applicable. 

Let us formulate our conclusions once again. In the case of an isolated system
of masses there exists a coordinate system which is determined uniquely apart from
a Lorentz transformation if suitable supplementary conditions are imposed. This
is the harmonic coordinate system, characterized by a first order linear differential
equation and by boundary conditions for the $g^{\mu\nu}$.

The question may arise: can any coordinate systems other than the harmonic
exist, defined apart from a Lorentz transformation? This seems to us
to be unlikely. The harmonic coordinates are characterized by the fact that
each of them, and also any linear combination, satisfies a linear, generally
covariant equation. One could hardly find other coordinates with this property;
other coordinate conditions, even when very similar to the harmonic condition
(93.02), for instance $\partial g^{\mu\nu}/\partial x_\mu = 0$, do not reduce to a linear equation.

The fact that for determinateness Einstein’s ten gravitational equations 
must be supplemented by four additional equations is in itself evident. The 
difficulty consists in choosing these supplementary equations and in formulating 
boundary conditions. 

In contrast with our statement of the problem, the formulation given to it by 
Einstein leaves it mathematically indeterminate and there can be no question 
of uniqueness of the solution. According to Einstein there are no privileged 
systems of coordinates and the system of coordinates remains indeterminate 
to the end. Einstein and those who support this point of view raise this
indeterminacy to a virtue and see in it a deep meaning, namely the expression of some
“ general principle of relativity ”, on the strength of which Einstein’s theory is 
called by its author the “ General Theory of Relativity ”. 

We cannot agree at all with such a point of view. The indeterminateness
existing in Einstein’s formulation is by no means a matter of principle and has 
no deep significance whatsoever. It arises in any physical theory that can be 
formulated in a generally covariant manner. The so-called “ special ” theory 
of relativity has actually been formulated in a generally covariant manner 
(Chapter IV of this book) and thus possesses, in this formulation, the property 
of indeterminateness to just the same degree as the so-called “general” theory 
of relativity (the theory of gravitation). What really matters is not the
possibility of an indeterminate formulation (which is trivial) but just the opposite,
namely, a formulation that is as determinate as is allowed by the nature of the
problem. From this point of view, the existence of a preferred set of coordinates, 
determined apart from a Lorentz transformation, is by no means trivial, but 
reflects intrinsic properties of space-time. This is most clearly seen in the case 
of the “special” theory of relativity, where the existence of Galilean coordinates 
reflects the uniformity of space-time. 

If this is made clear, the obvious necessity of introducing additional equations 
in order to define privileged coordinate systems (and thus to achieve a more 
determinate formulation of the problem) cannot give rise to any serious
objections. This necessity exists in the theory of Einsteinian as well as in that of
Galilean space-time. The fact that in the latter case no such equations are 
usually written down does not alter anything, because such equations can be 
written down or alternatively may be replaced by the verbal statement that 
Galilean coordinates should be used. 

It is obvious that the additional equations mentioned are not generally 
covariant (otherwise they would not restrict the coordinate system). Thus the
objection may be raised that the additional equations destroy the beauty of
the theory which, according to the current opinion, lies in its general covariance.
But this argument provides no real objection, in the first place because these 
equations must be compatible with the generally covariant Einsteinian equations, 
the simple form of which constitutes the real beauty of his theory. It should 
be added that the simplicity and beauty of the theory is due not so much to 
its covariant form as to its idea of unifying metric and gravitation. 

The uniqueness of the solution achieved by the use of a well-defined
privileged coordinate system seems to us an undoubted advantage rather than the
contrary. This uniqueness shows the physical meaning of the privileged system, 
while the actual physical significance of any other coordinate system only 
becomes clear given the equations relating it to the privileged system. 

As for the so-called “ General Principle of Relativity ” no such physical 
principle can exist, as was explained in detail in Section 49*. In contrast, the
Galilean relativity principle for uniform straight line motion, and its
generalization expressed by the Lorentz transformations, are physical principles.

With the assumption of a rigid metric (i.e. one that does not depend on the 
physical processes occurring in space and time) the Galileo-Lorentz relativity 
principle leads to uniform space-time and, conversely, is an expression of this 
uniformity. In the case of non-uniform space-time one can consider the
existence of harmonic coordinates to be a further generalization (or wider
application) of the Galilean principle of relativity; this reasoning corresponds to the
generalized definition of the principle of relativity put forward in Section 49*,
as a statement about the equivalence of frames of reference within a certain
class. As an illustration let us consider the expressions (91.06)-(91.08) for the
$g^{\mu\nu}$. If we assume that the constants involved in these expressions are fixed and
perform a Lorentz transformation we shall change the form of the expression
for the $g^{\mu\nu}$, because the space-time is not uniform. But if we also transform
the constants $(M, P_i)$ and $(M_{0i}, M_{ik})$ by the same Lorentz transformation we
restore the form of the $g^{\mu\nu}$. This just means that if a process (the motion of
masses) is possible in which the constants of motion have some given values in
one frame of reference, then a process is also possible in which these constants
have those values in another frame of reference; in other words it means that
the principle of relativity is satisfied in the form that was given to it in
Section 49*.

Here we have considered the approximate expressions for the $g^{\mu\nu}$ and only
the constants of motion, not any other quantities characterizing a system of 
bodies. But our conclusions also remain in force for the exact solutions, because 
the gravitational equations, the harmonic conditions and also the boundary 
conditions are strictly covariant with respect to Lorentz transformations. 
Thus the principle of relativity expressed by the Lorentz transformations can hold 
in non-uniform space also, but a general principle of relativity cannot be valid. 

We have stressed repeatedly the fundamental significance of the existence 
of a preferred coordinate system, defined apart from a Lorentz transformation. 
It reveals itself, among other facts, in the following. Only if the existence of
such a coordinate system is recognized can one speak of the correctness of the
heliocentric Copernican system in the same sense as this is possible in
Newtonian mechanics. If the existence of the preferred coordinates is denied, one is
led to the inadmissible point of view that the heliocentric Copernican system 
and the geocentric Ptolemaic system are equivalent. Such a point of view is 
counter to the definition of physical equivalence of frames of reference given in 
Section 49* and is therefore inadmissible. 

The above remarks concerning the privileged character of the harmonic 
system of coordinates should not be understood, in any case, as some kind of
prohibition of the use of other coordinate systems. Nothing is more alien to 
our point of view than such an interpretation. Our aim was to clarify the question 
of coordinate systems in the restricted statement of the problem of the theory 
of gravitation discussed in the preceding chapters, where an “ insular ” 
distribution of matter is assumed. The intention was to show that this question 
has the same answer as in the case of the Galilean space-time of the “ special ” 
theory of relativity. Nobody would assert that in the latter case the existence 
of Galilean coordinates implies a prohibition of the use of other coordinates. 
Likewise, in the case of the Theory of Gravitation, the existence of harmonic
coordinates, though a fact of primary theoretical and practical importance, does
not in any way preclude the use of other, non-harmonic, coordinate systems. 
The harmonic coordinates in Einstein space are no more (though no less) 
privileged than Galilean coordinates in uniform Galilean space. 

94. Friedmann-Lobachevsky Space

When we stated the problem of solving the gravitational equations for a 
system of masses, we postulated that this system should be isolated and 
embedded in infinite Euclidean space or, more accurately, in Galilean space-time.
Such a statement of the problem is undoubtedly admissible when considering
objects of the dimensions of the Solar system. These dimensions may be
characterized by the radii of the orbits of the heaviest planet, i.e. of Jupiter
(778 million km), and of the most distant planet, i.e. Pluto (5900 million km).
Light from the Sun reaches Pluto in about 5.5 hours. The nearest fixed star
(α Centauri) is at a distance of 4.3 light years from the Sun, i.e. about 7,000
times more distant than Pluto.

It is possible that such a statement of the problem is still permissible when
considering a single star cluster or even the whole Galaxy. The dimensions
of the galaxies† are still some tens of times smaller than their mutual distances.
But if one considers regions of space so great as to include many galaxies— 
and such regions of space are observable with large modern telescopes—the 
notion of an isolated system embedded in Euclidean space evidently becomes 
inapplicable. 

The question arises with what to replace the Euclidean background which 
we have hitherto considered? What kind of space-time should one take to be 
the background from which the separate mass systems stand out? 

Another reason for the need to replace the Euclidean background by some 
other is that if one considers masses distributed throughout Euclidean space 
with a uniform mean density the well-known paradox of Seeliger arises which 
amounts to the fact that the Newtonian potential of a uniform mass distribution 
does not exist. At the same time astronomical observation shows that
throughout the region of space accessible to observation, up to distances of the order of
$10^9$ light years, the distribution of galaxies seems to be uniform on the average.
On the other hand, if the Newtonian potential does not exist it is clear that the
form of the solution of Einstein’s equations must be essentially different from
that which corresponds to an isolated system of masses and which allows the
gravitational potentials $g^{\mu\nu}$ to be approximately expressed in terms of the
Newtonian potential. Owing to the altered character of the problem the choice 
of coordinates must also be made anew. 

The solution of Einstein’s equations which corresponds to an isotropic space 
with a uniform mass density and which may be taken as the background in
the study of enormous distances that include many galaxies was obtained in 
1922 by the Russian scientist A. A. Friedmann [24]. As we shall see this solution 
permits the introduction of coordinates in which space possesses the properties 
of Lobachevsky’s geometry. We shall therefore call the corresponding
space-time manifold the Friedmann-Lobachevsky space.

The theory of Friedmann-Lobachevsky space may be derived starting from 
the assumption that the expression for ds 2 is invariant under the homogeneous 
Lorentz group. It may then always be reduced to the form 

$$ds^2 = H^2(S)\,(dx_0^2 - dx_1^2 - dx_2^2 - dx_3^2) \tag{94.01}$$

where

$$S = \sqrt{x_0^2 - x_1^2 - x_2^2 - x_3^2} \tag{94.02}$$

Though this space-time is not itself Galilean, equation (94.01) shows that it 
may nevertheless be mapped conformally on a Galilean space-time. The 
group of homogeneous Lorentz transformations exists in this space and this 
ensures the isotropy of the space. In particular, the origin of the spatial
coordinates is in no way singled out from any other point in the space; any fixed point
may be transformed so as to become the origin by means of a homogeneous 
Lorentz transformation. Thus the homogeneous Lorentz transformations play 
a twofold role: they effect both a transfer of the origin of coordinates and a 
passage to a moving reference frame. 


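† By the Galaxy (with a capital “G”) one means that stellar system to which the Sun
belongs. Other similar stellar systems are termed galaxies (with a small “g”).

We write $ds^2$ in the form

$$ds^2 = H^2\,(e_\mu\,dx_\mu^2) \tag{94.03}$$

We then see that

$$g_{\mu\nu} = H^2\,e_\mu\,\delta_{\mu\nu}; \qquad g^{\mu\nu} = \frac{1}{H^2}\,e_\mu\,\delta_{\mu\nu} \tag{94.04}$$

For the Christoffel symbols we get the expressions

$$\Gamma^\rho_{\alpha\beta} = \frac{1}{S}\,\frac{d\log H}{dS}\left(e_\alpha\,\delta_{\rho\beta}\,x_\alpha + e_\beta\,\delta_{\rho\alpha}\,x_\beta - e_\alpha\,\delta_{\alpha\beta}\,x_\rho\right) \tag{94.05}$$

We use the equation

$$R^\rho_{\mu,\alpha\beta} = \frac{\partial\Gamma^\rho_{\mu\alpha}}{\partial x_\beta} - \frac{\partial\Gamma^\rho_{\mu\beta}}{\partial x_\alpha} + \Gamma^\sigma_{\mu\alpha}\,\Gamma^\rho_{\sigma\beta} - \Gamma^\sigma_{\mu\beta}\,\Gamma^\rho_{\sigma\alpha} \tag{94.06}$$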
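to form the mixed fourth rank curvature tensor (see (44.01)) and obtain

$$R^\rho_{\mu,\alpha\beta} = \frac{1}{S^2}\left[\frac{d^2\log H}{dS^2} - \frac{1}{S}\frac{d\log H}{dS} - \left(\frac{d\log H}{dS}\right)^2\right]\left[e_\mu x_\mu\left(\delta_{\rho\alpha}\,e_\beta x_\beta - \delta_{\rho\beta}\,e_\alpha x_\alpha\right) + e_\mu x_\rho\left(\delta_{\mu\beta}\,e_\alpha x_\alpha - \delta_{\mu\alpha}\,e_\beta x_\beta\right)\right] + \left[\frac{2}{S}\frac{d\log H}{dS} + \left(\frac{d\log H}{dS}\right)^2\right] e_\mu\left(\delta_{\rho\alpha}\,\delta_{\mu\beta} - \delta_{\rho\beta}\,\delta_{\mu\alpha}\right) \tag{94.07}$$

Here we put $\alpha = \rho$, $\beta = \nu$ and sum over $\rho$, getting

$$R_{\mu\nu} = \left[\frac{d^2\log H}{dS^2} + \frac{5}{S}\frac{d\log H}{dS} + 2\left(\frac{d\log H}{dS}\right)^2\right] e_\mu\,\delta_{\mu\nu} + \frac{2}{S^2}\left[\frac{d^2\log H}{dS^2} - \frac{1}{S}\frac{d\log H}{dS} - \left(\frac{d\log H}{dS}\right)^2\right] e_\mu e_\nu\,x_\mu x_\nu \tag{94.08}$$

We calculate the invariant

$$R = g^{\mu\nu}R_{\mu\nu} = \frac{6}{H^2}\left[\frac{d^2\log H}{dS^2} + \frac{3}{S}\frac{d\log H}{dS} + \left(\frac{d\log H}{dS}\right)^2\right] \tag{94.10}$$

Now we can form the conservative tensor

$$G_{\mu\nu} = R_{\mu\nu} - \tfrac{1}{2}\,g_{\mu\nu}\,R \tag{94.11}$$

From (94.08), (94.09) and (94.04) we obtain

$$G_{\mu\nu} = -\left[2\,\frac{d^2\log H}{dS^2} + \frac{4}{S}\frac{d\log H}{dS} + \left(\frac{d\log H}{dS}\right)^2\right] e_\mu\,\delta_{\mu\nu} + \frac{2}{S^2}\left[\frac{d^2\log H}{dS^2} - \frac{1}{S}\frac{d\log H}{dS} - \left(\frac{d\log H}{dS}\right)^2\right] e_\mu e_\nu\,x_\mu x_\nu \tag{94.12}$$

By Einstein’s equations we have

$$G_{\mu\nu} = -\frac{8\pi\gamma}{c^2}\,T_{\mu\nu} \tag{94.13}$$

As we have indicated in the derivation of these equations, the mass tensor $T_{\mu\nu}$
in the cosmological case, when conditions at infinity are absent, is determined
only up to an additive term of the form $\lambda g_{\mu\nu}$; in this case the constant
characterizes not only the mass tensor itself but also the properties of space as a whole,
and choosing a value for it represents a special hypothesis.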

Our basic aim is the examination of Friedmann-Lobachevsky space, but 
before we pass on to this task we shall touch briefly on the case of a maximally 
uniform space, to which we referred repeatedly in our previous discussions. 
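In maximally uniform space the relations (49.12) must be satisfied. They
may be written in the form

$$R^\rho_{\mu,\alpha\beta} = K\left(\delta^\rho_\alpha\,g_{\mu\beta} - \delta^\rho_\beta\,g_{\mu\alpha}\right) \tag{94.14}$$

Comparing this expression with (94.07) we obtain agreement if we put

$$\frac{d^2\log H}{dS^2} - \frac{1}{S}\frac{d\log H}{dS} - \left(\frac{d\log H}{dS}\right)^2 = 0 \tag{94.15}$$

$$\frac{2}{S}\frac{d\log H}{dS} + \left(\frac{d\log H}{dS}\right)^2 = K H^2 \tag{94.16}$$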


Taking $1/H$ as our new unknown function we obtain from (94.15) a linear
equation, the solution of which has the form

$$\frac{1}{H} = a - bS^2 \tag{94.17}$$

Insertion of this expression into (94.16) gives a relation between the constants

$$4ab = K \tag{94.18}$$

Putting $a = 1$ we get $b = \tfrac{1}{4}K$ so that

$$\frac{1}{H} = 1 - \frac{1}{4}KS^2 \tag{94.19}$$

and therefore

$$ds^2 = \frac{dx_0^2 - dx_1^2 - dx_2^2 - dx_3^2}{\left[1 - \tfrac{1}{4}K\,(x_0^2 - x_1^2 - x_2^2 - x_3^2)\right]^2} \tag{94.20}$$
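
The step from (94.15) and (94.16) to (94.17) and (94.18) can be made explicit. The sketch below assumes that (94.15) is the vanishing of $d^2\log H/dS^2 - (1/S)\,d\log H/dS - (d\log H/dS)^2$ and that (94.16) equates $(2/S)\,d\log H/dS + (d\log H/dS)^2$ to $KH^2$; the symbol $w = 1/H$ and the primes for $d/dS$ are notation introduced here.

```latex
% Sketch: the substitution w = 1/H linearizes (94.15) and fixes 4ab = K.
\begin{align*}
w = \frac{1}{H}: \qquad
  \frac{d\log H}{dS} &= -\frac{w'}{w}, \qquad
  \frac{d^2\log H}{dS^2} = -\frac{w''}{w} + \frac{w'^2}{w^2}, \\
\text{(94.15)} \;\Longrightarrow\;
  -\frac{1}{w}\left(w'' - \frac{w'}{S}\right) &= 0
  \;\Longrightarrow\; w = a - bS^2, \\
\text{(94.16)} \;\Longrightarrow\;
  K = w^2\left[-\frac{2w'}{Sw} + \frac{w'^2}{w^2}\right]
  &= -\frac{2ww'}{S} + w'^2
  = 4b\,(a - bS^2) + 4b^2S^2 = 4ab .
\end{align*}
```

The $S$-dependent terms cancel in the last line, so a single relation between the two constants of integration is all that (94.16) imposes.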


In addition to the group of homogeneous Lorentz transformations, which
contains 6 parameters, the expression (94.20) also allows a more general group of
transformations with 10 parameters; we shall not state these here (see ref. [14]).
It follows from equation (94.14) that

$$R_{\mu\nu} = 3K\,g_{\mu\nu}; \qquad R = 12K \tag{94.21}$$

whence

$$G_{\mu\nu} = -3K\,g_{\mu\nu} \tag{94.22}$$

Thus a maximally uniform space corresponds formally to a mass tensor given by

$$T_{\mu\nu} = \frac{3Kc^2}{8\pi\gamma}\,g_{\mu\nu} \tag{94.23}$$

Coming back now to the Friedmann-Lobachevsky space we assume that the
mass tensor has the form

$$T_{\mu\nu} = \rho^*\,u_\mu u_\nu \tag{94.24}$$

where $\rho^*$ is the invariant density and the $u_\nu$ are the covariant components of
the four-dimensional velocity normalized by the relation

$$u_\nu u^\nu = 1 \tag{94.25}$$

In the usual (non-cosmological) statement of the problem the mass tensor (94.24)
corresponds to “dust-like” matter at zero pressure (see equation (32.17)). 
In the cosmological statement the choice of the additive constant in the pressure 
(as of the constant K in equation (94.23)) represents a special hypothesis, as was 
mentioned, which characterizes the properties of the space as a whole. Our 
choice of mass tensor, though not the only one possible, allows us to construct 
a theory which reflects correctly some features of observed phenomena. 

If the conservative tensor (94.12) and the mass tensor (94.24) are to be
proportional, the first term in (94.12) must be zero. This gives

$$2\,\frac{d^2\log H}{dS^2} + \frac{4}{S}\frac{d\log H}{dS} + \left(\frac{d\log H}{dS}\right)^2 = 0 \tag{94.26}$$

If we put

$$H = H_1^2; \qquad \log H = 2\log H_1 \tag{94.27}$$

equation (94.26) becomes linear

$$\frac{d^2 H_1}{dS^2} + \frac{2}{S}\frac{dH_1}{dS} = 0 \tag{94.28}$$

and may readily be integrated. If we assume that at $S = \infty$ we have $H_1 = 1$,
which means that space-time is Galilean and the scale factor is the usual one,
we get

$$H_1 = 1 - \frac{A}{S} \tag{94.29}$$

and therefore

$$H = \left(1 - \frac{A}{S}\right)^2 \tag{94.30}$$
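
The solution just obtained can be checked numerically. The sketch below assumes $H = (1 - A/S)^2$ and evaluates the left-hand side of (94.26), taken here as $2\,d^2\log H/dS^2 + (4/S)\,d\log H/dS + (d\log H/dS)^2$, by central differences; the function names, the step size and the sample values of $S$ are illustrative choices, not part of the original text.

```python
import math


def log_h(s, a):
    """log H for the assumed solution H = (1 - A/S)^2."""
    return 2.0 * math.log(1.0 - a / s)


def residual_94_26(log_h_func, s, step=1e-4):
    """Left-hand side of (94.26): 2 f'' + (4/S) f' + f'^2 with f = log H.

    First and second derivatives are estimated by central differences.
    """
    f_plus = log_h_func(s + step)
    f_mid = log_h_func(s)
    f_minus = log_h_func(s - step)
    d1 = (f_plus - f_minus) / (2.0 * step)
    d2 = (f_plus - 2.0 * f_mid + f_minus) / (step * step)
    return 2.0 * d2 + 4.0 * d1 / s + d1 * d1


A = 1.0
for S in (2.0, 5.0, 20.0):
    # The residual should be zero up to discretization and rounding error.
    print(S, residual_94_26(lambda s: log_h(s, A), S))
```

Replacing the exponent 2 in `log_h` by any other value leaves a residual proportional to $(d\log H/dS)^2$, which shows that it is the substitution $H = H_1^2$ that removes the quadratic term.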
With this value of $H$ we have

$$G_{\mu\nu} = -\frac{12A}{S^5\,(1 - A/S)^2}\,e_\mu e_\nu\,x_\mu x_\nu \tag{94.31}$$

and Einstein’s equations become

$$\frac{8\pi\gamma}{c^2}\,\rho^*\,u_\mu u_\nu = \frac{12A}{S^5\,(1 - A/S)^2}\,e_\mu e_\nu\,x_\mu x_\nu \tag{94.32}$$

We form the invariants on both sides of the equation, getting

$$\frac{8\pi\gamma}{c^2}\,\rho^* = \frac{12A}{S^3\,(1 - A/S)^6} \tag{94.33}$$

These invariants are equal to $R$; therefore we could have used equation (94.10)
directly with the value of $H$ from (94.30) inserted for calculating the right-hand
side of (94.33). Since the density $\rho^*$ is positive the constant $A$ must also
be positive.
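
The remark that the invariant equals $R$ can be verified directly. The sketch below assumes that (94.10) has the form $R = (6/H^2)\left[d^2\log H/dS^2 + (3/S)\,d\log H/dS + (d\log H/dS)^2\right]$ and inserts $H = (1 - A/S)^2$; the abbreviation $\varphi = \log H$ is introduced here.

```latex
% Sketch: R evaluated for H = (1 - A/S)^2, with phi = log H.
\begin{align*}
\varphi' &= \frac{2A}{S\,(S - A)}, \qquad
\varphi'' = -\frac{2A\,(2S - A)}{S^2\,(S - A)^2}, \\
\varphi'' + \frac{3}{S}\,\varphi' + \varphi'^2
  &= \frac{-2A(2S - A) + 6A(S - A) + 4A^2}{S^2\,(S - A)^2}
   = \frac{2A}{S\,(S - A)^2}, \\
R &= \frac{6}{H^2}\cdot\frac{2A}{S\,(S - A)^2}
   = \frac{12A}{S^3\,(1 - A/S)^6} .
\end{align*}
```

The result is positive for $A > 0$ and $S > A$, in line with the requirement that the density be positive.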

With the use of (94.33) equation (94.32) gives

$$u_\mu = \frac{1}{S}\left(1 - \frac{A}{S}\right)^2 e_\mu\,x_\mu \tag{94.34}$$

and, on going over to contravariant components we get

$$u^\mu = \frac{x_\mu}{S\,(1 - A/S)^2} \tag{94.35}$$


We have obtained a solution of Einstein’s equations corresponding to a
non-vanishing mass density $\rho^*$ uniformly distributed over all space. (In Newtonian
theory such a solution does not exist.) This result was first obtained by A. A.
Friedmann in the 1922 paper mentioned above [24].

We write the solution obtained in explicit form. We have

$$ds^2 = \left(1 - \frac{A}{S}\right)^4 (dx_0^2 - dx_1^2 - dx_2^2 - dx_3^2) \tag{94.36}$$

where

$$S = \sqrt{x_0^2 - x_1^2 - x_2^2 - x_3^2} \tag{94.02}$$

It is interesting to note that this expression is invariant not only under the
group of homogeneous Lorentz transformations, but also under inversions

$$x'_\alpha = \frac{A^2}{S^2}\,x_\alpha \tag{94.37}$$

The latter, however, have no direct physical significance because they transform
the region $S > A$ into the region $S' < A$, and only the case $S > A$ has physical
significance. It is also easy to find the equations relating the coordinates $x_\alpha$
to harmonic coordinates $x^*_\alpha$, namely

$$x^*_\alpha = \left(1 + \frac{4A}{3S} + \frac{A^2}{S^2}\right) x_\alpha \tag{94.38}$$
The harmonic coordinates do not change under the inversion (94.37). In the
problem of isolated masses the preferred role of the harmonic coordinates was
based on the boundary conditions discussed in Section 93, including the
condition of Euclidean behaviour at infinity. In Friedmann-Lobachevsky space,
however, the conditions are quite different and the “conformally-Galilean”
coordinates, in which $ds^2$ is of the form (94.36), are more appropriate to the
character of the problem.
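
The statements about the inversion and about the harmonic coordinates lend themselves to a numerical check. The sketch below assumes the inversion $x'_\alpha = A^2 x_\alpha / S^2$ and the harmonic coordinates $x^*_\alpha = (1 + 4A/(3S) + A^2/S^2)\,x_\alpha$, together with $ds^2 = (1 - A/S)^4 (dx_0^2 - dx_1^2 - dx_2^2 - dx_3^2)$. All function names, step sizes and sample points are hypothetical, and the derivatives are plain central differences rather than anything taken from the text.

```python
import math

E = (1.0, -1.0, -1.0, -1.0)  # signs e_0..e_3 of the Galilean metric


def s_of(x):
    """S = sqrt(x_0^2 - x_1^2 - x_2^2 - x_3^2)."""
    return math.sqrt(sum(e * xi * xi for e, xi in zip(E, x)))


def scale(x, A):
    """Conformal factor H = (1 - A/S)^2, so that ds^2 = H^2 * (Galilean interval)."""
    return (1.0 - A / s_of(x)) ** 2


def invert(x, A):
    """Inversion x' = A^2 x / S^2 (assumed form)."""
    s2 = s_of(x) ** 2
    return [A * A * xi / s2 for xi in x]


def interval(x, dx, A):
    """ds^2 for a small displacement dx at the point x."""
    return scale(x, A) ** 2 * sum(e * d * d for e, d in zip(E, dx))


def inversion_ratio(x, dx, A):
    """Ratio of ds^2 after and before the inversion; close to 1 if ds^2 is invariant."""
    moved = [p + q for p, q in zip(x, dx)]
    dx_inv = [p - q for p, q in zip(invert(moved, A), invert(x, A))]
    return interval(invert(x, A), dx_inv, A) / interval(x, dx, A)


def harmonic(x, alpha, A):
    """Candidate harmonic coordinate (1 + 4A/(3S) + A^2/S^2) x_alpha (assumed form)."""
    s = s_of(x)
    return (1.0 + 4.0 * A / (3.0 * s) + (A / s) ** 2) * x[alpha]


def shifted(x, mu, h):
    """Copy of x with component mu displaced by h."""
    y = list(x)
    y[mu] += h
    return y


def wave_operator(psi, x, A, h=1e-3):
    """Central-difference estimate of sum_mu d/dx_mu (H^2 e_mu dpsi/dx_mu).

    For the conformal metric this is sqrt(-g) times the d'Alembertian of psi.
    """
    def flux(y, mu):
        dpsi = (psi(shifted(y, mu, h)) - psi(shifted(y, mu, -h))) / (2.0 * h)
        return scale(y, A) ** 2 * E[mu] * dpsi

    return sum(
        (flux(shifted(x, mu, h), mu) - flux(shifted(x, mu, -h), mu)) / (2.0 * h)
        for mu in range(4)
    )


A = 1.0
point = [6.0, 1.0, 2.0, 0.5]
print(inversion_ratio(point, [3e-7, 1e-7, -2e-7, 1e-7], A))
print(wave_operator(lambda y: harmonic(y, 1, A), point, A))
print(wave_operator(lambda y: y[1], point, A))
```

The first number tests the invariance of $ds^2$, the second tests that $x^*_1$ satisfies the wave equation, and the third shows for comparison that the conformally-Galilean coordinate $x_1$ itself does not.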

We discuss the motion of matter that corresponds to the solution obtained. 
In the equation of continuity 

$$ \frac{\partial}{\partial x_\alpha}\left(\sqrt{-g}\;\rho^* u^\alpha\right) = 0 $$


we must put 




whence it is evident that it is satisfied by virtue of the identity 

$$ \frac{\partial}{\partial x_\alpha}\left(\frac{x_\alpha}{S^4}\right) = 0 $$

The velocity field is characterized by the quantities (94.35). We have 


(94.39) 


(94.40) 


(94.41) 


(94.42) 
For directness of interpretation we put x_0 = ct and can then write

$$ \frac{dx_i}{dt} = \frac{x_i}{t} \tag{94.43} $$

whence for each mass

$$ x_i = v_i t \tag{94.44} $$

where v_i is constant.

Since S² > 0 we must have

$$ v_1^2 + v_2^2 + v_3^2 < c^2 \tag{94.45} $$


In the auxiliary Galilean space each mass moves with a constant velocity 
proportional to its coordinates and therefore in this space all distances increase 
proportionately with time. This is, properly speaking, only true for the auxiliary 
Galilean space, but qualitatively the conclusion that relative distances 
increase with time remains valid in the physical space also. Such an expanding 
space is thus the background which here replaces Euclidean space. The 
observable physical consequences of this at first sight paradoxical result will 
be examined in the following section. 

The quantities v_i introduced by equation (94.44) may be considered as the
coordinates of a given mass measured with a scale increasing with “time”
t. With this scale the coordinates of each mass point are constant. Therefore,
from the point of view of the equations of motion the v_i are a type of Lagrangian
coordinates (as distinct from the Eulerian coordinates x_i). We introduce into
ds² as spatial variables the three quantities v_i and as a time variable the
quantity τ which is proportional to S and given by

$$ \tau = \sqrt{t^2 - \frac{1}{c^2}\left(x_1^2 + x_2^2 + x_3^2\right)} \tag{94.46} $$


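Our transformation is then

$$ x_i = \frac{v_i \tau}{\sqrt{1 - v^2/c^2}}; \qquad t = \frac{\tau}{\sqrt{1 - v^2/c^2}} \tag{94.47} $$

Evidently relation (94.46) is identically satisfied. We have

$$ dx_i = \frac{v_i}{\sqrt{1 - v^2/c^2}}\, d\tau + \frac{\tau}{\sqrt{1 - v^2/c^2}} \left( dv_i + \frac{v_i\, v_k\, dv_k}{c^2 - v^2} \right) \tag{94.48} $$

and

$$ dt = \frac{1}{\sqrt{1 - v^2/c^2}}\, d\tau + \frac{\tau}{\sqrt{1 - v^2/c^2}}\, \frac{v_k\, dv_k}{c^2 - v^2} \tag{94.49} $$

whence

$$ c^2 dt^2 - \left(dx_1^2 + dx_2^2 + dx_3^2\right) = c^2 d\tau^2 - \tau^2 d\sigma^2 \tag{94.50} $$

where

$$ d\sigma^2 = \frac{(d\mathbf{v})^2 - [\mathbf{v} \times d\mathbf{v}]^2/c^2}{(1 - v^2/c^2)^2} \tag{94.51} $$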


This is the square of the element of arc length already known to us from the
Lobachevsky-Einstein velocity space of Section 17. In place of (94.51) we can
also write

$$ d\sigma^2 = \frac{1}{1 - v^2/c^2} \left( \delta_{ik} + \frac{v_i v_k}{c^2 - v^2} \right) dv_i\, dv_k \tag{94.52} $$
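
The equivalence of the two forms of the velocity-space arc length can be
verified numerically. The sketch below is an illustration only. It assumes that
(94.51) reads dσ² = ((dv)² - [v × dv]²/c²)/(1 - v²/c²)² and that (94.52) reads
dσ² = (1 - v²/c²)^(-1) (δ_ik + v_i v_k/(c² - v²)) dv_i dv_k; the units with
c = 1 and the sample vectors are arbitrary choices.

```python
C = 1.0  # speed of light; units with c = 1 are an assumption of this check

def dsigma2_cross(v, dv):
    """Vector form: ((dv)^2 - [v x dv]^2 / c^2) / (1 - v^2/c^2)^2."""
    v2 = sum(a * a for a in v)
    dv2 = sum(a * a for a in dv)
    cross = (v[1] * dv[2] - v[2] * dv[1],
             v[2] * dv[0] - v[0] * dv[2],
             v[0] * dv[1] - v[1] * dv[0])
    cross2 = sum(a * a for a in cross)
    return (dv2 - cross2 / C**2) / (1.0 - v2 / C**2)**2

def dsigma2_index(v, dv):
    """Index form: (delta_ik + v_i v_k/(c^2 - v^2)) dv_i dv_k / (1 - v^2/c^2)."""
    v2 = sum(a * a for a in v)
    total = 0.0
    for i in range(3):
        for k in range(3):
            delta = 1.0 if i == k else 0.0
            total += (delta + v[i] * v[k] / (C**2 - v2)) * dv[i] * dv[k]
    return total / (1.0 - v2 / C**2)

v = (0.3, -0.2, 0.5)        # sample velocity with |v| < c
dv = (0.01, 0.02, -0.03)    # sample velocity increment
```

Both functions return the same number for any v with |v| < c, which follows
from the identity [v × dv]² = v²(dv)² - (v·dv)².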

If instead of A we introduce a constant α such that

$$ S = c\tau; \qquad A = c\alpha \tag{94.53} $$

we obtain for ds²

$$ ds^2 = \left(1 - \frac{\alpha}{\tau}\right)^4 \left(c^2 d\tau^2 - \tau^2 d\sigma^2\right) \tag{94.54} $$


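The quantity τ may be called the proper time in the auxiliary Galilean space.
The physical proper time on the other hand is the quantity T determined by
the equation

$$ dT = \left(1 - \frac{\alpha}{\tau}\right)^2 d\tau; \qquad T = \tau - \frac{\alpha^2}{\tau} - 2\alpha \log\frac{\tau}{\alpha} \tag{94.55} $$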



Here the convention is to count time from that very distant epoch (some 
thousands of millions of years ago) when the galaxies observed to-day were 
much closer together than they are now. There is no sense in applying the 
above equation to yet earlier epochs because it ignores the direct gravitational 
interaction of the galaxies which could have been considerable at that time. 
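
As a quick consistency check, the closed form for T can be differentiated
numerically. The sketch below assumes that (94.55) reads dT = (1 - α/τ)² dτ
with T = τ - α²/τ - 2α log(τ/α), so that T is counted from the epoch τ = α; the
numerical values of α, τ and the step are illustrative only.

```python
import math

def proper_time(tau, alpha):
    """T = tau - alpha^2/tau - 2*alpha*log(tau/alpha); T = 0 at tau = alpha."""
    return tau - alpha**2 / tau - 2.0 * alpha * math.log(tau / alpha)

def rate(tau, alpha):
    """dT/dtau = (1 - alpha/tau)^2, the square root of the conformal factor."""
    return (1.0 - alpha / tau)**2

alpha, tau, step = 1.0, 10.0, 1e-6   # illustrative values, arbitrary units
numeric = (proper_time(tau + step, alpha)
           - proper_time(tau - step, alpha)) / (2.0 * step)
```

The central difference agrees with (1 - α/τ)², and the physical time T always
lags behind τ because the rate is less than one.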

The spatial part of ds², which corresponds to a fixed value of the universal
time T (and therefore also to fixed τ), is equal to

$$ dl^2 = \tau^2 \left(1 - \frac{\alpha}{\tau}\right)^4 d\sigma^2 \tag{94.56} $$


This describes a Lobachevsky space of constant negative curvature. Its 
volume is infinite. 

We consider the mass of those galaxies whose velocities in the auxiliary
Galilean space are within given limits. If ρ* is the invariant density this mass is

$$ dM = \rho^*\, dV \tag{94.57} $$


where dV is the volume element in the Lobachevsky space with the metric
(94.56). As is easy to calculate from (94.52) the volume element in the velocity
space is

$$ dV_\sigma = \frac{dv_1\, dv_2\, dv_3}{(1 - v^2/c^2)^2} \tag{94.58} $$

Therefore the volume element in the Lobachevsky space is

$$ dV = \tau^3 \left(1 - \frac{\alpha}{\tau}\right)^6 dV_\sigma \tag{94.59} $$

and the mass dM is

$$ dM = \rho^* \tau^3 \left(1 - \frac{\alpha}{\tau}\right)^6 dV_\sigma \tag{94.60} $$


The equation (94.33) for the invariant density may be written in the form

$$ \rho^* = \frac{3c^2 A}{2\pi\gamma\, S^3 (1 - A/S)^6} \tag{94.61} $$

whence, using (94.53) we get

$$ \rho^* = \frac{3\alpha}{2\pi\gamma\, \tau^3 (1 - \alpha/\tau)^6} \tag{94.62} $$

Inserting this expression into (94.60) we get

$$ dM = \frac{3\alpha}{2\pi\gamma}\, dV_\sigma \tag{94.63} $$

which can also be written as

$$ \gamma\, dM = \frac{3\alpha}{2\pi}\, \frac{dv_1\, dv_2\, dv_3}{(1 - v^2/c^2)^2} \tag{94.64} $$


In the study of the red shift in the spectra of distant galaxies it is customary
to characterize the shift by the speed v* which would produce it if it could be
ascribed entirely to Doppler effect in ordinary Galilean space. From the theory
of the red shift which will be developed in the following section it follows that,
as long as it is small compared to the speed of light, this effective speed is related
to the speed discussed here by the equation:

$$ v^* = \frac{\tau + \alpha}{\tau - \alpha}\, v \tag{94.65} $$

With the aid of this relation the previous equations may be more easily
interpreted. If we put

$$ \alpha^* = \alpha \left(\frac{\tau - \alpha}{\tau + \alpha}\right)^3 \tag{94.66} $$

equation (94.64) can be written approximately as

$$ \gamma\, dM = \frac{3\alpha^*}{2\pi}\, dv_1^*\, dv_2^*\, dv_3^* \tag{94.67} $$

Integrating over solid angle and over the absolute magnitude of the effective
speed, we obtain the relation

$$ \gamma\,(M - M_0) = 2\alpha^* \left(v^{*3} - v_0^{*3}\right) \tag{94.68} $$

Hence we can get an estimate of α* from an estimate of the mass M - M_0
of those galaxies which have effective Doppler speeds in the range between
v_0* and v*.


95. Theory of the Red Shift 

The hypothesis that the structure of very large regions of space containing 
many galaxies is of the nature of the structure of Friedmann-Lobachevsky 
space found unexpected confirmation in the phenomenon of the red shift in 
the spectra of galaxies discovered by the astronomer Hubble [25]. It was 
shown that all the lines of the galactic spectra are shifted towards the red 
end of the spectrum and that this shift is the greater the more distant the 
galaxy. 

In this section we shall give a theory of the red shift based on the hypothesis 
mentioned. 

We first write the expression for ds² in the form

$$ ds^2 = \left(1 - \frac{\alpha}{\tau}\right)^4 \left(c^2 dt^2 - dx^2 - dy^2 - dz^2\right) \tag{95.01} $$

Here α is a positive constant and τ is given by

$$ \tau = \sqrt{t^2 - \frac{r^2}{c^2}}; \qquad r^2 = x^2 + y^2 + z^2 \tag{95.02} $$


Expression (95.01) actually contains not one constant but two because the
origin of time reckoning or, equivalently, the value of τ for the present epoch
remains unknown. In principle, the theory of the red shift makes it possible
to determine these two constants by observation. Also, the theory permits of
verification if the mean density ρ* of matter in space is known.

The change of the frequency of light arriving from a distant star arises as
a result of two effects: the Einstein effect (Section 51) which is operative
when light passes from a region with one value of the gravitational potential
g_00 to a region with another value, and the Doppler effect (Section 13) which is
operative when the radiating system is in motion.

The fact that Friedmann-Lobachevsky space is conformally-Galilean
considerably simplifies the discussion of the law of light propagation. For in a
space having the squared line element (95.01) the law for the propagation of
a wave front has the form

$$ \frac{1}{c^2}\left(\frac{\partial \omega}{\partial t}\right)^2 - (\operatorname{grad}\,\omega)^2 = 0 \tag{95.03} $$


which is exactly the same as the corresponding law in Galilean space-time.
The equations of the characteristics will therefore be the same as in the Galilean
case; in particular the surfaces

$$ t - \frac{r}{c} = \text{const}; \qquad t + \frac{r}{c} = \text{const} \tag{95.04} $$

are characteristics. The entire discussion related to the law of wave front
propagation and the form of the characteristics can be taken over without
change from the Galilean case to that of conformally-Galilean space-time.
In particular the whole theory of the Doppler effect remains unchanged.

After these preliminary remarks we go on to derive the equations for the 
change of the frequency of light. 

We shall use a coordinate system with its origin in the Sun or on the Earth
(it is immaterial which we choose because of the smallness of the relative
distance of Sun and Earth compared with the distance from other galaxies).
Assume that at t = t_0 light is emitted by a distant star which then is located
at the distance r = r_0. The value of τ_0 corresponding to time t_0 and distance
r_0 is

$$ \tau_0 = \sqrt{t_0^2 - \frac{r_0^2}{c^2}} \tag{95.05} $$

according to (95.02). By the equation of the characteristic,

$$ t + \frac{r}{c} = t_0 + \frac{r_0}{c} \tag{95.06} $$

this light reaches the Earth (r = 0) at the instant t = t_0 + r_0/c which
corresponds to a value of τ given by

$$ \tau = t_0 + \frac{r_0}{c} \tag{95.07} $$

From this equation and (95.05) we get

$$ \frac{\tau_0}{\tau} = \sqrt{\frac{t_0 - (r_0/c)}{t_0 + (r_0/c)}} \tag{95.08} $$


On the other hand, if the emitting star moves with the velocity corresponding
to the mean velocity of matter in Friedmann-Lobachevsky space, then,
according to (94.34)

$$ \frac{r_0}{t_0} = v; \qquad (v > 0) \tag{95.09} $$

where v is the constant speed of the star in the auxiliary Galilean space. From
(95.08) and (95.09) we obtain

$$ \frac{\tau_0}{\tau} = \sqrt{\frac{1 - v/c}{1 + v/c}} \tag{95.10} $$


We now consider the frequency change. Let ω_0 be the proper frequency
of the emitter and ω*_0 the frequency in the auxiliary Galilean space in the
frame of reference of the emitter. The two quantities are connected by the
relation

$$ \omega_0^* = \left(1 - \frac{\alpha}{\tau_0}\right)^2 \omega_0 \tag{95.11} $$

The conversion factor is the value derived from (95.01) of the quantity
√(g_00)/c at the time and place of emission. In the auxiliary Galilean space an
emitter of frequency ω*_0 is moving with speed v away from the Earth. The
frequency ω* received on the Earth is then obtained by the usual theory of the
Doppler effect and is

$$ \omega^* = \omega_0^* \sqrt{\frac{1 - v/c}{1 + v/c}} \tag{95.12} $$

According to (95.10) this relation may be written in the form

$$ \tau \omega^* = \tau_0\, \omega_0^* \tag{95.13} $$


It now remains to pass from the frequency ω* in the auxiliary Galilean space
to the frequency ω in physical space. The transition is effected by an equation
analogous to (95.11); but the value of the quantity √(g_00)/c has to be taken
for the place and instant of observation. The relevant equation is

$$ \omega^* = \left(1 - \frac{\alpha}{\tau}\right)^2 \omega \tag{95.14} $$

From (95.11), (95.13) and (95.14) we get finally

$$ \tau \left(1 - \frac{\alpha}{\tau}\right)^2 \omega = \tau_0 \left(1 - \frac{\alpha}{\tau_0}\right)^2 \omega_0 \tag{95.15} $$


Here, ω_0 is the proper frequency of the emitter, e.g. of a hydrogen atom, and
ω the frequency of the light reaching the Earth from an identical emitter on a
distant star. Since τ > τ_0 and the factor of ω in (95.15) is an increasing function
of τ, we have ω < ω_0; thus the frequency decreases and the displacement of a
visible spectral line is therefore towards the red end of the spectrum.
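
The way the three steps combine can be traced in a few lines of code. This is a
sketch under stated assumptions, not the author's computation: it takes (95.11)
and (95.14) to be multiplications by the factor (1 - α/τ)² at emission and at
reception, (95.12) to be the usual longitudinal Doppler factor, and (95.15) to be
τ(1 - α/τ)² ω = τ_0(1 - α/τ_0)² ω_0. The sample values of v/c, τ and α are
arbitrary.

```python
import math

def ratio_closed_form(tau, tau0, alpha):
    """omega0/omega from tau(1-a/tau)^2 omega = tau0(1-a/tau0)^2 omega0."""
    return (tau * (1.0 - alpha / tau)**2) / (tau0 * (1.0 - alpha / tau0)**2)

def ratio_step_by_step(v_over_c, tau, alpha):
    """Build omega0/omega from the Einstein and Doppler steps in turn."""
    doppler = math.sqrt((1.0 - v_over_c) / (1.0 + v_over_c))
    tau0 = tau * doppler                 # (95.10): tau0/tau is the Doppler factor
    omega0 = 1.0                         # proper frequency of the emitter
    omega_star0 = (1.0 - alpha / tau0)**2 * omega0   # Einstein factor at emission
    omega_star = doppler * omega_star0               # Doppler shift, auxiliary space
    omega = omega_star / (1.0 - alpha / tau)**2      # Einstein factor at reception
    return omega0 / omega, tau0

ratio, tau0 = ratio_step_by_step(0.3, 10.0, 1.0)   # illustrative inputs
```

With these inputs the step-by-step ratio coincides with the closed form and
exceeds one, i.e. the received frequency is lower than the emitted one.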

In our proof the inclusion of the change of √(g_00) in passing from the star to
the Earth corresponds to taking the Einstein effect into account whereas the
transformation according to equation (95.12) takes account of the Doppler
effect.

Let us now investigate the change in the amplitude of a light wave travelling 
from the star to the Earth. The distance of a star can be judged from the energy 
current density which is proportional to the square of the amplitude, if stars 
of the same radiative power are compared. (Novae at their period of maximum 
brightness may be assumed to have such equal power.) 

In a space which has (94.01) for its squared line element the wave equation
has the form

$$ \frac{\partial}{\partial x_0}\left(H^2 \frac{\partial \psi}{\partial x_0}\right) - \sum_{i=1}^{3} \frac{\partial}{\partial x_i}\left(H^2 \frac{\partial \psi}{\partial x_i}\right) = 0 \tag{95.16} $$

and if we put

$$ \psi = \frac{\psi^*}{H} \tag{95.17} $$

where ψ* is a new function, the wave equation may be written as

$$ H\,\Box\psi^* - \psi^*\,\Box H = 0 \tag{95.18} $$

Here □ is the Galilean d’Alembert operator, so that

$$ \Box\psi^* = \frac{1}{c^2}\frac{\partial^2 \psi^*}{\partial t^2} - \Delta\psi^* \tag{95.19} $$

The function H varies incomparably more slowly than ψ*, therefore the second
term in (95.18) may be discarded and the wave equation may be written as

$$ \Box\psi^* = 0 \tag{95.20} $$

(this approximation has no effect at all on the expression for the energy current
in the wave zone, which alone interests us).

We now take the origin of coordinates to be on the star and consider a
spherical wave radiating from there. The quantity ψ* will then have the form

$$ \psi^* = \frac{1}{r}\, f\!\left(t - \frac{r}{c}\right) \tag{95.21} $$

and therefore

$$ \psi = \frac{1}{r\,(1 - \alpha/\tau)^2}\, f\!\left(t - \frac{r}{c}\right) \tag{95.22} $$

The square of the amplitude of ψ will be a quantity proportional to the energy
current density I. Therefore we may put

$$ I = \frac{B}{r^2 (1 - \alpha/\tau)^4} \tag{95.23} $$
where B is a positive constant equal to the energy flux into unit solid angle.
Indeed, it follows from the expression for ds² which appears in spherical polar
coordinates as

$$ ds^2 = \left(1 - \frac{\alpha}{\tau}\right)^4 \left\{c^2 dt^2 - dr^2 - r^2\left(d\vartheta^2 + \sin^2\vartheta\, d\varphi^2\right)\right\} \tag{95.24} $$

that the surface element on a sphere about the source is†

$$ dS = r^2 \left(1 - \frac{\alpha}{\tau}\right)^4 \sin\vartheta\, d\vartheta\, d\varphi \tag{95.25} $$

Hence the flux of energy through an element of the surface S is

$$ I\, dS = B \sin\vartheta\, d\vartheta\, d\varphi \tag{95.26} $$

which shows that B is the flux through unit solid angle.

† Of course, the quantity S in (95.25) has nothing to do with the S of (94.02).

The quantity B is constant in the frame of reference in which the wave
equation and its solution (95.22) have been stated. But it must be remembered
that B is a dimensioned quantity whose value depends on the choice of units
and of the standards used for its measurement; in the general case the passage
to a new frame of reference has an effect on the standards. The dimensions
of an energy flux are energy divided by time, or in other words, action times
frequency squared. Action is an invariant and the scale of action does not
change in the passage from one frame of reference to another. Therefore the
energy current changes in such a transition in the same way as the square of a
frequency. If the frequency of a quantum emitted on the star and reaching
some given region of space is taken as the standard of frequency, the energy
current will be numerically constant. But if one takes in every region of space
a “local” standard of frequency (for instance the frequency of a quantum
corresponding to the same spectral transition but emitted in the region in
question and not on the star) the numerical value of the energy flux will
change in the ratio ω² : ω_0². Naturally measurements made on the Earth
employ “terrestrial” and not “stellar” standards, and this corresponds to
the second point of view. Therefore, expressing the energy flux and its density
in “terrestrial” units we must put‡

$$ B = B_0\, \frac{\omega^2}{\omega_0^2} \tag{95.27} $$

and

$$ I = \frac{B_0\, \omega^2}{\omega_0^2\, r^2 (1 - \alpha/\tau)^4} \tag{95.28} $$

where B_0 is the value of the energy flux into unit solid angle near the star;
for stars with equal emissive power (i.e. with equal absolute stellar magnitude)
the value of B_0 is the same. Putting

$$ I = \frac{B_0}{R^2} \tag{95.29} $$

where R is the “distance” determined by the apparent stellar magnitude
(brightness) we have

$$ R = \frac{\omega_0}{\omega}\, r \left(1 - \frac{\alpha}{\tau}\right)^2 \tag{95.30} $$

‡ This result agrees with the conclusion of Landau and Lifshitz [23], which they reached
from other considerations.


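In this equation r is the distance, in the auxiliary Galilean space, from the
star to the Earth in the frame of reference attached to the star. We express
r in terms of τ and τ_0. In the frame of reference mentioned we have

$$ \text{for } t = t_0:\quad r = 0 \text{ and } \tau = \tau_0 = t_0; \qquad \text{for } t = t_0 + \frac{r}{c}:\quad r = r \text{ and } \tau = \sqrt{\left(t_0 + \frac{r}{c}\right)^2 - \frac{r^2}{c^2}} \tag{95.31} $$

whence

$$ \frac{r}{c} = \frac{\tau^2 - \tau_0^2}{2\tau_0} \tag{95.32} $$

Inserting this value of r into (95.30) we get

$$ \frac{R}{c} = \frac{\omega_0}{\omega}\, \frac{\tau^2 - \tau_0^2}{2\tau_0} \left(1 - \frac{\alpha}{\tau}\right)^2 \tag{95.33} $$

Also, equation (95.15) gives

$$ \frac{\omega_0}{\omega} = \frac{\tau\,(1 - \alpha/\tau)^2}{\tau_0\,(1 - \alpha/\tau_0)^2} \tag{95.34} $$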


The values of ω_0/ω and of R for different galaxies are connected by the relation
which can be obtained from these two equations by eliminating τ_0. To perform
this elimination conveniently we introduce another auxiliary quantity† y in
place of τ_0 to which it is related by

$$ y = \frac{(\tau - \tau_0)(\tau + \alpha)}{\tau\,(\tau_0 - \alpha)} \tag{95.35} $$

and

$$ \tau_0 = \frac{\tau\,(\tau + \alpha + \alpha y)}{\tau + \alpha + \tau y} \tag{95.36} $$

For various galaxies τ_0 may vary within the limits

$$ \alpha < \tau_0 \le \tau \tag{95.37} $$

so that the limits of y will be

$$ 0 \le y < \infty \tag{95.38} $$

Inserting the value of τ_0 into (95.34) we get

$$ \frac{\omega_0}{\omega} = \frac{(\tau + \alpha + \tau y)(\tau + \alpha + \alpha y)}{(\tau + \alpha)^2} \tag{95.39} $$

or

$$ \frac{\omega_0}{\omega} = 1 + y + \frac{\tau\alpha}{(\tau + \alpha)^2}\, y^2 \tag{95.40} $$

† There should be no danger of confusion between this quantity and one of the
coordinates in (95.01) and (95.02).

Also, inserting τ_0 from (95.36) and ω_0/ω from (95.39) into the equation (95.33)
we get

$$ \frac{R}{c} = \frac{(\tau - \alpha)^3}{\tau\,(\tau + \alpha)} \left(y + \tfrac{1}{2} y^2\right) \tag{95.41} $$

We put

$$ h = \frac{\tau\,(\tau + \alpha)}{(\tau - \alpha)^3} \tag{95.42} $$

and recall relation (94.62) for the invariant density ρ*, by which

$$ \rho^* = \frac{3\alpha\tau^3}{2\pi\gamma\,(\tau - \alpha)^6} \tag{95.43} $$

Then we have

$$ \frac{2\pi\gamma\rho^*}{3h^2} = \frac{\tau\alpha}{(\tau + \alpha)^2} \tag{95.44} $$


The quantity h has the dimensions of inverse time and the quantity (95.44)
is a dimensionless number. Replacing τ and α in (95.41) and (95.39) by these
quantities we get

$$ \frac{Rh}{c} = y + \tfrac{1}{2} y^2; \qquad \frac{\omega_0 - \omega}{\omega} = y + \frac{2\pi\gamma\rho^*}{3h^2}\, y^2 \tag{95.45} $$


These equations embrace the whole theory of the red shift. 
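
The elimination of τ_0 in favour of y can be checked numerically. The sketch
below is illustrative only. It assumes the forms y = (τ - τ_0)(τ + α)/(τ(τ_0 - α))
for (95.35), h = τ(τ + α)/(τ - α)³ for (95.42), R/c = (ω_0/ω)(τ² - τ_0²)/(2τ_0) ·
(1 - α/τ)² for (95.33), and (95.34) as the ratio of the factors τ(1 - α/τ)²; the
sample values of τ, τ_0 and α are arbitrary.

```python
def y_parameter(tau, tau0, alpha):
    """Auxiliary quantity y replacing tau0 (assumed form of (95.35))."""
    return (tau - tau0) * (tau + alpha) / (tau * (tau0 - alpha))

def hubble_h(tau, alpha):
    """h = tau (tau + alpha) / (tau - alpha)^3 (assumed form of (95.42))."""
    return tau * (tau + alpha) / (tau - alpha)**3

def direct(tau, tau0, alpha):
    """R/c and omega0/omega computed directly from tau and tau0."""
    ratio = tau * (1 - alpha / tau)**2 / (tau0 * (1 - alpha / tau0)**2)
    r_over_c = ratio * (tau**2 - tau0**2) / (2 * tau0) * (1 - alpha / tau)**2
    return r_over_c, ratio

def parametric(tau, tau0, alpha):
    """The same two quantities from the parametric equations in y."""
    y = y_parameter(tau, tau0, alpha)
    k = tau * alpha / (tau + alpha)**2        # coefficient of y^2, see (95.44)
    r_over_c = (y + 0.5 * y * y) / hubble_h(tau, alpha)
    return r_over_c, 1.0 + y + k * y * y

tau, tau0, alpha = 10.0, 6.0, 1.0             # sample values, alpha < tau0 < tau
```

For any α < τ_0 ≤ τ the two routes give the same distance and the same
frequency ratio, which is the content of the parametric equations.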

The general trend of the dependence of the shift (ω_0 - ω)/ω on distance is
the following.

For small shifts (small y) we can take the right-hand sides of the two equations
to be equal. Then we have

$$ c\, \frac{\omega_0 - \omega}{\omega} = Rh \tag{95.46} $$

and the relation between the red shift (ω_0 - ω)/ω and the distance R is linear.
The coefficient of proportionality in this linear relation may be determined
from observation and this gives a value [25] for the reciprocal of h, of the
order

$$ \frac{1}{h} = 4 \times 10^9 \text{ years} \tag{95.47} $$

(The earlier literature gave a value of 2 × 10^9 years for 1/h; the difference is due
to an inaccuracy in the earlier estimates of the distance R.) The value (95.47)
corresponds to the following value for h

$$ h = 8 \times 10^{-18}\ \text{sec}^{-1} \tag{95.48} $$

For large shifts, i.e. very large distances, the nonlinear terms in equations 
(95.45) must have an effect. Their influence must show itself in the fact that 
the shift increases with distance more slowly than initially ; the quantity h 
determined from (95.46) by observations on the furthest galaxies must prove 
to be somewhat smaller than if it is determined from observations on nearer 
galaxies. At the greatest distances, which probably cannot be observed in 
practice, the relation must again tend towards the linear, but with a smaller 
factor of proportionality. 
In astronomy it is customary to call the left-hand side of (95.46) a speed
and to express it in kilometres per second. However, it is necessary to remember
that although this quantity has the dimensions of a speed it is not the speed
of motion of anything. This is evident, for instance, from the fact that if
ω < ½ω_0 its magnitude exceeds the speed of light. Also the quantity c(ω_0 - ω)/ω
is not a Doppler speed even if the whole shift is interpreted as the result of a
Doppler effect in Galilean space. It follows from the ordinary theory of the
Doppler effect (see (95.12)) that the effective Doppler speed would be the
quantity

$$ v^* = c\, \frac{\omega_0^2 - \omega^2}{\omega_0^2 + \omega^2} \tag{95.49} $$

which is close to the quantity on the left-hand side of (95.46) only if the shift
is small.
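
The difference between the two "speeds" is easy to see numerically. The sketch
below assumes that (95.49) is v* = c(ω_0² - ω²)/(ω_0² + ω²), which follows from
inverting the Doppler formula (95.12); the function names are hypothetical and
the argument is the ratio ω_0/ω.

```python
C_KM_S = 299792.458  # speed of light in km/s

def astronomers_speed(ratio):
    """c (omega0 - omega)/omega, the left-hand side of (95.46)."""
    return C_KM_S * (ratio - 1.0)

def doppler_speed(ratio):
    """c (omega0^2 - omega^2)/(omega0^2 + omega^2), assumed form of (95.49)."""
    return C_KM_S * (ratio**2 - 1.0) / (ratio**2 + 1.0)

small = (astronomers_speed(1.001), doppler_speed(1.001))   # small shift
large = (astronomers_speed(2.5), doppler_speed(2.5))       # omega < omega0/2
```

For a small shift the two values nearly coincide; once ω falls below ½ω_0 the
first quantity exceeds c while the Doppler speed stays below it.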

The equations (95.45) are rigorous and give the dependence of the shift on
distance for any value of the former. It is possible that in time the accuracy
of observations will increase so much that their analysis on the basis of these
equations will make it possible to study the coefficient of y² in the second
equation of (95.45). This will then make it possible to obtain from observations
on the red shift not only the constant h (known as Hubble’s constant) but also
the mean density ρ*. At present, however, ρ* can only be determined quite
roughly by counting the number of galaxies in sufficiently large volumes of
space on the assumption that each galaxy has the same mass as our Galaxy
(of the order of 10^11 solar masses or 2 × 10^44 g). Such an estimate gives a density
of the order

$$ \rho^* = 4 \times 10^{-29}\ \text{g/cm}^3 \tag{95.50} $$


A more vivid idea of the magnitude of this density can be given as follows:
The volume of the terrestrial globe is 1.08332 × 10^27 cm³ or, roughly, 10^27 cm³.
In space a volume of this size contains in the mean 0.04 g of matter.†

† We stress once again that the quantity ρ* is known with very small accuracy; its true
value may be many times greater or smaller than the value given.

If ρ* and h are assumed to be known the value of the dimensionless constant

$$ b = \frac{8\pi\gamma\rho^*}{3h^2} \tag{95.51} $$

may also be determined. This constant appears both in the equation (95.45),
which may be written as

$$ \frac{Rh}{c} = y + \tfrac{1}{2} y^2; \qquad \frac{\omega_0 - \omega}{\omega} = y + \tfrac{1}{4}\, b\, y^2 \tag{95.52} $$

and in the relation (95.44), which gives

$$ b = \frac{4\tau\alpha}{(\tau + \alpha)^2} \tag{95.53} $$

If we take as values

$$ h = 8 \times 10^{-18}\ \text{sec}^{-1}; \qquad \rho^* = 4 \times 10^{-29}\ \text{g/cm}^3; \qquad \gamma = \tfrac{2}{3} \times 10^{-7}\ \text{cm}^3/(\text{g sec}^2) \tag{95.54} $$

we get

$$ b = \frac{\pi}{9} \approx \frac{1}{3} \tag{95.55} $$

It is then possible to use (95.53) to determine the ratio τ/α, namely

$$ \frac{\tau}{\alpha} = 10 \tag{95.56} $$

Hence and from (95.42) we have

$$ h\tau = \frac{1.1}{(0.9)^3} = 1.50; \qquad h\alpha = 0.15 \tag{95.57} $$

and, using the value of 1/h given by (95.47) we get

$$ \tau = 6 \times 10^9 \text{ years}; \qquad \alpha = 6 \times 10^8 \text{ years} \tag{95.58} $$

and from (94.55)

$$ T = 3.2 \times 10^9 \text{ years} \tag{95.59} $$
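
The chain of numerical estimates above can be reproduced step by step. This is
a minimal sketch rather than the author's own calculation. It assumes
b = 8πγρ*/(3h²) with the input values h = 8 × 10^-18 sec^-1,
ρ* = 4 × 10^-29 g/cm³ and γ = (2/3) × 10^-7 cm³/(g sec²), the relation
b = 4τα/(τ + α)², the form h = τ(τ + α)/(τ - α)³, and
T = τ - α²/τ - 2α log(τ/α); the Julian year used for the unit conversion is an
extra assumption.

```python
import math

SECONDS_PER_YEAR = 365.25 * 24 * 3600   # Julian year (assumed convention)

h = 8e-18                    # sec^-1
rho = 4e-29                  # g/cm^3
gamma = (2.0 / 3.0) * 1e-7   # cm^3/(g sec^2)

b = 8 * math.pi * gamma * rho / (3 * h**2)   # dimensionless constant b

def tau_over_alpha(b):
    """Larger root x = tau/alpha of b = 4x/(1 + x)^2; real only for b < 1."""
    return (2 - b + 2 * math.sqrt(1 - b)) / b

x = 10.0                                  # rounded ratio tau/alpha used in the text
h_tau = x**2 * (x + 1) / (x - 1)**3       # h*tau from h = tau(tau+alpha)/(tau-alpha)^3
inv_h_years = 4e9                         # 1/h in years
tau_years = h_tau * inv_h_years
alpha_years = tau_years / x
T_years = (tau_years - alpha_years**2 / tau_years
           - 2 * alpha_years * math.log(tau_years / alpha_years))
h_from_years = 1.0 / (inv_h_years * SECONDS_PER_YEAR)   # back to sec^-1
```

With b rounded to 1/3 the ratio τ/α comes out close to 9.9, which the text
rounds to 10; the resulting τ, α and T agree with the quoted values to the
stated one or two significant figures.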


The following should be borne in mind. The ratio τ/α given by equation (95.53)
is real only if b is less than unity, i.e. if for given h the density ρ* is sufficiently
small, as in fact appears to be the case. There exist also solutions of Einstein’s
equations corresponding to b > 1, but for these the conformally-Galilean
coordinates we have used are inapplicable because they become complex.
It is true that one can introduce other coordinates that are real, but the
expression that one gets for ds² when b > 1 leads to such strange notions concerning
the properties of space and time (a finite volume varying periodically with
time, etc.) that one can hardly ascribe physical meaning to this case.
Therefore we shall not discuss the solution for the case b > 1. (This solution was
also found by Friedmann.)

As for the solution we have discussed, which corresponds to b < 1 (the
Friedmann-Lobachevsky space), the following remarks should be made
concerning it.

In the first place it is incorrect to see in it some kind of a model of “ the 
Universe as a whole ” : such a point of view is unsatisfactory in a philosophical 
respect. At most Friedmann-Lobachevsky space can serve as the background 
for a limited number of galaxies just as Galilean space serves as a background 
for objects such as the Solar system. 

The actual applicability of Einstein’s equations in their classical form to 
such enormous dimensions is not so well established as in the case of phenomena 
on a more limited scale. It is not out of the question that on a cosmic scale 
these equations will require modification and generalization. 

Among the consequences of the solution that we have discussed the most 
paradoxical is probably the relative shortness of the time T which has elapsed 
since the galaxies were close together. This time is comparable to the age of the 
Earth’s crust as determined by the radioactive decay of uranium, while it 
would have been natural to expect the “ cosmic ” scale of time to be much 
greater than the “terrestrial”.

The fact of the “dispersal” of the galaxies, in all its unexpectedness, is so
well established, however, that one need not have any doubts about it. The 
number of binary stars which is abnormally large from the point of view of 
stellar statistics, also indicates that in the past the density not only of galaxies 
but also of stars was probably much greater. 

In judging the theory it is also necessary to bear in mind that it is a successful
attempt to resolve the Seeliger paradox referred to in Section 94 which arises 
when one studies a uniform mass density in Euclidean space. 

It is therefore beyond doubt that the theory due to A. A. Friedmann which 
we have described is an important step in the study of space on a cosmic scale. 

96. The Development of the Theory of Gravitation and of the Motion of 
Masses (A Critical Survey) 

Einstein’s basic paper that laid the foundations of the theory of Galilean
space and time made its appearance in 1905 under the heading “On the
Electrodynamics of Moving Bodies”. This paper linked together for the first time the
principle of the constancy of the speed of light and the principle of relativity,
according to which corresponding physical processes exist, and proceed
identically, in any two inertial frames. To make these two principles compatible it
proved necessary to re-examine the notions of space and of time and this led
to a theory of a unified space-time. This theory was given the name of “Theory
of Relativity”. It is worth noticing that this name did not yet appear in the
title of Einstein’s paper.

We know now that the principle of relativity reflects the uniformity of
space-time: either the total uniformity of Galilean space, or, in a generalized theory,
uniformity at infinity, which under certain conditions allows one to introduce
harmonic coordinates. In both cases the principle of relativity is expressed by
the existence of the group of Lorentz transformations within a definite class
of coordinate systems.

During the period when Einstein created his theory of relativity (before his
theory of gravitation) the idea that it is possible for physical processes to
influence the space-time metric had not yet arisen; in other words, it was then
tacitly assumed that the metric was rigid. But if this assumption is made, the
principle of relativity expressed in its most general form (in arbitrary
coordinates) leads to uniform space, and if one makes the additional assumption that
there is no absolute scale, it leads just to Galilean space (Section 49*).

The basic idea of the theory of relativity consists precisely in establishing
the metric of space-time and in pronouncing the requirement that the form of
the laws of physics should correspond to this metric; since the metric expresses
the uniformity of space-time the form of the laws of physics must take into
account this uniformity. This requirement means that the equations expressing
physical laws must be covariant with respect to Lorentz transformations. The
requirement mentioned has a very great heuristic significance because it severely
limits the possible form of physical laws. It is, therefore, not surprising that
the applications of the Theory of Relativity affect all branches of physics and
are too numerous to list.

In the period between 1905 and 1915 Einstein created his Theory of
Gravitation. In 1916 he published his fundamental paper under the heading “The
Principles of the General Theory of Relativity”. This title reflects Einstein’s
point of view regarding his theory, a point of view which we cannot accept as
correct. As we have already shown in the Introduction, the basic idea
of Einstein’s theory of gravitation is by no means that the principle of
relativity is generalized for accelerated motion; such a generalization is
not possible. The basic idea consists in assuming that physical processes can
influence the space-time metric and in recognizing the unity of metric and
gravitation. These ideas could be formulated with the help of the formalism of
Riemannian geometry. As to the relativity principle, one may say that it does
not play any essential part in the theory of gravitation. The concept of
“relativity of motion” is restricted rather than widened in this theory. This
is quite natural since the inclusion of the gravitational field in the metric field
destroys the uniformity of space-time, which is only locally restored by the
application of the equivalence principle. The local analogue of an inertial frame
of reference, the “freely falling” frame, is only an approximate realization of
the former. On the other hand, the harmonic coordinates, though not local, are
only partly analogous to inertial frames.

The fact that the theory of gravitation, a theory of such amazing depth,
beauty and cogency, was not correctly understood by its author, should not
surprise us. We should also not be surprised at the gaps in logic, and even
errors, which Einstein permitted himself when he derived the basic equations
of the theory. In the history of physics we have many examples in which the
underlying significance of a fundamentally new physical theory was realized
not by its author but by somebody else and in which the derivation of the
basic equations proposed by the author proved to be logically inconsistent.
It is sufficient to point to Maxwell’s theory of the electromagnetic field. This
theory actually put an end to the conception of mechanics as the basis of
physics, but its author and also Hertz, who did so much in verification of the
theory, wholly accepted a mechanical picture. Only Lorentz established with
full clarity the physical sense of Maxwell’s equations, showing that the
electromagnetic field itself is a physical reality (i.e. is itself material), being capable
of existence in free space and not requiring a special carrier.†

† It is interesting that this example of an author not fully understanding the physical
meaning of his own theory is given by Einstein himself. (Autobiography [26].)

As for the cogency of the derivation, we can recall that Maxwell [27] prefaces
the derivation of his famous equations by the exposition of some chapters in
mechanics and bases the derivation itself on Lagrange’s equations (of the second
kind). We know now that from the point of view of logic, a derivation of
Maxwell’s equations on the basis of mechanics is not possible. But great,
and not only great, discoveries are not made by the rules of logic, but by
guesswork, or in other words by creative intuition.

It is interesting to trace the path along which Einstein reached his equations
of gravitation. This can be done by using Einstein’s fundamental paper of
1916 together with his other scientific work and his autobiography published
in 1949 on the occasion of his seventieth birthday [26].

As early as 1908 it became clear to Einstein that any attempt to introduce
gravitation within the framework of the theory of relativity in Galilean space
could not be successful for the following reason. Such a theory predicts a
dependence of the inertial mass on the internal (and kinetic) energy of a body,
but cannot give a dependence of gravitational mass on energy as long as the
gravitational equations are taken to be linear. At the same time one has the
fundamental fact that inertial and gravitational masses are equal, as is proved
by the fact that the acceleration of a freely falling body is independent of its
composition. Einstein interpreted this fact in the sense of the local equivalence
of fields of acceleration and of gravitation.

Hence Einstein concluded that if “accelerated frames of reference” are
accepted gravitation and acceleration become indistinguishable. In his
Autobiography Einstein writes: “If therefore gravitational fields of arbitrary
extension, not restricted a priori by spatial boundary conditions, are regarded
as possible the concept of an inertial frame becomes completely empty. The
notion of ‘acceleration relative to space’ then loses all meaning; so does the
Principle of Inertia† and with it Mach’s paradox.”‡

Further, Einstein concludes from the equivalence of all reference frames 
(inertial and non-inertial) that the gravitational equations must be covariant 
with respect to arbitrary coordinate transformations. 

Such were the ideas that represented Einstein’s starting point in 1908. 
Let us analyse them. In the first place there is here an incorrect initial
assumption. Einstein speaks of arbitrary gravitational fields extending as far as one
pleases and not limited by boundary conditions. Such fields cannot exist. 
Boundary conditions or similar conditions which characterize space as a whole 
are absolutely essential and thus the notion of “ acceleration relative to space ” 
retains its significance in some form or another. As for Mach’s paradox, it is 
based, as is well known, on the consideration of a rotating body which has 
ellipsoidal form and a non-rotating body which is spherical. A paradox arises 
here only if one denies a meaning to the notion of “ rotation relative to space ” ; 
both bodies, that which rotates and that which does not, then appear to be 
equivalent and it cannot be understood why one of them is spherical and not the 
other. But the paradox vanishes as soon as one accepts the legitimacy of the 
notion of “ acceleration relative to space ”. 

The essence of the error committed in the initial assumption consists in 
forgetting that the nature of the equivalence of fields of acceleration and of gravitation 
is strictly local. Related to this is the fact that a non-local physical definition 
of an accelerated frame of reference is impossible, because a box, rigid scaffolding, 
etc., of the kind used by Einstein represents an idealization which is only 
applicable in inertial frames of reference, and not in accelerated frames. But 
physically the “ general principle of relativity ”, in the sense that corresponding 
processes exist in arbitrary reference frames, does not hold at all. Therefore 
Einstein’s conclusion that all reference frames are physically equivalent is
without foundation. But this is the assumption on which Einstein bases his 
further arguments, in particular his derivation of the covariance of the required 
equations of gravitation. Naturally it does not follow from the incorrectness of 
his assumptions that the conclusions drawn are incorrect: the equations of 
gravitation are indeed covariant, but their covariance is not their distinctive
property; it could be achieved with any theory. Thus in Chapter IV of this
book we formulated the ordinary theory of relativity in a covariant manner;
it is interesting that Einstein himself points out the possibility of such
formulation [26], a fact to which we shall return shortly.

† Newton’s First Law (V.A.F.).

‡ The above quotation is a direct translation from the German text by N.K.

Thus at the first stage of constructing a theory of gravitation Einstein 
accepted as an initial assumption the equality of inertial and gravitational mass, 
the principle of the equivalence of acceleration and gravity and the
requirement of covariance for the equations. The idea of Riemannian geometry and
the idea of the unity of metric and gravitation were as yet absent. 

The requirement that the equations be covariant played a very large part 
in the construction of the theory. The circumstance that Einstein “ derived ” 
it from the principle of equivalence and ultimately from the equality of inertial 
and gravitational mass shows that at the time he was creating his theory 
Einstein considered the general covariance of the equations to be a specific 
peculiarity of the theory of gravitation. This is said explicitly in the
introductory part of the fundamental paper of 1916. Subsequently it was pointed
out to Einstein that general covariance by itself does not express any physical
law and, apparently, he agreed with this. But his agreement was rather formal,
because in fact to the end of his days Einstein connected the requirement of 
general covariance with the idea of some kind of “ general relativity ” and with 
the equivalence of all frames of reference. He never realized the difference
between physical equivalence (or physical relativity) in the sense of the existence
of corresponding processes in different frames of reference and that formal
equivalence which consists in the possibility of using arbitrary coordinate
systems independently of whether corresponding physical processes do or do
not exist in the systems concerned. In the theory of Galilean space-time
physical relativity is directly connected with the uniformity of the space. In
the theory of gravitation, where space-time is non-uniform and its metric
depends on the processes occurring in it, things are somewhat more complicated.
But here also the existence of a kind of relativity within a class of harmonic 
coordinate systems is a result of the uniformity of the space at infinity (see 
Sections 49* and 93). Thus in all cases where there is physical relativity, it is 
connected with uniformity, i.e. with objective properties of space-time and not 
at all with the use of this or that coordinate system. (The choice of coordinate 
system has an effect only on the simplicity of the way physical relativity can 
be expressed: it is expressible most simply in such coordinates in which physically 
equivalent systems are related by linear transformations.) The mathematical 
expression for physical relativity does not reduce to the covariance of differential 
equations: the latter is a necessary but not a sufficient condition for the existence 
of the former. But Einstein identified physical relativity with the covariance 
of differential equations, actually believing that general covariance is an
extension of the concepts of the physical relativity that formed the basis of his
1905 theory. This is the origin of Einstein’s conviction that there exists a
“ General Principle of Relativity ”. This conviction evidenced itself in the fact
that he called his theory of gravitation “ The General Theory of Relativity ”
and that in later years he stubbornly adhered to this term. He then named his
previous theory, which operated in Galilean space, the “ Special Theory of
Relativity ”, contrasting it with his “ General ” Theory. The confusion of the
concepts of physical relativity and formal covariance is particularly clearly
shown at one point in Einstein’s autobiography where he himself formulates
the “ special ” theory of relativity (i.e. a theory in which, according to Einstein,
there is no “ general relativity ”) in the general covariant form, which, according
to Einstein, expresses the idea of “ general relativity ”.

Thus in the period immediately preceding the creation of the theory of
gravitation the idea of general covariance appeared to Einstein as a physical
idea and it guided him in his researches.

The next important step in the creation of the theory was made when 
Einstein introduced the generalized expression for the square of the interval, 
i.e. when he, in fact, introduced the hypothesis of Riemannian geometry for 
space and time. Einstein approached the idea of Riemannian geometry from 
the direction of requiring the general covariance of the equations. He made 
use of the fact that in the “ absolute differential calculus ” (tensor analysis) 
which by that time had been fully developed by Ricci, Levi-Civita and others, 
equations appear from the start in generally covariant form. Thus the
requirement of general covariance played an important heuristic role, in spite of the
fact that the essence of the matter lay not there, but in the new hypothesis
concerning the character of space-time geometry.†

In solving the problem of the form of the gravitational equations an important
step was made by Einstein in assuming that purely “ geometrical ” quantities,
namely the coefficients $g_{\mu\nu}$ in the expression for the squared interval, should be
considered to play the part of the gravitational potentials, and that no other
quantities need be introduced. From among the multitude of possible
non-linear generally covariant equations this assumption served to single out almost
uniquely one set of equations. To achieve full uniqueness some supplementary
considerations had to be introduced, such as the requirements that the mass 
tensor be conservative and that in the absence of masses there should be a 
solution corresponding to Galilean space. 

These two ideas, the idea of generalizing the metric and the idea of the unity
of metric and gravitation, were decisive. Operating with them, Einstein was
led to his equations of gravitation 

$$R_{\mu\nu} - \tfrac{1}{2} g_{\mu\nu} R = -\kappa T_{\mu\nu} \tag{96.01}$$

which contain the essence of this theory and represent one of the greatest 
achievements of human genius. These equations are given in Einstein’s 1916 
paper. 

Subsequently Einstein made attempts to alter these equations but the
original form remains the only correct one, provided only the meaning of
the mass tensor $T_{\mu\nu}$ is interpreted correctly. We have in mind the circumstance
that in the cosmological case, the mass tensor is determined only apart from a
term proportional to the metric tensor: the factor of proportionality in that
term characterizes not only the mass tensor itself but also the properties of
space as a whole (the factor plays the part of a cosmological constant). Einstein
also attempted to find equations not involving a mass tensor for some
“ universal field ” (Gesamtfeld) that was to include gravitation: apparently he
connected these attempts with his hopes of creating, independently of quantum
theory, a theory of elementary particles that were to be the singularities of the
field. These attempts ended in failure and we only mention them because the
idea of considering singularities of the field in place of the mass tensor was
also pursued in the work of Einstein and his school on the derivation of the
equations of motion from the gravitational equations.

† Here we use the word “ geometry ” in the sense of objective properties of space and time,
and not in the sense of the science of these properties.
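
The remark on the cosmological case can be illustrated by a short algebraic sketch. It assumes that the gravitational equations have the form $R_{\mu\nu} - \tfrac{1}{2} g_{\mu\nu} R = -\kappa T_{\mu\nu}$; the constant $C$ and the shifted tensor $\tilde{T}_{\mu\nu}$ are symbols introduced here purely for illustration, and the sign attached to the resulting term depends on the conventions adopted.

```latex
% Shift the mass tensor by a term proportional to the metric tensor
% (C is a hypothetical constant introduced only for this illustration):
\tilde{T}_{\mu\nu} = T_{\mu\nu} + C\, g_{\mu\nu}
% Written with \tilde{T}, the assumed equations
%   R_{\mu\nu} - \tfrac{1}{2} g_{\mu\nu} R = -\kappa \tilde{T}_{\mu\nu}
% become
R_{\mu\nu} - \tfrac{1}{2} g_{\mu\nu} R + \kappa C\, g_{\mu\nu} = -\kappa T_{\mu\nu}
% so the product \kappa C enters in the same way as a cosmological constant.
```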

Chronologically the first of these papers were those of Einstein and Grommer 
and of Einstein, published in 1927 [28, 29]. In this work the following question 
was posed and solved in principle. If the masses are considered as singularities 
of the field, is it possible to prescribe arbitrarily the motion of these singularities, 
without violating the gravitational equations? This question was answered in 
the negative: it emerged that the motion of the singularities could not be 
prescribed, because it is a consequence of the gravitational equations. Hence 
it follows in particular that the principle of the geodesic is not independent 
of the gravitational equations but is a consequence thereof. It is of importance 
to note that this result is conditioned by the non-linear character of the 
gravitational equations. 

In spite of the enormous importance of this result the two papers of Einstein 
just mentioned aroused little interest and more than ten years passed before 
they were followed up. 

The development of the ideas laid down in the paper by Einstein and Grommer 
in 1927 began in the years 1938-39 quite independently in two directions. 
The work of Einstein, Infeld and their collaborators is concerned with the one 
development, the work of Fock and his collaborators with the other; the 
papers of Papapetrou, which appeared after 1950, can also be counted as 
going in the second direction. 

We examine first the work of the Einstein school. 

In a series of papers by Einstein, Infeld and Hoffmann (1938) [30] and by
Einstein and Infeld (1940) [31], (1949) [32], a method was elaborated which made 
it possible to derive from the gravitational equations and to write in explicit 
form an approximation to the equations of motion for pointlike singularities 
of the field, which represent spherically symmetric bodies of infinitesimal size. 
The equations were obtained both in the first (Newtonian), and in the following 
approximation. The authors were guided by Einstein’s idea of the undesirability 
of introducing a mass tensor and therefore started with the gravitational 
equations with a vanishing right-hand side. (Incidentally, Infeld showed in 
1954 that the mathematical work becomes incomparably simpler if, from the 
start, one introduces a mass tensor involving Dirac delta-functions,
corresponding to spherically symmetric singularities.) In attempting to find an
exact solution of the mechanical problem of the motion of the field singularities 
the authors also introduced infinite series in powers of a parameter inversely 
proportional to the speed of light. 

The interpretation of the results of the work of Einstein’s school is made 
difficult by the fact that the coordinate system is not specified from the start 
(in the way we specified the harmonic coordinate system), but instead is
determined from approximation to approximation by means of various
supplementary conditions which are sometimes introduced explicitly and sometimes
implicitly. In general, however, one can say that the coordinates that are in
fact employed in this work differ so little from harmonic coordinates, that the
differences have no effect on the equations of motion in second order. If one
denotes the harmonic coordinates by $(t, x_i)$ and the coordinates used by Einstein
and Infeld by $(t', x_i')$ then in orders of magnitude we have

$$x_i' = x_i + O(\ldots), \qquad t' = t + O(\ldots) \tag{96.02}$$

where L and q are the parameters we have used repeatedly to characterize the 
orders of magnitude of length and velocity. This question is discussed in more 
detail in the author’s paper [ 45]. 

In those cases when approximations higher than the second are discussed 
another question arises, in addition to that of the indeterminateness of the 
coordinate system. It becomes unclear whether the formal solution obtained 
has physical sense. This doubt is due to the fact that the authors of these 
papers do not impose the condition of outward radiation and, in fact, make the 
contrary assertion that there exist certain (unknown) coordinate transformations 
which reduce the exact equations of motion to Newtonian form, corresponding 
to a strictly conservative non-radiating system. Both parts of the latter
assertion, namely the possibility of reducing more general equations of motion to
Newtonian form and the possibility of reducing a non-conservative (radiating) 
system to a conservative one, seem doubtful. 

The 1927 paper by Einstein and Grommer is also the starting point of a series 
of papers in the other direction. 

In our paper [34] published in 1939 we studied the problem of the motion of 
spherically symmetric non-rotating bodies of finite size and derived equations 
of motion for them, starting from the gravitational equations. (See also [36].) 
The particular characteristics of the method used by us are on the one hand 
the study of the mass tensor, which is determined in parallel with the
gravitational potentials, and on the other hand the use of harmonic coordinates: our
calculations were so conducted that the equations of motion were obtained 
from the harmonic conditions. In addition we obtained the gravitational 
potentials in explicit form (see Sections 82 and 83 of this book). 

The fact that harmonic coordinates are important in principle was already 
noted in our paper of 1939 and it was discussed more fully in the 1947 paper 
[35], where we pointed out the connection between the question of the
uniqueness of the coordinate system and that of the preferred status of the Copernican
heliocentric system. 

In our 1939 paper the equations of motion were obtained only in Newtonian 
approximation, but it was mentioned that the work of Petrova would be 
devoted to the second approximation. That work was indeed completed in 
1940 but published only in 1949 [37]. The equations of motion for a system of 
finite masses in second approximation obtained by Petrova were used by us 
in our 1941 paper [38] on the integrals of motion of the mass centre. 

In 1954 Kashkarov [39] used the same method to derive from the
gravitational equations the equations of motion of rotating masses in Newtonian
approximation. 

In 1950 Fichtenholz [40] reduced the equations of motion in second
approximation to Lagrangian form and hence derived their integrals.

In his 1951 paper [41] Papapetrou, independently of Petrova, continued 
our 1939 work and derived from it the equations of motion for
spherically symmetric non-rotating masses in second approximation. However, in
Papapetrou’s derivation there is an inconsistency of approach in that he 
chooses a priori a weighting function when averaging the equations of motion 
of a continuous medium over the region of each mass. The choice that
Papapetrou implicitly makes is justified only at the end of the calculation by a
direct verification of the harmonic conditions. The same remark applies to a 
more recent paper by Meister [42]. 

In the domain of the theory of masses in motion a number of new results 
are derived in this book. We shall not speak here of these or of our results 
in other domains of the theory of gravitation, referring the reader to the main 
text. We merely note that for the case of rotating masses of finite size we 
obtain not only the equations of motion themselves but also their integrals. 

In all work on the theory of the motion of masses, both the work of Einstein
and his school and our own, the case of an “ insular ” distribution of masses is
discussed, i.e. just the case relevant to our general conclusions on the possibility 
of introducing a harmonic coordinate system. Therefore the comparison of the 
two sets of papers gives a vivid illustration of the difference between Einstein’s 
point of view and ours on the question of coordinates and on the fundamental 
principles of the theory. Apart from these general matters the treatment of 
some concrete questions connected with the statement of the general problem 
is also different; while in Einstein’s work the masses are thought of as points, 
we consider extended masses with a definite internal structure. Einstein’s
interest in point masses appears to be connected with his aforementioned attempts
to construct a theory of matter with particles as singular points of the field.
Einstein’s view of the mass tensor as an “ undesirable element ” in the theory
is connected with these same ideas. In contrast to this view, we consider the 
mass tensor as a necessary element of the theory and attach special significance 
to its unique definition and to the examination of the conditions in which such 
a definition is possible. 

Finally, on the question of the radiation of gravitational waves, we consider 
that although in ordinary astronomical conditions these waves do not play any
noticeable part, they do nevertheless exist in principle. Therefore in a rigorous
statement of the problem of the motion of masses we would have to include the
wave part of the gravitational field: the problem is then no longer a problem
of mechanics in the proper sense. 



CONCLUSION 

We shall attempt a brief exposition of our point of view concerning the theory 
of space, time and gravitation. 

In most ways our standpoint coincides with Einstein’s as usually expounded, 
but on some essential points it is different. 

Space and time must be considered in conjunction. They combine into a
four-dimensional manifold which is uniform in the absence of gravitation and then
constitutes Galilean space. Galilean space possesses a pseudo-Euclidean 
metric. Its uniformity finds expression in the existence for it of the group of 
Lorentz transformations, which describe the transition from one inertial frame 
of reference to another. The property of uniformity is related to the physical 
principle of relativity, which asserts the existence of corresponding processes in 
any two inertial frames of reference. Therefore the theory of Galilean space 
based on the Lorentz transformations is called the Theory of Relativity. 

Both the property of uniformity and the whole theory of Galilean space may 
also be formulated in arbitrary coordinates. However, the privileged character 
of Galilean coordinates manifests itself in the particular simplicity (linearity) of 
the transformations that relate inertial coordinate systems, that class of 
systems within which the physical principle of relativity holds. 

The physical basis of the theory of gravitation is the law of the equality of 
inertial and gravitational mass and its mathematical basis is the hypothesis 
that space and time possess a Riemannian metric, which is not given in advance 
and may itself depend on the physical processes occurring in the space (on the 
position and motion of masses). Owing to the fact that the description of the 
gravitational field does not involve the introduction of any quantities other 
than the metric tensor, Einstein’s gravitational theory is a realization of the 
idea of the unity of metric and gravitation. The Riemannian geometry used in 
the theory of gravitation studies the local properties of space and, in the general 
case, tells us nothing of the properties of space as a whole. However, in the 
theory of gravitation local treatment is insufficient, since the field equations 
are partial differential equations, the solution of which depends decisively on 
the boundary conditions. In gravitational theory, therefore, one has inevitably 
to introduce some kind of hypothesis concerning the properties of space as a 
whole. The simplest and at the same time most important of these hypotheses 
is the assumption that, at infinity, space is Galilean. Subject to this assumption 
one can impose on the gravitational potentials such supplementary conditions 
that the coordinate system is determined uniquely apart from a Lorentz
transformation. This coordinate system, which we call harmonic, stands in strong
analogy with the ordinary inertial coordinate system. 

The physical principle of relativity, understood as the statement that there 
exist corresponding processes in frames of reference of a certain class, does not 
hold in the general case of Einstein space, let alone the “ General Principle of
Relativity ” within the class of arbitrary coordinate systems. However, within
the class of harmonic coordinate systems a principle of relativity is valid which
is fully analogous to the principle that holds in Galilean space. In this case the
notion of “ corresponding processes ” must include the distribution and motion 
of the masses that determine the metric. 

The physical principle of relativity is connected, directly or indirectly, with 
the uniformity of space: either with full uniformity (Galilean space), or with
uniformity at infinity (space permitting the introduction of harmonic
coordinates), or else finally with local uniformity; in the last case the principle
holds only in the infinitely small. One can achieve local uniformity, i.e.
uniformity in the neighbourhood of a single point of space, in a geodesic coordinate
system (the principle of local equivalence between acceleration and gravitation). 
The principle of equivalence thus follows from the Riemannian nature of the
metric, but it is already contained in Newtonian mechanics and is closely
related to the law of equality of inertial and gravitational mass. 

From the point of view of the theory of gravitation the law of the equality of 
inertial and gravitational mass is valid with all the accuracy with which it is 
possible to define independently the inertial and gravitational properties of mass. 
This law is not of local character and is therefore more useful as a basis for 
deriving Einstein’s gravitational equations than the principle of equivalence, 
which requires the use of a frame of reference in accelerated motion—a concept 
for which there is no satisfactory physical definition. 

Owing to the local nature of the principle of equivalence, it is not a sufficient 
basis for the statement that fields of acceleration and gravitation are
indistinguishable in finite regions of space, and even less for the statement that there
exists a “ General Principle of Relativity ”. Such a conclusion from the local
to the general is as inadmissible as would be, for instance, the conclusion that
all analytic functions are indistinguishable because they all behave as if they
were linear in the infinitely small.

Since the physical principle of relativity is related to uniformity and since in 
the theory of gravitation the uniformity is confined either to the infinitesimal 
or to infinitely remote regions of space, but does not hold in finite regions and 
at finite distances from masses, the theory of gravitation can evidently not be
looked upon as a generalization of the concept of relativity. On the contrary, 
the theory limits that concept strongly: one can even say that the principle of 
relativity does not play any essential role at all in the theory of gravitation. 
It is therefore incorrect to call Einstein’s theory of gravitation a “ General 
Theory of Relativity ”, all the more since “ The General Principle of Relativity ”
is impossible under any physical conditions. 

The general covariance of equations has quite a different meaning from the 
physical principle of relativity; it is merely a formal property of the equations 
which allows one to write them down without prejudging the question of what
coordinate system to use. The solution of equations written in generally
covariant form involves four arbitrary functions; but the indeterminacy arising
from this has no fundamental importance and does not express any kind of
“ general relativity ”. From a practical point of view such an indeterminacy
even represents something of a disadvantage. It seems to us that it is preferable 
to have a mathematical statement of the problem in which uniqueness of the 
solution may be achieved (naturally to the same extent as the essence of the
problem permits).

In our statement of the problem of an isolated system of masses the
uniqueness of the solution is achieved by introducing four supplementary equations,
the harmonic conditions, and also boundary conditions, and by passing
from a local description to a description of space as a whole. In this way the
uniqueness of the solution is achieved in all the problems considered: in the
problems of the equations of motion, of radiation, of the form of the rigorous, 
the approximate and the asymptotic solution of the gravitational equations 
and so forth. 
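
For orientation, the four harmonic conditions referred to here are commonly written in the form below (the de Donder condition). This is only a sketch in assumed notation: $g$ stands for the determinant of the metric tensor, summation over the repeated index $\mu$ is understood, and the index placement may differ in detail from that used in the main text.

```latex
% Harmonic (de Donder) coordinate conditions: four equations, one for each \nu.
% g is the determinant of g_{\mu\nu}; the repeated index \mu is summed over.
\frac{\partial}{\partial x_\mu}\left(\sqrt{-g}\; g^{\mu\nu}\right) = 0
\qquad (\nu = 0, 1, 2, 3)
% Equivalent statement: each coordinate, regarded as a scalar function,
% satisfies the generally covariant wave (d'Alembert) equation
\Box\, x_\nu = 0
```

Since these are four equations, they match in number the four arbitrary functions that a generally covariant statement of the problem leaves undetermined.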

In the case of an isolated system of bodies the question of the coordinate 
system is answered in the same way as in the absence of gravitation: there
exists a preferred system of coordinates (Galilean or harmonic) but it is also
possible to use any other coordinate system. As in the Galilean case, the
privileged character of harmonic coordinates shows itself in the linearity of the
transformations which connect the coordinate systems within the class in which the
physical principle of relativity holds. This property of harmonic coordinates
distinguishes them physically from other coordinate systems; in particular it
is this property which allows one, as in the Galilean case, to give preference to the 
heliocentric theory of the Solar System over the geocentric theory. 

The existence of a privileged coordinate system is in the first place important 
as a matter of principle, in that it reflects objective properties of space. In 
addition, since this coordinate system is of a standard character, it allows one 
to establish the geometrical meaning of other coordinates and to make
comparisons between solutions found in other coordinates. Finally, harmonic
coordinates have great practical significance, because calculations can be 
simplified enormously by their use. 

When discussing regions of space that include many galaxies the notion of 
an isolated system of masses becomes inapplicable and the properties of space 
as a whole must be formulated differently (Friedmann-Lobachevsky space).



ON THE DERIVATION OF THE LORENTZ TRANSFORMATIONS


In Section 8 the equations were derived which must be satisfied by the
transformation functions

$$x_i' = f_i(x_0, x_1, x_2, x_3) \qquad (i = 0, 1, 2, 3) \tag{A.01}$$

in order to ensure that the following two conditions are fulfilled:

(a) To a uniform rectilinear motion in the coordinates $(x_i)$ there must
correspond a motion of the same nature in the coordinates $(x_i')$;

(b) To a uniform rectilinear motion with light-velocity in the coordinates
$(x_i)$ there must correspond a motion of the same nature in the
coordinates $(x_i')$.

Condition (b) is equivalent to the condition:

(b') To the wave front equation

$$\left(\frac{\partial\omega}{\partial x_0}\right)^2 - \left(\frac{\partial\omega}{\partial x_1}\right)^2 - \left(\frac{\partial\omega}{\partial x_2}\right)^2 - \left(\frac{\partial\omega}{\partial x_3}\right)^2 = 0 \tag{A.02}$$

in the coordinates $(x_i)$, there must correspond just such an equation in
the coordinates $(x_i')$.

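It was shown in Section 8 that functions satisfying condition (a) must be
solutions of the system of equations

$$\frac{\partial^2 f_i}{\partial x_k\,\partial x_l} = \psi_k\frac{\partial f_i}{\partial x_l} + \psi_l\frac{\partial f_i}{\partial x_k} \tag{A.03}$$

whereas functions satisfying (b) or (b') must be solutions of the system

$$\sum_{i=0}^{3} e_i\frac{\partial f_i}{\partial x_k}\frac{\partial f_i}{\partial x_l} = \lambda\, e_k\delta_{kl} \tag{A.04}$$

From (A.04) we shall derive another equation similar in form to (A.03).
Differentiating (A.04) with respect to $x_m$ and putting

$$\frac{\partial\lambda}{\partial x_m} = 2\lambda\varphi_m \tag{A.05}$$

we obtain

$$\sum_{i=0}^{3} e_i\left(\frac{\partial^2 f_i}{\partial x_k\,\partial x_m}\frac{\partial f_i}{\partial x_l} + \frac{\partial f_i}{\partial x_k}\frac{\partial^2 f_i}{\partial x_l\,\partial x_m}\right) = 2\lambda\varphi_m e_k\delta_{kl} \tag{A.06}$$

We now introduce the quantities $\Gamma^l_{km}$ defined by the relation

$$\sum_{i=0}^{3} e_i\frac{\partial f_i}{\partial x_l}\frac{\partial^2 f_i}{\partial x_k\,\partial x_m} = \lambda\, e_l\,\Gamma^l_{km} \tag{A.07}$$

Equation (A.06) may then be written as

$$e_l\Gamma^l_{km} + e_k\Gamma^k_{lm} = 2\varphi_m e_k\delta_{kl} \tag{A.08}$$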


To find the quantities $\Gamma^l_{km}$ in terms of the $\varphi_m$, we write down two equations
obtained from (A.08) by cyclic permutation of the indices k, l, m:

$$e_m\Gamma^m_{lk} + e_l\Gamma^l_{mk} = 2\varphi_k e_l\delta_{lm} \tag{A.09}$$

$$e_k\Gamma^k_{ml} + e_m\Gamma^m_{kl} = 2\varphi_l e_m\delta_{mk} \tag{A.10}$$

Taking their sum, subtracting the original equation (A.08) and using the
symmetry of $\Gamma^l_{km}$ with respect to the lower indices we obtain, after
multiplication with $e_m$

$$\Gamma^m_{kl} = \varphi_k\delta_{lm} + \varphi_l\delta_{km} - e_m\varphi_m e_k\delta_{kl} \tag{A.11}$$
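
The combination just described can be written out step by step. The following is a sketch under stated assumptions: no summation over repeated indices is implied, each $e_m$ equals $\pm 1$ so that $e_m^2 = 1$, and $\Gamma^l_{km}$ is symmetric in its lower indices.

```latex
% Sum of the two permuted equations minus the original one, term by term:
\begin{aligned}
&\bigl(e_m\Gamma^m_{lk} + e_l\Gamma^l_{mk}\bigr)
 + \bigl(e_k\Gamma^k_{ml} + e_m\Gamma^m_{kl}\bigr)
 - \bigl(e_l\Gamma^l_{km} + e_k\Gamma^k_{lm}\bigr) \\
&\qquad = 2\varphi_k e_l\delta_{lm} + 2\varphi_l e_m\delta_{mk} - 2\varphi_m e_k\delta_{kl}
\end{aligned}
% Symmetry in the lower indices cancels the e_l and e_k terms
% and doubles the e_m term:
2 e_m\Gamma^m_{kl} = 2\varphi_k e_l\delta_{lm} + 2\varphi_l e_m\delta_{mk} - 2\varphi_m e_k\delta_{kl}
% Divide by 2, multiply by e_m, and use e_m e_l\delta_{lm} = \delta_{lm}, e_m^2 = 1:
\Gamma^m_{kl} = \varphi_k\delta_{lm} + \varphi_l\delta_{km} - e_m\varphi_m e_k\delta_{kl}
```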


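Now, equations (A.07) may be solved with respect to the second derivatives
of $f_i$. Using (A.04) we get

$$\frac{\partial^2 f_i}{\partial x_k\,\partial x_l} = \sum_{n=0}^{3}\Gamma^n_{kl}\frac{\partial f_i}{\partial x_n} \tag{A.12}$$

Inserting the value of $\Gamma^n_{kl}$ from (A.11) we obtain finally

$$\frac{\partial^2 f_i}{\partial x_k\,\partial x_l} = \varphi_k\frac{\partial f_i}{\partial x_l} + \varphi_l\frac{\partial f_i}{\partial x_k} - e_k\delta_{kl}\sum_{m=0}^{3} e_m\varphi_m\frac{\partial f_i}{\partial x_m} \tag{A.13}$$

Such equations for the transformation functions (A.01) follow from condition
(b) or (b') that is concerned with motion with the speed of light. On the other
hand, condition (a) concerning the uniformity of motion leads to equations
(A.03), and these can also be written in the form (A.12) if we put

$$\Gamma^n_{kl} = \psi_k\delta_{ln} + \psi_l\delta_{kn} \tag{A.14}$$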


We have seen in Section 8 that if both conditions (a) and (b) (or both (a)
and (b')) are imposed simultaneously, all second derivatives of the functions
$f_i$ are zero, the quantity $\lambda$ is constant (and can be put equal to unity), and the
transformation in question reduces to a Lorentz transformation.

We must now investigate the consequences of condition (a) taken separately 
and of condition (b) or (b') taken separately. 

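Equations of the form of (A.12) have been investigated in detail in Section 42,
where a necessary and sufficient condition for their complete integrability was
given. This condition has the form

$$\frac{\partial\Gamma^m_{kl}}{\partial x_n} - \frac{\partial\Gamma^m_{kn}}{\partial x_l} + \sum_{r=0}^{3}\left(\Gamma^r_{kl}\Gamma^m_{rn} - \Gamma^r_{kn}\Gamma^m_{rl}\right) = 0 \tag{A.15}$$

It is deducible from the requirement that the various expressions for third
derivatives of the $f_i$ obtainable by differentiation of (A.12) should all agree.
Inserting the expression (A.14) into (A.15) we get

$$\left(\frac{\partial\psi_l}{\partial x_n} - \frac{\partial\psi_n}{\partial x_l}\right)\delta_{km} + \left(\frac{\partial\psi_k}{\partial x_n} - \psi_k\psi_n\right)\delta_{lm} - \left(\frac{\partial\psi_k}{\partial x_l} - \psi_k\psi_l\right)\delta_{nm} = 0 \tag{A.16}$$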


Here we put k # m; l # m; but n — m and get 

th w '' =0 


(A.17) 


Since the value of the index m in (A. 16) is arbitrary the restriction to k ^ m 
and l ^ mis inessential and equation (A.17) must hold for all values of k and l. 
Evidently this equation is also sufficient for (A.16) to be satisfied, for it follows 
from it that 

^ = ^ (A. 18) 


and the first term in (A.16) also becomes zero. 

We now insert the expression (A.11) into (A.15). In consequence of (A.05) we may put

$$\varphi_{kl} = \frac{\partial\varphi_k}{\partial x_l} \tag{A.19}$$

Evaluating the left-hand side of (A.15) we obtain, after multiplication by $e_m$, a relation (A.20) which, on putting

$$\Phi_{kl} = \varphi_{kl} - \varphi_k\varphi_l + \tfrac{1}{2} e_k \delta_{kl} \sum_{r=0}^{3} e_r \varphi_r^2 \tag{A.21}$$

takes the form

$$\Phi_{kn} e_m \delta_{lm} - \Phi_{kl} e_m \delta_{nm} + \Phi_{lm} e_k \delta_{kn} - \Phi_{mn} e_k \delta_{kl} = 0 \tag{A.22}$$

Hence it is easy to deduce that necessarily

$$\Phi_{kl} = 0 \tag{A.23}$$

For, if we multiply the left-hand side of (A.22) by $-e_m$, put m = n and sum over m we get

$$2\Phi_{kl} + e_k \delta_{kl} \sum_{m=0}^{3} e_m \Phi_{mm} = 0 \tag{A.24}$$

Multiplying the left-hand side of (A.24) by $e_k$, putting k = l and summing over k we get, after division by 6:

$$\sum_{m=0}^{3} e_m \Phi_{mm} = 0 \tag{A.25}$$
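Thus the condition for the integrability of (A.03) is (A.17) and the condition of integrability of (A.13) is (A.23). As a result of (A.18) and (A.19) we may put

$$\psi_k = \frac{\partial\psi}{\partial x_k}; \qquad \varphi_k = \frac{\partial\varphi}{\partial x_k} \tag{A.26}$$

where $\psi$ and $\varphi$ are unknown functions (according to (A.05) we can take $\varphi = \log\sqrt{\lambda}$). Then the conditions of integrability become

$$\frac{\partial^2\psi}{\partial x_k\,\partial x_l} - \frac{\partial\psi}{\partial x_k}\frac{\partial\psi}{\partial x_l} = 0 \tag{A.27}$$

and

$$\frac{\partial^2\varphi}{\partial x_k\,\partial x_l} - \frac{\partial\varphi}{\partial x_k}\frac{\partial\varphi}{\partial x_l} + \tfrac{1}{2} e_k \delta_{kl} \sum_{r=0}^{3} e_r \left(\frac{\partial\varphi}{\partial x_r}\right)^2 = 0 \tag{A.28}$$

If here we put

$$e^{-\psi} = w; \qquad \psi = -\log w \tag{A.29}$$

and also

$$e^{-\varphi} = u; \qquad \varphi = -\log u \tag{A.30}$$

the new unknown functions are seen to satisfy the equations

$$\frac{\partial^2 w}{\partial x_k\,\partial x_l} = 0 \tag{A.31}$$

and

$$\frac{\partial^2 u}{\partial x_k\,\partial x_l} = \frac{1}{2u}\, e_k \delta_{kl} \sum_{r=0}^{3} e_r \left(\frac{\partial u}{\partial x_r}\right)^2 \tag{A.32}$$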


Equation (A.31) shows that w is a linear function of the coordinates

$$w = w^{(0)} + w_0 x_0 + w_1 x_1 + w_2 x_2 + w_3 x_3 \tag{A.33}$$

and that therefore

$$\psi_k = -\frac{w_k}{w} \tag{A.34}$$

where the $w_k$ are constants. Inserting these values of the $\psi_k$ into the equation (A.03) for $f_i$ we can write the latter as

$$\frac{\partial^2 (w f_i)}{\partial x_k\,\partial x_l} = 0 \tag{A.35}$$

whence it follows that the quantities $w f_i$ are linear functions. This means that the four functions $f_0$, $f_1$, $f_2$ and $f_3$ are ratios of linear functions, all with the same denominator w.

We thus arrive at the following conclusion. The most general form of the 
transformation (A.01) which satisfies condition (a) is a transformation involving 
linear fractions, all with the same denominator. In the particular case when the 
denominator reduces to a constant the linear fractions become linear functions. 
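The conclusion above can be illustrated numerically. The following is a minimal sketch, not taken from the text: it applies one arbitrarily chosen linear-fractional map with a common denominator to three points of a uniform rectilinear motion and checks, in exact rational arithmetic, that the images are again collinear. The coefficients `A`, `SHIFT`, `W_VEC`, `W0` and the sample line are hypothetical values picked only for the demonstration.

```python
from fractions import Fraction

# Hypothetical coefficients of a linear-fractional map (illustration only).
A = [[2, 1, 0, 0],
     [0, 1, 3, 0],
     [1, 0, 1, 1],
     [0, 2, 0, 1]]
SHIFT = [1, 0, -1, 2]
W_VEC = [1, 0, 2, 1]   # coefficients of the common denominator w
W0 = 5                 # constant term of the common denominator

# Hypothetical straight world line x(s) = START + s * VELOCITY.
START = [Fraction(1), Fraction(2), Fraction(0), Fraction(1)]
VELOCITY = [Fraction(1), Fraction(1), Fraction(2), Fraction(0)]


def point_on_line(s):
    """Point of the straight line for parameter value s."""
    return [a + s * v for a, v in zip(START, VELOCITY)]


def linear_fractional(x):
    """x'_i = (sum_k A[i][k] x_k + SHIFT[i]) / w(x), same w for all i."""
    denom = W0 + sum(w * xk for w, xk in zip(W_VEC, x))
    return [(sum(a * xk for a, xk in zip(row, x)) + s) / denom
            for row, s in zip(A, SHIFT)]


def collinear(p, q, r):
    """True when q - p and r - p are parallel (all 2x2 minors vanish)."""
    d1 = [qi - pi for qi, pi in zip(q, p)]
    d2 = [ri - pi for ri, pi in zip(r, p)]
    n = len(d1)
    return all(d1[i] * d2[j] - d1[j] * d2[i] == 0
               for i in range(n) for j in range(i + 1, n))


images = [linear_fractional(point_on_line(s)) for s in (0, 1, 2)]
print(collinear(*images))
```

Exact fractions are used so that the collinearity test is an equality rather than a tolerance check.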
We now turn to the discussion of equation (A.32) which follows from condition (b) or (b'). In the first place we note that for $k \neq l$ its right-hand side is zero. Therefore the second derivatives of u with respect to two different variables vanish and the function u is additively composed of single variable functions. On the other hand, it follows from (A.32) that the double derivatives of u with respect to a single variable are all equal, apart from a sign:

$$e_0 \frac{\partial^2 u}{\partial x_0^2} = e_1 \frac{\partial^2 u}{\partial x_1^2} = e_2 \frac{\partial^2 u}{\partial x_2^2} = e_3 \frac{\partial^2 u}{\partial x_3^2} \tag{A.36}$$

But each of the quantities of (A.36) can either be a constant or depend on one variable only (that with respect to which the differentiation is performed); they are therefore all equal to the same constant. We call this constant 2C and then have

$$\frac{\partial^2 u}{\partial x_k^2} = 2C e_k; \qquad \frac{\partial^2 u}{\partial x_k\,\partial x_l} = 0 \quad (k \neq l) \tag{A.37}$$

and also

$$\sum_{r=0}^{3} e_r \left(\frac{\partial u}{\partial x_r}\right)^2 = 4Cu \tag{A.38}$$

It follows from (A.37) that u is of the form

$$u = C \sum_{k=0}^{3} e_k x_k^2 - 2 \sum_{k=0}^{3} e_k \alpha_k x_k + B \tag{A.39}$$

where the $\alpha_k$ and B are constants. Equation (A.38) only gives a relation between the constants, namely

$$BC = \sum_{r=0}^{3} e_r \alpha_r^2 \tag{A.40}$$


According to (A.05) and (A.30) the factor $\lambda$ in (A.04) is inversely proportional to $u^2$. We can put

$$\sum_{i=0}^{3} e_i \frac{\partial f_i}{\partial x_k}\frac{\partial f_i}{\partial x_l} = \frac{1}{u^2}\, e_k \delta_{kl} \tag{A.41}$$

Without restriction of generality we can assume that at the origin, i.e. for $x_0 = x_1 = x_2 = x_3 = 0$, we have u = 1. This can always be achieved by changing the scale of the $f_i$ or the $x_k$. With this condition we get

$$B = 1; \qquad C = \sum_{r=0}^{3} e_r \alpha_r^2 \tag{A.42}$$

and therefore

$$u = 1 - 2 \sum_{k=0}^{3} e_k \alpha_k x_k + \sum_{k=0}^{3} e_k \alpha_k^2 \sum_{l=0}^{3} e_l x_l^2 \tag{A.43}$$
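We replace the quantities $\varphi_k$ in (A.13) by their values

$$\varphi_k = -\frac{u_k}{u}; \qquad u_k = \frac{\partial u}{\partial x_k} \tag{A.44}$$

and then get

$$u \frac{\partial^2 f_i}{\partial x_k\,\partial x_l} + u_k \frac{\partial f_i}{\partial x_l} + u_l \frac{\partial f_i}{\partial x_k} = e_k \delta_{kl} \sum_{m=0}^{3} e_m u_m \frac{\partial f_i}{\partial x_m} \tag{A.45}$$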


If u is of the form (A.43) the conditions of integrability of the system (A.45) are satisfied, and the functions are completely determined by the values at the origin of the functions themselves and their first derivatives. Let these values be

$$(f_i)_0 = a_i; \qquad \left(\frac{\partial f_i}{\partial x_k}\right)_0 = e_k a_{ik} \tag{A.46}$$

Since at the origin u = 1, the equation (A.41) gives

$$\sum_{i=0}^{3} e_i a_{ik} a_{il} = e_k \delta_{kl} \tag{A.47}$$

We now put

$$f_i = a_i + \sum_{r=0}^{3} e_r a_{ir} f_r^* \tag{A.48}$$

By virtue of (A.47) these equations can be solved for the $f_i^*$ which are then expressed as linear functions of the $f_i$. Evidently the $f_i^*$ satisfy the same system of differential equations as the $f_i$; for the system (A.45) it follows from the linearity of the equations, for the system (A.41) it follows from (A.47). But the initial conditions for the $f_i^*$ are of the form

$$(f_i^*)_0 = 0; \qquad \left(\frac{\partial f_i^*}{\partial x_k}\right)_0 = \delta_{ik} \tag{A.49}$$

Thus if we find a solution of the system (A.45) with initial conditions of the particular form (A.49), equation (A.48) gives us the most general solution of the system.

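The system (A.45) may be written in the form

$$\frac{\partial^2 (u f_i^*)}{\partial x_k\,\partial x_l} = f_i^* \frac{\partial^2 u}{\partial x_k\,\partial x_l} + e_k \delta_{kl} \sum_{m=0}^{3} e_m \frac{\partial u}{\partial x_m}\frac{\partial f_i^*}{\partial x_m} \tag{A.50}$$

and, using (A.37) we get

$$\frac{\partial^2 (u f_i^*)}{\partial x_k\,\partial x_l} = e_k \delta_{kl} \left\{ 2C f_i^* + \sum_{m=0}^{3} e_m \frac{\partial u}{\partial x_m}\frac{\partial f_i^*}{\partial x_m} \right\} \tag{A.51}$$

Here we put

$$f_i^* = \frac{F_i}{u} \tag{A.52}$$

By use of (A.38) we obtain

$$\frac{\partial^2 F_i}{\partial x_k\,\partial x_l} = e_k \delta_{kl}\, \frac{1}{u} \left\{ -2C F_i + \sum_{m=0}^{3} e_m \frac{\partial u}{\partial x_m}\frac{\partial F_i}{\partial x_m} \right\} \tag{A.53}$$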
Arguing in the same way as in the discussion of (A.32) we get

$$\frac{\partial^2 F_i}{\partial x_k\,\partial x_l} = 2 c_i e_k \delta_{kl} \tag{A.54}$$

with

$$-2C F_i + \sum_{m=0}^{3} e_m \frac{\partial u}{\partial x_m}\frac{\partial F_i}{\partial x_m} = 2 c_i u \tag{A.55}$$

It is easy to see from (A.52) that the initial conditions (A.49) for the $f_i^*$ lead to conditions of the same form for the $F_i$. Equation (A.54) together with the initial conditions gives

$$F_i = c_i \sum_{k=0}^{3} e_k x_k^2 + x_i \tag{A.56}$$

Insertion of (A.56) into (A.55) allows us to determine the values of the constants $c_i$, namely

$$c_i = -\alpha_i \tag{A.57}$$

With these values for the constants equation (A.55) becomes an identity. Therefore we have

$$F_i = x_i - \alpha_i \sum_{k=0}^{3} e_k x_k^2 \tag{A.58}$$

and

$$f_i^* = \frac{x_i - \alpha_i \sum_{k=0}^{3} e_k x_k^2}{1 - 2 \sum_{k=0}^{3} e_k \alpha_k x_k + \sum_{k=0}^{3} e_k \alpha_k^2 \sum_{l=0}^{3} e_l x_l^2} \tag{A.59}$$

We arrive at the following final conclusion:

The form of the most general transformation (A.01) satisfying condition (b) or (b') is obtained from the transformation

$$x_i^* = \frac{x_i - \alpha_i \sum_{k=0}^{3} e_k x_k^2}{1 - 2 \sum_{k=0}^{3} e_k \alpha_k x_k + \sum_{k=0}^{3} e_k \alpha_k^2 \sum_{l=0}^{3} e_l x_l^2} \tag{A.60}$$

together with the transformation

$$x_i' = a_i + \sum_{k=0}^{3} e_k a_{ik} x_k^* \tag{A.61}$$

where the coefficients $a_{ik}$ satisfy condition (A.47). In addition, a change of scale is possible.

Transformation (A.61) is the usual Lorentz transformation, and (A.60) bears the name of Möbius transformation. The latter has many remarkable properties which, however, we shall not dwell on.
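One of these properties, the one used in this appendix, is that a Möbius transformation rescales the quadratic form by the factor $1/u^2$, as in (A.41). The sketch below checks this numerically with a finite-difference Jacobian. It assumes the sign convention $e_0 = 1$, $e_1 = e_2 = e_3 = -1$ (not restated in this appendix), and the sample point and the constants `alpha` used in the test are arbitrary illustrative values.

```python
# Assumed signature e_0 = +1, e_1 = e_2 = e_3 = -1 (an assumption of this sketch).
E = [1.0, -1.0, -1.0, -1.0]


def u_factor(x, alpha):
    """u = 1 - 2 sum e_k a_k x_k + (sum e_k a_k^2)(sum e_l x_l^2)."""
    a_dot_x = sum(e * a * xi for e, a, xi in zip(E, alpha, x))
    a_sq = sum(e * a * a for e, a in zip(E, alpha))
    x_sq = sum(e * xi * xi for e, xi in zip(E, x))
    return 1.0 - 2.0 * a_dot_x + a_sq * x_sq


def mobius(x, alpha):
    """x*_i = (x_i - alpha_i sum e_k x_k^2) / u."""
    x_sq = sum(e * xi * xi for e, xi in zip(E, x))
    u = u_factor(x, alpha)
    return [(xi - a * x_sq) / u for xi, a in zip(x, alpha)]


def jacobian_columns(x, alpha, h=1e-5):
    """cols[k][i] approximates d x*_i / d x_k by central differences."""
    cols = []
    for k in range(len(x)):
        xp, xm = list(x), list(x)
        xp[k] += h
        xm[k] -= h
        fp, fm = mobius(xp, alpha), mobius(xm, alpha)
        cols.append([(a - b) / (2.0 * h) for a, b in zip(fp, fm)])
    return cols


def conformal_defect(x, alpha):
    """Largest deviation of sum_i e_i J_ik J_il from e_k delta_kl / u^2."""
    cols = jacobian_columns(x, alpha)
    u = u_factor(x, alpha)
    worst = 0.0
    for k in range(4):
        for l in range(4):
            lhs = sum(E[i] * cols[k][i] * cols[l][i] for i in range(4))
            rhs = (E[k] if k == l else 0.0) / u ** 2
            worst = max(worst, abs(lhs - rhs))
    return worst


print(conformal_defect([0.3, 0.1, -0.2, 0.05], [0.2, -0.1, 0.3, 0.1]))
```

The printed defect is only limited by the finite-difference step; with all `alpha` equal to zero the map reduces to the identity.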

For us, it is important to note that the requirement that the form of the wave front equations be conserved does not by itself lead to the Lorentz transformations since it also allows Möbius transformations. To remove the latter one can make the additional demand that finite values of the initial coordinates should lead to finite values of the transformed ones. This additional requirement is satisfied only if all the constants $\alpha_k$ in the Möbius transformation are zero, making the transformation the identity. Instead of this additional requirement one can make another, namely, the condition that uniform rectilinear motion be conserved (condition (a)); this is how we proceeded in the text. Either of these additional requirements leads uniquely to the Lorentz transformation, apart from a possible change of scale.

It is also of importance that the requirement that the coordinates remain finite refers to space-time as a whole, whereas the conservation condition for uniform straight line motion is strictly local.†


† The discussion and conclusions of this appendix follow very closely those of Weyl [7].



APPENDIX B 


PROOF OF THE UNIQUENESS OF THE ENERGY 
MOMENTUM TENSOR OF THE 
ELECTROMAGNETIC FIELD 


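For the electromagnetic field in vacuum the state functions are the components of the field

$$E_1, E_2, E_3; \qquad H_1, H_2, H_3 \tag{B.01}$$

and the first order equations of motion satisfied by them (field equations) are Maxwell’s equations

$$\operatorname{curl}\mathbf{H} - \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} = 0, \qquad \operatorname{curl}\mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{H}}{\partial t} = 0 \tag{B.02}$$

together with the supplementary conditions

$$\operatorname{div}\mathbf{E} = 0, \qquad \operatorname{div}\mathbf{H} = 0 \tag{B.03}$$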

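As in Section 23 we shall characterize an infinitesimal Lorentz transformation by the quantities

$$\omega_{10} = \frac{V_x}{c}, \quad \omega_{20} = \frac{V_y}{c}, \quad \omega_{30} = \frac{V_z}{c}; \qquad \omega_{23} = \omega_x, \quad \omega_{31} = \omega_y, \quad \omega_{12} = \omega_z \tag{B.04}$$

Using three-dimensional notation we can write down the change in a field component in an infinitesimal Lorentz transformation as

$$\delta\mathbf{E} = \frac{1}{c}[\mathbf{V} \times \mathbf{H}] + [\boldsymbol{\omega} \times \mathbf{E}], \qquad \delta\mathbf{H} = -\frac{1}{c}[\mathbf{V} \times \mathbf{E}] + [\boldsymbol{\omega} \times \mathbf{H}] \tag{B.05}$$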


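Comparison of these equations with the general equation (31*.04) gives us the coefficients (31*.09) and at the same time the operators $X^{lm}$. We get

$$X^{10} = -X^{01} = H_2 \frac{\partial}{\partial E_3} - H_3 \frac{\partial}{\partial E_2} - E_2 \frac{\partial}{\partial H_3} + E_3 \frac{\partial}{\partial H_2} \tag{B.06}$$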


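and two similar equations obtained from (B.06) by cyclic interchange of the indices 1, 2 and 3. Introducing the antisymmetric set of quantities

$$\varepsilon_{ikl} = \begin{cases} +1, & \text{if } (i, k, l) \text{ are an even permutation of } (1, 2, 3), \\ -1, & \text{if } (i, k, l) \text{ are an odd permutation of } (1, 2, 3), \\ \phantom{+}0, & \text{if there are equal suffixes among the } (i, k, l) \end{cases} \tag{B.07}$$

we can write down three expressions of the form of (B.06) as follows:

$$X^{i0} = \sum_{k,l=1}^{3} \varepsilon_{ikl} \left( H_k \frac{\partial}{\partial E_l} - E_k \frac{\partial}{\partial H_l} \right) \tag{B.08}$$

If one looks upon the set of three quantities $X^{10}$, $X^{20}$ and $X^{30}$ as a three-dimensional vector $\mathbf{X}$, one can replace (B.08) by

$$\mathbf{X} = \left[ \mathbf{H} \times \frac{\partial}{\partial \mathbf{E}} \right] - \left[ \mathbf{E} \times \frac{\partial}{\partial \mathbf{H}} \right] \tag{B.09}$$

The “spatial” operators $X^{ik}$ with i, k = 1, 2, 3 can be written as

$$X^{ik} = -X^{ki} = E_i \frac{\partial}{\partial E_k} - E_k \frac{\partial}{\partial E_i} + H_i \frac{\partial}{\partial H_k} - H_k \frac{\partial}{\partial H_i} \tag{B.10}$$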


In Section 31* we introduced the following equation for the change of a field function in an infinitesimal Lorentz transformation

$$\delta f = \frac{1}{2} \sum_{l,m=0}^{3} \omega_{lm} X^{lm}(f) \tag{B.11}$$

Inserting for f in turn $f = E_i$ and $f = H_i$ (i = 1, 2, 3) we again return to equations (B.05).


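Let us now form the equations that express the vanishing of the divergence of $T^{\mu\nu}$ (they correspond to (31*.16)). We get

$$\frac{1}{c} \sum_{k=1}^{3} \left( \frac{\partial T^{\alpha 0}}{\partial E_k}\frac{\partial E_k}{\partial t} + \frac{\partial T^{\alpha 0}}{\partial H_k}\frac{\partial H_k}{\partial t} \right) + \sum_{i,k=1}^{3} \left( \frac{\partial T^{\alpha i}}{\partial E_k}\frac{\partial E_k}{\partial x_i} + \frac{\partial T^{\alpha i}}{\partial H_k}\frac{\partial H_k}{\partial x_i} \right) = 0 \tag{B.12}$$

where $\alpha$ = 0, 1, 2, 3. These equations must be algebraic consequences of Maxwell’s equations. Let us first write equations (B.02) in the form

$$\frac{\partial E_k}{\partial t} = c \sum_{i,l=1}^{3} \varepsilon_{kil} \frac{\partial H_l}{\partial x_i}; \qquad \frac{\partial H_k}{\partial t} = -c \sum_{i,l=1}^{3} \varepsilon_{kil} \frac{\partial E_l}{\partial x_i} \tag{B.13}$$

Eliminating time derivatives from (B.12) with the aid of (B.13) we get

$$\sum_{i,k=1}^{3} \left( \frac{\partial T^{\alpha i}}{\partial E_k} - \sum_{l=1}^{3} \varepsilon_{ikl} \frac{\partial T^{\alpha 0}}{\partial H_l} \right) \frac{\partial E_k}{\partial x_i} + \sum_{i,k=1}^{3} \left( \frac{\partial T^{\alpha i}}{\partial H_k} + \sum_{l=1}^{3} \varepsilon_{ikl} \frac{\partial T^{\alpha 0}}{\partial E_l} \right) \frac{\partial H_k}{\partial x_i} = 0 \tag{B.14}$$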


These equations must be algebraic consequences of (B.03). Therefore the left- 
hand sides must be of the form 
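Comparing (B.14) with (B.15) we are led to the following relations. In the first place the coefficients of $\partial E_i/\partial x_i$ for i = 1, 2, 3 must be equal. This gives

$$\frac{\partial T^{\alpha 1}}{\partial E_1} = \frac{\partial T^{\alpha 2}}{\partial E_2} = \frac{\partial T^{\alpha 3}}{\partial E_3} \tag{B.16}$$

Similarly the coefficients of $\partial H_i/\partial x_i$ must be equal for i = 1, 2, 3. Hence

$$\frac{\partial T^{\alpha 1}}{\partial H_1} = \frac{\partial T^{\alpha 2}}{\partial H_2} = \frac{\partial T^{\alpha 3}}{\partial H_3} \tag{B.17}$$

Secondly, for $i \neq k$ the coefficients of $\partial E_k/\partial x_i$ and $\partial H_k/\partial x_i$ in (B.14) must become zero, since such terms do not appear in (B.15). This gives us the relations

$$\frac{\partial T^{\alpha i}}{\partial E_k} - \sum_{l=1}^{3} \varepsilon_{ikl} \frac{\partial T^{\alpha 0}}{\partial H_l} = 0 \tag{B.18}$$

$$\frac{\partial T^{\alpha i}}{\partial H_k} + \sum_{l=1}^{3} \varepsilon_{ikl} \frac{\partial T^{\alpha 0}}{\partial E_l} = 0 \tag{B.19}$$

Hence if $i \neq k$

$$\frac{\partial T^{\alpha i}}{\partial E_k} + \frac{\partial T^{\alpha k}}{\partial E_i} = 0 \tag{B.20}$$

$$\frac{\partial T^{\alpha i}}{\partial H_k} + \frac{\partial T^{\alpha k}}{\partial H_i} = 0 \tag{B.21}$$

If (i, k, l) is some permutation of (1, 2, 3) we easily see from (B.20) that

$$\frac{\partial^2 T^{\alpha i}}{\partial E_k\,\partial E_l} = 0 \tag{B.22}$$

and from (B.21) that

$$\frac{\partial^2 T^{\alpha i}}{\partial H_k\,\partial H_l} = 0 \tag{B.23}$$

Differentiating (B.18) with respect to $E_l$ and using (B.22) we get

$$\frac{\partial^2 T^{\alpha 0}}{\partial E_l\,\partial H_l} = 0 \qquad (l = 1, 2, 3) \tag{B.24}$$

The same equation can be obtained by differentiating (B.19) with respect to $H_l$ and using (B.23).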

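Let us now form a system of equations for $T^{00}$. According to (B.24) we have

$$\frac{\partial^2 T^{00}}{\partial E_l\,\partial H_l} = 0 \qquad (l = 1, 2, 3) \tag{B.25}$$

Putting $\alpha = 0$ in (B.18) we get

$$\frac{\partial T^{0i}}{\partial E_k} = \varepsilon_{ikl} \frac{\partial T^{00}}{\partial H_l} \tag{B.26}$$

On the other hand, equation (B.24) for $\alpha = i$ gives

$$\frac{\partial^2 T^{i0}}{\partial E_k\,\partial H_k} = 0 \tag{B.27}$$

Hence

$$\frac{\partial^2 T^{00}}{\partial H_k\,\partial H_l} = 0 \qquad (k \neq l) \tag{B.28}$$

Similarly we obtain

$$\frac{\partial^2 T^{00}}{\partial E_k\,\partial E_l} = 0 \qquad (k \neq l) \tag{B.29}$$

Putting $\alpha = 0$ in (B.19) and replacing k by l we get

$$\frac{\partial T^{0i}}{\partial H_l} = \varepsilon_{ikl} \frac{\partial T^{00}}{\partial E_k} \tag{B.30}$$

Differentiating (B.26) with respect to $H_l$, (B.30) with respect to $E_k$ and equating the results we obtain

$$\frac{\partial^2 T^{00}}{\partial E_k^2} = \frac{\partial^2 T^{00}}{\partial H_l^2} \qquad (k \neq l) \tag{B.31}$$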


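The system of equations we have obtained for $T^{00}$ is easy to solve. Since $T^{00}$ is a three-dimensional scalar, it can depend on the field components only through the quantities

$$\xi = \tfrac{1}{2}\mathbf{E}^2, \qquad \eta = (\mathbf{E} \cdot \mathbf{H}), \qquad \zeta = \tfrac{1}{2}\mathbf{H}^2 \tag{B.32}$$

so that

$$T^{00} = f(\xi, \eta, \zeta) \tag{B.33}$$

Indeed, this expression is a solution of the equation $X^{ik}(T^{00}) = 0$, where $X^{ik}$ is given by (B.10). Let us now use equation (B.25).

We have

$$\frac{\partial T^{00}}{\partial E_k} = E_k \frac{\partial f}{\partial \xi} + H_k \frac{\partial f}{\partial \eta} \tag{B.34}$$

and further

$$\frac{\partial^2 T^{00}}{\partial E_k\,\partial H_k} = \frac{\partial f}{\partial \eta} + E_k H_k \left( \frac{\partial^2 f}{\partial \xi\,\partial \zeta} + \frac{\partial^2 f}{\partial \eta^2} \right) + E_k^2 \frac{\partial^2 f}{\partial \xi\,\partial \eta} + H_k^2 \frac{\partial^2 f}{\partial \eta\,\partial \zeta} \tag{B.35}$$

As a consequence of (B.25) this expression must vanish for k = 1, 2, 3. Evidently one can always make a rotation of axes in such a way that, say for k = 1, the quantities $E_1$ and $H_1$ vanish simultaneously. Then equations (B.25) and (B.35) with k = 1 give

$$\frac{\partial f}{\partial \eta} = 0 \tag{B.36}$$

and all second derivatives involving differentiation with respect to $\eta$ also vanish. If one takes this into account, the same equations for $k \neq 1$ give

$$\frac{\partial^2 f}{\partial \xi\,\partial \zeta} = 0 \tag{B.37}$$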
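By virtue of (B.36) the equations (B.28) and (B.29) lead to

$$\frac{\partial^2 f}{\partial \xi^2} = 0, \qquad \frac{\partial^2 f}{\partial \zeta^2} = 0 \tag{B.38}$$

Thus the quantity f in (B.33) is a linear function of $\xi$ and $\zeta$. Taking this into account, we get

$$\frac{\partial^2 T^{00}}{\partial E_k^2} = \frac{\partial f}{\partial \xi}, \qquad \frac{\partial^2 T^{00}}{\partial H_l^2} = \frac{\partial f}{\partial \zeta} \tag{B.39}$$

and then equation (B.31) gives

$$\frac{\partial f}{\partial \xi} = \frac{\partial f}{\partial \zeta} \tag{B.40}$$

Since f is a linear function, we get

$$f = \alpha(\xi + \zeta) + \lambda \tag{B.41}$$

where $\alpha$ and $\lambda$ are constants and therefore

$$T^{00} = \frac{\alpha}{2}(\mathbf{E}^2 + \mathbf{H}^2) + \lambda \tag{B.42}$$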

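Knowing $T^{00}$ we can find the remaining components of $T^{\mu\nu}$ from equations previously derived. Thus, insertion of the values

$$\frac{\partial T^{00}}{\partial \mathbf{E}} = \alpha\mathbf{E}, \qquad \frac{\partial T^{00}}{\partial \mathbf{H}} = \alpha\mathbf{H} \tag{B.43}$$

into the formula

$$T^{i0} = \tfrac{1}{2} X^{0i}(T^{00}) \tag{B.44}$$

(see (31*.18)) gives the energy flux

$$T^{i0} = \alpha [\mathbf{E} \times \mathbf{H}]_i \tag{B.45}$$

(the Umov-Poynting vector). This quantity may be written in the form

$$T^{i0} = \alpha \sum_{p,q=1}^{3} \varepsilon_{ipq} E_p H_q \tag{B.46}$$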

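Inserting this expression into the formula

$$T^{ik} = \left( \tfrac{1}{2} X^{0k} X^{0i} - \delta_{ik} \right) T^{00} \qquad (i, k = 1, 2, 3) \tag{B.47}$$

we obtain, after some manipulation

$$T^{ik} = \alpha \left[ \tfrac{1}{2}\delta_{ik}(\mathbf{E}^2 + \mathbf{H}^2) - E_i E_k - H_i H_k \right] - \lambda\delta_{ik} \qquad (i, k = 1, 2, 3) \tag{B.48}$$

Here we have used the relation

$$\sum_{s=1}^{3} \varepsilon_{ips}\varepsilon_{kqs} = \delta_{ik}\delta_{pq} - \delta_{iq}\delta_{kp} \tag{B.49}$$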
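The contraction identity for the antisymmetric symbols, read here as $\sum_s \varepsilon_{ips}\varepsilon_{kqs} = \delta_{ik}\delta_{pq} - \delta_{iq}\delta_{kp}$, can be confirmed by brute force over all index values. The short sketch below does this; the closed-form expression used for the symbol and the function names are choices made for this illustration only.

```python
from itertools import product


def epsilon(i, k, l):
    """Antisymmetric symbol for indices in {1, 2, 3}: +1, -1 or 0."""
    # The product is +2 / -2 for even / odd permutations, 0 for repeats.
    return (i - k) * (k - l) * (l - i) // 2


def delta(a, b):
    """Kronecker delta."""
    return 1 if a == b else 0


def contraction_identity_holds():
    """Check sum_s eps(i,p,s) eps(k,q,s) = d(i,k) d(p,q) - d(i,q) d(k,p)."""
    idx = (1, 2, 3)
    for i, p, k, q in product(idx, repeat=4):
        lhs = sum(epsilon(i, p, s) * epsilon(k, q, s) for s in idx)
        rhs = delta(i, k) * delta(p, q) - delta(i, q) * delta(k, p)
        if lhs != rhs:
            return False
    return True


print(contraction_identity_holds())
```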

It is not difficult to verify that the expressions found for the energy tensor also satisfy all the other equations (B.16)-(B.19) and not only those which were used in their derivation.

If one assumes that for E = 0 and H = 0 one should have $T^{\mu\nu} = 0$ one gets $\lambda = 0$. If one interprets $T^{\mu\nu}$ as the energy-momentum tensor (and not the mass tensor), then, with the customary choice of units, one has $\alpha = 1/4\pi$. Thus the tensor $T^{\mu\nu}$ is determined uniquely and the result agrees with the generally accepted formulae (33.23)-(33.25).
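For a concrete feel for the result, the components found above can be evaluated for given field vectors. The sketch below is an illustration only: it reads the final formulae as $T^{00} = \tfrac{\alpha}{2}(E^2 + H^2) + \lambda$, $T^{i0} = \alpha[\mathbf{E} \times \mathbf{H}]_i$ and $T^{ik} = \alpha[\tfrac{1}{2}\delta_{ik}(E^2 + H^2) - E_iE_k - H_iH_k] - \lambda\delta_{ik}$, and the function names and sample fields are hypothetical.

```python
import math


def cross(a, b):
    """Three-dimensional vector product."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def em_energy_tensor(E, H, alpha=1.0 / (4.0 * math.pi), lam=0.0):
    """Return (T00, [T10, T20, T30], 3x3 list T^{ik}) for field vectors E, H."""
    e2 = sum(x * x for x in E)
    h2 = sum(x * x for x in H)
    t00 = 0.5 * alpha * (e2 + h2) + lam
    flux = [alpha * s for s in cross(E, H)]
    stress = [[alpha * ((0.5 * (e2 + h2) if i == k else 0.0)
                        - E[i] * E[k] - H[i] * H[k])
               - (lam if i == k else 0.0)
               for k in range(3)]
              for i in range(3)]
    return t00, flux, stress


# Sample fields (arbitrary illustrative values).
t00, flux, stress = em_energy_tensor([1.0, 2.0, 3.0], [-1.0, 0.5, 2.0])
print(t00, flux, stress)
```

With $\lambda = 0$ the spatial trace $\sum_i T^{ii}$ equals $T^{00}$, so the four-dimensional trace of the tensor vanishes; this serves as a simple consistency check.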



APPENDIX C 


PROOF OF THE UNIQUENESS OF THE HYDRODYNAMIC 

MASS TENSOR 


In Section 32, when we were deriving the relativistic generalization of the hydrodynamic equations of an ideal fluid we started from a definite form of the mass tensor. We shall show here that the argument can be conducted in the reverse order: taking the equations of motion as one’s starting point one can derive the mass tensor from them.

As functions of state one can take the pressure p and the invariant density $\rho^*$ (two scalars) and the four components $u^i$ of the four-dimensional velocity vector.

The quantities p and $\rho^*$ will be related by the equation of state and the $u^i$ will be connected by the identity

$$\sum_{s=0}^{3} e_s (u^s)^2 = c^2 \tag{C.01}$$

In place of the pressure p it is convenient to take the dynamic pressure P, which is related to p and $\rho^*$ by the equation

$$P = \int_0^p \frac{dp}{\rho^*} \tag{C.02}$$

As a consequence of (32.27) we have

$$P = \Pi + \frac{p}{\rho^*} \tag{C.03}$$

where $\Pi$ is the potential energy of unit mass. If f is any function of P, $\rho^*$ and the $u^i$ its variation in an infinitesimal Lorentz transformation will be

$$\delta f = \sum_{i=0}^{3} \frac{\partial f}{\partial u^i}\, \delta u^i \tag{C.04}$$

Equating (C.04) to the general expression

$$\delta f = \frac{1}{2} \sum_{i,k=0}^{3} \omega_{ik} X^{ik}(f) \tag{C.05}$$

we get the following expression for the operators $X^{ik}$:

$$X^{ik} = e_i u^k \frac{\partial}{\partial u^i} - e_k u^i \frac{\partial}{\partial u^k} \tag{C.06}$$
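Evidently the scalars are not involved in the formula for $\delta f$ and therefore also not in the expression for the operators $X^{ik}$. As for the partial derivatives with respect to the $u^i$, it is immaterial in forming the operators $X^{ik}$ whether one assumes all four quantities $u^i$ to be independent or whether one takes into account their relationship (C.01).

We introduce the symmetrical system of quantities

$$a^{ik} = e_i \delta_{ik} - \frac{u^i u^k}{c^2} \tag{C.07}$$

which possess the properties

$$\sum_{s=0}^{3} e_s a^{is} u^s = 0 \tag{C.08}$$

and we denote by $w^i$ the four-dimensional acceleration vector

$$w^i = \sum_{k=0}^{3} u^k \frac{\partial u^i}{\partial x_k} \tag{C.09}$$

Then the hydrodynamic equations may be written in the form

$$\left(1 + \frac{P}{c^2}\right) w^i - \sum_{k=0}^{3} a^{ik} \frac{\partial P}{\partial x_k} = 0 \tag{C.10}$$

$$\rho^* \sum_{k=0}^{3} \frac{\partial u^k}{\partial x_k} + \frac{d\rho^*}{dP} \sum_{k=0}^{3} u^k \frac{\partial P}{\partial x_k} = 0 \tag{C.11}$$

In addition, it is necessary to take into account the relation

$$\sum_{s=0}^{3} e_s u^s \frac{\partial u^s}{\partial x_k} = 0 \tag{C.12}$$

which follows from (C.01).

As a consequence of (C.12) we have the equation

$$\sum_{s=0}^{3} e_s u^s w^s = 0 \tag{C.13}$$

and the relations (C.08) show that this equation is consistent with the equations of motion (C.10).

We now write down the equations that express the fact that the divergence of the mass tensor is zero by virtue of the field equations. We have

$$\sum_{s,k=0}^{3} \frac{\partial T^{ik}}{\partial u^s}\frac{\partial u^s}{\partial x_k} + \sum_{k=0}^{3} \frac{\partial T^{ik}}{\partial P}\frac{\partial P}{\partial x_k} = \sum_{s=0}^{3} \lambda^i_s \left\{ \left(1 + \frac{P}{c^2}\right) \sum_{k=0}^{3} u^k \frac{\partial u^s}{\partial x_k} - \sum_{k=0}^{3} a^{sk} \frac{\partial P}{\partial x_k} \right\} + \lambda^i \left( \rho^* \sum_{k=0}^{3} \frac{\partial u^k}{\partial x_k} + \frac{d\rho^*}{dP} \sum_{k=0}^{3} u^k \frac{\partial P}{\partial x_k} \right) + \sum_{k=0}^{3} \mu^{ik} \sum_{s=0}^{3} e_s u^s \frac{\partial u^s}{\partial x_k} \tag{C.14}$$

Here the right-hand side represents the sum of the expressions (C.10), (C.11) and (C.12) multiplied by the Lagrangian multipliers $\lambda^i_s$, $\lambda^i$ and $\mu^{ik}$ respectively. After introduction of the Lagrangian multipliers equation (C.14) can be looked upon as an identity in the derivatives with respect to the coordinates. Equating first the coefficients of $\partial u^s/\partial x_k$ and then those of $\partial P/\partial x_k$ we obtain

$$\frac{\partial T^{ik}}{\partial u^s} = \lambda^i_s \left(1 + \frac{P}{c^2}\right) u^k + \lambda^i \rho^* \delta_{sk} + \mu^{ik} e_s u^s \tag{C.15}$$

$$\frac{\partial T^{ik}}{\partial P} = -\sum_{s=0}^{3} \lambda^i_s a^{sk} + \lambda^i \frac{d\rho^*}{dP}\, u^k \tag{C.16}$$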


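The Lagrange multipliers must be eliminated from these equations. Applying the relation (C.08) to the second equation, we get

$$\lambda^i = \frac{1}{c^2} \sum_{s=0}^{3} e_s u^s \frac{\partial T^{is}}{\partial \rho^*} \tag{C.17}$$

Inserting this value into (C.16) we can write that equation (replacing k by j) in the form

$$\sum_{s=0}^{3} \lambda^i_s a^{sj} = -\sum_{s=0}^{3} e_s a^{sj} \frac{\partial T^{is}}{\partial P} \tag{C.18}$$

We now multiply both sides of (C.15) by $a^{sj}$ and sum over s. On the right-hand side the factor multiplying $\mu^{ik}$ becomes zero as a result of (C.08), so that the quantities $\mu^{ik}$ are eliminated. The $\lambda^i_s$ on the other hand enter only through the sum (C.18), which is known. Finally, for the $\lambda^i$ one can insert their values from (C.17). As a result we obtain a relation which may be written as

$$\sum_{s=0}^{3} a^{sj} \frac{\partial T^{ik}}{\partial u^s} = -\left(1 + \frac{P}{c^2}\right) u^k \sum_{s=0}^{3} e_s a^{sj} \frac{\partial T^{is}}{\partial P} + \rho^* a^{jk}\, \frac{1}{c^2} \sum_{r=0}^{3} e_r u^r \frac{\partial T^{ir}}{\partial \rho^*} \tag{C.19}$$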


This is the required system of equations for the tensor $T^{ik}$, which follows from the condition that its divergence vanishes by virtue of the field equations.

Now we must take into account that the set of quantities $T^{ik}$ form a tensor. This we can do by one of two methods. We can adjoin to (C.19) another system of differential equations, namely

$$X^{rs}(T^{ik}) = e_i \delta_{ir} T^{sk} - e_i \delta_{is} T^{rk} + e_k \delta_{kr} T^{is} - e_k \delta_{ks} T^{ir} \tag{C.20}$$

where the operator $X^{rs}$ has the value given by (C.06). Alternatively, we can write down directly the most general expression for a tensor which is a function of the vector $u^s$ and of certain scalars. This expression has the form

$$T^{ik} = A u^i u^k + B e_i \delta_{ik} \tag{C.21}$$

where the quantities A and B depend only on the scalar $\rho^*$ (or equivalently only on P). A dependence on the invariant of the vector $u^s$ need not be taken into account, because according to (C.01) that invariant is a constant.

It is easy to verify that the expression (C.21) satisfies the system of equations (C.20). It is a general solution of that system.

We can now insert the expressions (C.21) for $T^{ik}$ into (C.19). We first calculate the internal sum in (C.19), using the relation (C.01). We get

$$\frac{1}{c^2} \sum_{r=0}^{3} e_r u^r \frac{\partial T^{ir}}{\partial \rho^*} = \left( \frac{dA}{d\rho^*} + \frac{1}{c^2}\frac{dB}{d\rho^*} \right) u^i \tag{C.22}$$

Inserting (C.21) and (C.22) into (C.19) and using (C.08) we obtain an equation of the form

$$\sum_{s=0}^{3} a^{sj} \left( M u^i \delta_{sk} + N u^k \delta_{si} \right) = 0 \tag{C.23}$$

or

$$M a^{jk} u^i + N a^{ij} u^k = 0 \tag{C.24}$$

where we have put for brevity

$$M = A - \rho^* \frac{dA}{d\rho^*} - \frac{\rho^*}{c^2}\frac{dB}{d\rho^*} \tag{C.25}$$

$$N = A + \left(1 + \frac{P}{c^2}\right) \frac{dB}{dP} \tag{C.26}$$

It is evident that the equation (C.24) can hold for all values of i, j and k only if M = 0 and N = 0. Thus the scalars A and B in the expression for the tensor $T^{ik}$ must satisfy the equations

$$A - \rho^* \frac{dA}{d\rho^*} - \frac{\rho^*}{c^2}\frac{dB}{d\rho^*} = 0 \tag{C.27}$$

$$A + \left(1 + \frac{P}{c^2}\right) \frac{dB}{dP} = 0 \tag{C.28}$$


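Differentiating the second equation with respect to $\rho^*$ we obtain

$$\frac{dA}{d\rho^*} + \frac{1}{c^2}\frac{dB}{d\rho^*} + \left(1 + \frac{P}{c^2}\right) \frac{d}{d\rho^*}\left(\frac{dB}{dP}\right) = 0 \tag{C.29}$$

and then the first equation gives

$$A = -\rho^* \left(1 + \frac{P}{c^2}\right) \frac{d}{d\rho^*}\left(\frac{dB}{dP}\right) \tag{C.30}$$

Eliminating A from (C.28) and (C.30) we get

$$\left( \rho^* \frac{d}{d\rho^*} - 1 \right) \frac{dB}{dP} = 0 \tag{C.31}$$

Hence

$$\frac{dB}{dP} = -\alpha\rho^* \tag{C.32}$$

where $\alpha$ is a constant. But owing to the relation (C.02) between the dynamical pressure P and the ordinary pressure p we have

$$\rho^*\, dP = dp \tag{C.33}$$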
Therefore equation (C.32) can be written as

$$dB + \alpha\, dp = 0 \tag{C.34}$$

and its solution gives us

$$B = -\alpha p + \lambda \tag{C.35}$$

where $\lambda$ is a new constant. The quantity A may be determined from (C.28) and (C.32); we get

$$A = \alpha\rho^* \left(1 + \frac{P}{c^2}\right) \tag{C.36}$$

and the expression for the mass tensor takes on the form

$$T^{ik} = \alpha \left[ \rho^* \left(1 + \frac{P}{c^2}\right) u^i u^k - p\, e_i \delta_{ik} \right] + \lambda e_i \delta_{ik} \tag{C.37}$$

If we assume that at infinity, where $\rho^* = 0$ and p = 0, we have $T^{ik} = 0$, then $\lambda = 0$. With the usual choice of units one has to put $\alpha = 1$ in the energy tensor and $\alpha = 1/c^2$ in the mass tensor. This determines both constants and the formula for the mass tensor assumes the following final form:

$$c^2 T^{ik} = \rho^* \left(1 + \frac{P}{c^2}\right) u^i u^k - p\, e_i \delta_{ik} \tag{C.38}$$

As a consequence of (C.03) this formula agrees with (32.28).
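A quick numerical illustration of the final formula, read here as $c^2 T^{ik} = \rho^*(1 + P/c^2)u^iu^k - p\,e_i\delta_{ik}$ with $P = \Pi + p/\rho^*$: for a fluid element at rest the mass density $T^{00}$ should reduce to $\rho^*(1 + \Pi/c^2)$ and the spatial diagonal components to $p/c^2$. The sketch assumes the sign convention $e_0 = 1$, $e_1 = e_2 = e_3 = -1$ and a rest velocity $u = (c, 0, 0, 0)$ satisfying (C.01); the function name and the numbers are hypothetical.

```python
# Assumed signature e_0 = +1, e_1 = e_2 = e_3 = -1 (an assumption of this sketch).
E_SIGN = [1.0, -1.0, -1.0, -1.0]


def mass_tensor(rho, p, potential_energy, u, c):
    """T^{ik} = [rho (1 + P/c^2) u^i u^k - p e_i delta_ik] / c^2, P = Pi + p/rho."""
    big_p = potential_energy + p / rho
    return [[(rho * (1.0 + big_p / c ** 2) * u[i] * u[k]
              - (p * E_SIGN[i] if i == k else 0.0)) / c ** 2
             for k in range(4)]
            for i in range(4)]


# Hypothetical values in arbitrary consistent units.
rho, p, Pi, c = 2.0, 3.0, 5.0, 10.0
T = mass_tensor(rho, p, Pi, [c, 0.0, 0.0, 0.0], c)
print(T[0][0], T[1][1])
```

The pressure drops out of $T^{00}$ because the $-p$ term cancels the $p/\rho^*$ part of $P$, which is the content of the remark that the formula agrees with (32.28).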



APPENDIX D 


THE TRANSFORMATION OF THE 
EINSTEIN TENSOR 


(a) The Transformation of the Second Rank Curvature Tensor†
We begin with the transformation of the covariant second rank curvature tensor and show that in the expression for this tensor, it is possible to separate off terms containing the d’Alembert operator applied to that component of the metric tensor which has the same suffixes.

By definition we have

$$R_{\mu\nu} = g^{\alpha\beta} R_{\mu\alpha,\beta\nu} \tag{D.01}$$

where, by (44.08), the fourth rank curvature tensor is


R. 


= l(. 

a» 3v 2 \ 


8 \, L _ 8 Xn 


— a r p r° a- a r p r° 

Vao uQ va 1 Jpa u.v a 


, - *«p 8 Xa 8 \& \ 

8x x 8x & r cx v dx v 8x^ 8x {i Sx„ 8xJ yp ° L ^ va ^ ^ ^ a0 

(D.02) 

(this equation differs from (44.08) only by the naming of the indices). Therefore 


R = 1 ---ft*?.- + I (t» 3-xjis- 

l * v 2 y ox a 8 xq 2 y 8x^ 8x v 2 y 8x^ 8x^ 


dt u — 1 /iocs d X 


8x, t 8x u 

For brevity we put 

r = « r° 

1 p Jpo A 

where in agreement with the definition (41.15) 

I M = <7^3 


_ 1 ^ 9y.fr _ pp pa _ 1 _ pp p 

o " a. 9 i/po 1 [its 1 va ' nv 1 p 


(D.03) 

(D.04) 

(D.05) 


In (D.03) the first term is already in the form of the d’Alembert operator applied to $g_{\mu\nu}$. The remaining terms may be so transformed that the second derivatives of components of the metric tensor only appear through the first derivatives of the $\Gamma_\rho$. To perform this transformation we need the relations

and 


aT ' 0 * 5 lr 

ll [j. 

y 8 % 


r* = - 


V(-g) 8 h 


Wi-9 )‘5“ P ] 


† See also the papers by de Donder [16] and by Lanczos [17].


(D.06) 

(D.07) 

(D.08) 
which were derived in Section 41. From these relations it follows that

r .Av l g i°g <-g) 

v 9 dx & 2 8x v 

Differentiating (D.06) with respect to x v , we get 


r 


dHog(-g) 8g*» 8g t 


a@ 


8x^ 8x v 


8x 8x 

H V 


dx. 


We note that owing to the symmetry of (D.10) in p and v we have 

d 9+ _ 8 ^ 8 9+ 


8x„ 8x„ 


8x 8x 

\X V 


Differentiating (D.09) with respect to x^ we get 

1 8 2 log (— g) er v 8g, 
— —r 


-f* 


Sx^ 8 x b 


2 8x^ 8x v 


8x„ 


8 \ 8 % 


In the last term on the right we may replace Sg.J&x^ by 
1 ( 8 9,a , 


1 (Sg^ fyA „ 1 8 9& 

2 \ 8x~ ' 3x a / ViaP T 2 & v 
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(D.09) 


( 0 . 10 ) 


(D.ll) 


(D. 12) 


(D.13) 


where $\Gamma_{\nu,\alpha\beta}$ is the usual Christoffel symbol of the first kind (38.30). Equation (D.12) then appears as


-<r* 


8 %a 
8 % 8xp 


1 8*log(-g) _8J\, 1 8grt 8g^ 

* Cl 


8x 8x 

[L V 


8x.. 


2 dx^ 8x v 


+ r „ 


8g*V 
03 dx. 


(D.14) 

We interchange the indices $\mu$ and $\nu$ and also the summation indices $\alpha$ and $\beta$ on the left. Then we can write

3f 3 




8 \ 


8i c„ 8x„ 


_ 


laMog(-sr) 8Y 18g**8g afi 

2 8x^ 8x v 8x, t 2 8x v 8 Xfl 8x s 


(I).15) 

Thus the sum of the expressions (D.10), (D.14) and (D.15), half of which enters (D.03), is equal to

' 8 \» 8 X* 8 \» 


ST 


v&V 8x v 


8x^ 8 xq 


8x.. 8x, 


) 


__ffi + 54 + r ^ + r (d is. 

(at. + sxj + "•» a*. + «■•= at, ' 


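(as a result of (D.11) the remaining terms cancel).
We introduce the notation

$$\Gamma_{\mu\nu} = \frac{1}{2}\left( \frac{\partial\Gamma_\mu}{\partial x_\nu} + \frac{\partial\Gamma_\nu}{\partial x_\mu} \right) - \Gamma^\rho_{\mu\nu}\Gamma_\rho \tag{D.17}$$

The quantities $\Gamma_{\mu\nu}$ are formed in the same way as the symmetrized covariant derivatives of a vector, although $\Gamma_\nu$ is not a vector. As a result of (D.16) the expression (D.03) for $R_{\mu\nu}$ then becomes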

D 


■ ~aB 


8 X 


l 


8 <r* L 1 r 




_^sr r° 
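To simplify the calculations to be performed on the terms involving first derivatives we shall use not only the notation $\Gamma_{\nu,\alpha\beta}$ and $\Gamma^\nu_{\alpha\beta}$ for the usual Christoffel symbols but also the notation

$$\Gamma^{\alpha\beta}_\nu = g^{\alpha\rho} g^{\beta\sigma}\, \Gamma_{\nu,\rho\sigma} \tag{D.19}$$

and

$$\Gamma^{\nu,\alpha\beta} = g^{\alpha\rho} g^{\beta\sigma}\, \Gamma^\nu_{\rho\sigma} \tag{D.20}$$

for the corresponding quantities with raised indices. We note that $\Gamma^{\mu,\alpha\beta}$ is given by

$$\Gamma^{\mu,\alpha\beta} = \frac{1}{2}\left( g^{\mu\rho}\frac{\partial g^{\alpha\beta}}{\partial x_\rho} - g^{\alpha\rho}\frac{\partial g^{\mu\beta}}{\partial x_\rho} - g^{\beta\rho}\frac{\partial g^{\mu\alpha}}{\partial x_\rho} \right) \tag{D.21}$$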

We consider the expression 


qvp JL -or“P — - qto 

dx p * dx p y 8x p , 


dx&dXa v« 


(D.21) 


(D.22) 


whose last term coincides with the last term in (D.18). We write this expression 
in the form 


We insert 


(D23) 


and then use the equation 


Then we get 


J'J-O _n in 

8x & ^ 11 - oS 


r =r 

p, va V, pa 


(D.24) 


(D.25) 


K = ( r o„s r v.pa + r.,.3 |?) (D. 26 ) 

Using the notation of (D.19) and relabelling, we may put 

^v = rfr a ^+rf|p (D. 27 ) 

Since the coefficient T* 0 is symmetric in a and (3 we can replace the factor 
multiplying it by its symmetric part, which is equal to idg^Jdx^. For the same 
reason we can replace Sg^Jdx^ by (D.13). Doing this we get 


(D.27) 


A T“ 0 d^ a0 I 1 Pag dffqg , n a gn 

A LLV O V C\ A U. n I A U. A V. 


But, as is easy to verify, 


Therefore 


8x^ 2 “ 8x v 


r« 0 ^ = - r ^ 
v a** a* u 


a = — - r — - r + r«T 

^ 2 v ' ap 2 li ’“ p acc v 11 v - a0 


(D.28) 


(D.29) 


(D.30) 


Equating the expressions (D.22) and (D.30) for the A we obtain the relation 


- r 3 4 I r ^ P — a *gp pa _ pagp _ a a3„pa dffpv 
2 v ' a3 8x^ i 2 "•“ p &c v 3 v “ 11 v - a0 9 9 8x & 8x a 


(D.31) 
which enables us to write the expression for $R_{\mu\nu}$ in the following final form

J? _ 1 u oc(i _ p _ S 9y.a , p a 0 p ([) 32) 

^ s9 8x 8x “ v 99 fi" Ar + •* '>■«& ' ' 


Sxg fix. 


From this it is easy to obtain an equation for the contravariant components 
of the second order curvature tensor. 

By differentiating relations of the form (D.07) it is simple to derive the equation

8Y a a s 


9\ia9\ 


8x„ 8x, 


8x a 0x g 

Using this we can write R as 


= ~ i 9^9^ 


-l r ( d9 ^ [ S 9y. a 8 9 JP \ (D.33) 

\ Sx n 8x b 0Xp 3x a / 


0y° 

0x. 8x, 


- r + r-T 


a V,a 3 


(D.34) 


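If we then raise the indices $\mu$ and $\nu$ we get the following simple expression for $R^{\mu\nu}$:

$$R^{\mu\nu} = -\frac{1}{2}\, g^{\alpha\beta} \frac{\partial^2 g^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} - \Gamma^{\mu\nu} + \Gamma^{\mu,\alpha\beta}\, \Gamma^\nu_{\alpha\beta} \tag{D.35}$$

The quantity $\Gamma^{\mu\nu}$ is obtained from $\Gamma_{\mu\nu}$ by raising the indices according to

$$\Gamma^{\mu\nu} = g^{\mu\rho} g^{\nu\sigma}\, \Gamma_{\rho\sigma} \tag{D.36}$$

and may be expressed directly in terms of the $\Gamma^\alpha$ defined by (D.08) as follows:

$$\Gamma^{\mu\nu} = \frac{1}{2}\left( g^{\mu\alpha}\frac{\partial\Gamma^\nu}{\partial x_\alpha} + g^{\nu\alpha}\frac{\partial\Gamma^\mu}{\partial x_\alpha} - \frac{\partial g^{\mu\nu}}{\partial x_\alpha}\, \Gamma^\alpha \right) \tag{D.37}$$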
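The step from the covariant to the contravariant quantity can be spelled out in a few lines. The following is a sketch of one way to do it, under the assumption that the covariant quantity is $\Gamma_{\mu\nu} = \tfrac{1}{2}(\partial\Gamma_\mu/\partial x_\nu + \partial\Gamma_\nu/\partial x_\mu) - \Gamma^\rho_{\mu\nu}\Gamma_\rho$ with $\Gamma_\rho = g_{\rho\tau}\Gamma^\tau$; it uses only $\partial g_{\rho\tau}/\partial x_\sigma = \Gamma_{\rho,\tau\sigma} + \Gamma_{\tau,\rho\sigma}$ and the rule for differentiating the inverse metric.

```latex
\begin{align*}
g^{\mu\rho} g^{\nu\sigma}\,\frac{\partial \Gamma_\rho}{\partial x_\sigma}
  &= g^{\nu\sigma}\frac{\partial \Gamma^\mu}{\partial x_\sigma}
   + g^{\mu\rho} g^{\nu\sigma}\frac{\partial g_{\rho\tau}}{\partial x_\sigma}\,\Gamma^\tau , \\
g^{\mu\rho} g^{\nu\sigma}\,\Gamma^{\lambda}_{\rho\sigma}\Gamma_\lambda
  &= \tfrac12\, g^{\mu\rho} g^{\nu\sigma}
     \left( \frac{\partial g_{\tau\sigma}}{\partial x_\rho}
          + \frac{\partial g_{\tau\rho}}{\partial x_\sigma}
          - \frac{\partial g_{\rho\sigma}}{\partial x_\tau} \right)\Gamma^\tau , \\
\Gamma^{\mu\nu}
  &= \tfrac12\left( g^{\mu\alpha}\frac{\partial \Gamma^\nu}{\partial x_\alpha}
                  + g^{\nu\alpha}\frac{\partial \Gamma^\mu}{\partial x_\alpha} \right)
   + \tfrac12\, g^{\mu\rho} g^{\nu\sigma}\frac{\partial g_{\rho\sigma}}{\partial x_\tau}\,\Gamma^\tau
   = \tfrac12\left( g^{\mu\alpha}\frac{\partial \Gamma^\nu}{\partial x_\alpha}
                  + g^{\nu\alpha}\frac{\partial \Gamma^\mu}{\partial x_\alpha}
                  - \frac{\partial g^{\mu\nu}}{\partial x_\alpha}\,\Gamma^\alpha \right).
\end{align*}
```

In the last line the terms containing $\partial g_{\rho\tau}/\partial x_\sigma$ and $\partial g_{\sigma\tau}/\partial x_\rho$ cancel between the symmetrized derivative and the Christoffel term, and $g^{\mu\rho}g^{\nu\sigma}\,\partial g_{\rho\sigma}/\partial x_\tau = -\partial g^{\mu\nu}/\partial x_\tau$.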


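(b) Transformation of the Invariant

We now form the invariant of the curvature tensor:

$$R = g_{\mu\nu} R^{\mu\nu} \tag{D.38}$$

We have

$$R = -\frac{1}{2}\, g_{\mu\nu}\, g^{\alpha\beta} \frac{\partial^2 g^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} - \Gamma + \Gamma^{\alpha\beta}_\nu\, \Gamma^\nu_{\alpha\beta} \tag{D.39}$$

where $\Gamma$ denotes the quantity

$$\Gamma = g_{\mu\nu}\Gamma^{\mu\nu} \tag{D.40}$$

Using the relation

$$\frac{\partial \log(-g)}{\partial x_\alpha} = -g_{\mu\nu}\frac{\partial g^{\mu\nu}}{\partial x_\alpha} \tag{D.41}$$

we obtain from (D.37)

$$\Gamma = \frac{\partial\Gamma^\alpha}{\partial x_\alpha} + \frac{1}{2}\frac{\partial \log(-g)}{\partial x_\alpha}\, \Gamma^\alpha \tag{D.42}$$

and by virtue of (41.16)

$$\Gamma = -\frac{1}{\sqrt{-g}}\, \frac{\partial^2 \left( \sqrt{-g}\, g^{\alpha\beta} \right)}{\partial x_\alpha\,\partial x_\beta} \tag{D.43}$$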
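Differentiating equation (D.41) with respect to $x_\beta$ we obtain, in analogy with (D.10)

$$g_{\mu\nu}\frac{\partial^2 g^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} = -\frac{\partial^2 \log(-g)}{\partial x_\alpha\,\partial x_\beta} - \frac{\partial g^{\mu\nu}}{\partial x_\alpha}\frac{\partial g_{\mu\nu}}{\partial x_\beta} \tag{D.44}$$

Inserting (D.44) into (D.39) we get

$$R = \frac{1}{2}\, g^{\alpha\beta}\frac{\partial^2 \log(-g)}{\partial x_\alpha\,\partial x_\beta} - \Gamma + \Gamma^{\alpha\beta}_\nu\, \Gamma^\nu_{\alpha\beta} + \frac{1}{2}\, g^{\alpha\beta}\frac{\partial g^{\mu\nu}}{\partial x_\alpha}\frac{\partial g_{\mu\nu}}{\partial x_\beta} \tag{D.45}$$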

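We see that the second derivatives of the metric tensor enter the expression for R only through the second derivatives of $\log(-g)$ and through $\Gamma$. The terms involving first derivatives can be transformed with the aid of the relation

$$\Gamma^{\alpha\beta}_\nu\, \Gamma^\nu_{\alpha\beta} + \frac{1}{2}\, g^{\alpha\beta}\frac{\partial g^{\mu\nu}}{\partial x_\alpha}\frac{\partial g_{\mu\nu}}{\partial x_\beta} = \frac{1}{2}\, \Gamma^\nu_{\alpha\beta}\frac{\partial g^{\alpha\beta}}{\partial x_\nu} \tag{D.46}$$

which is easily deducible from the equation

$$\Gamma^{\alpha\beta}_\nu = \frac{1}{2}\frac{\partial g^{\alpha\beta}}{\partial x_\nu} + \frac{1}{2}\, g^{\alpha\rho} g^{\beta\sigma}\left( \frac{\partial g_{\nu\rho}}{\partial x_\sigma} + \frac{\partial g_{\nu\sigma}}{\partial x_\rho} \right) \tag{D.47}$$

We now get the result

$$R = \frac{1}{2}\, g^{\alpha\beta}\frac{\partial^2 \log(-g)}{\partial x_\alpha\,\partial x_\beta} - \Gamma + \frac{1}{2}\, \Gamma^\nu_{\alpha\beta}\frac{\partial g^{\alpha\beta}}{\partial x_\nu} \tag{D.48}$$

This expression can be written in the form

$$R = g^{\alpha\beta}\frac{\partial^2 \log\sqrt{-g}}{\partial x_\alpha\,\partial x_\beta} - \Gamma^\alpha\frac{\partial \log\sqrt{-g}}{\partial x_\alpha} - \Gamma - L \tag{D.49}$$

where

$$L = -\frac{1}{2}\, \Gamma^\nu_{\alpha\beta}\frac{\partial g^{\alpha\beta}}{\partial x_\nu} - \Gamma^\alpha\frac{\partial \log\sqrt{-g}}{\partial x_\alpha} \tag{D.50}$$

Remembering the equation

$$\Box y = g^{\alpha\beta}\frac{\partial^2 y}{\partial x_\alpha\,\partial x_\beta} - \Gamma^\alpha\frac{\partial y}{\partial x_\alpha} \tag{D.51}$$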

which gives the d’Alembertian of any function y, we may write

$$R = \Box \log\sqrt{-g} - \Gamma - L \tag{D.52}$$

Of course the quantity $y = \log\sqrt{-g}$ is not a scalar, but formally the operator of (D.51) may be applied to it. We note that both the first and the second term in (D.52) represent a sum of derivatives of some quantities with respect to coordinates, divided by $\sqrt{-g}$. This fact is of importance in the formulation of a variational principle for Einstein’s equations, when the quantity L defined by (D.50) plays the part of the Lagrangian (Section 60).

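Apart from (D.50) there are several other ways of writing the Lagrangian $L$.
We quote the following

$$L = \frac{1}{2}\,\Gamma^{\nu,\alpha\beta}\,\frac{\partial g_{\alpha\beta}}{\partial x_\nu} - \Gamma^\alpha\,\frac{\partial \log\sqrt{-g}}{\partial x_\alpha} \tag{D.53}$$

and also

$$L = g^{\mu\nu}\left(\Gamma^{\beta}_{\mu\alpha}\,\Gamma^{\alpha}_{\nu\beta} - \Gamma^{\alpha}_{\mu\nu}\,\Gamma^{\beta}_{\alpha\beta}\right) \tag{D.54}$$

the last form is most commonly encountered in the literature.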
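The step from the form of $L$ containing derivatives of $g^{\alpha\beta}$ to a form quadratic in the Christoffel symbols can be sketched as follows. This is an illustrative check, not part of the original text. It assumes the definition $\Gamma^\alpha = g^{\mu\nu}\Gamma^\alpha_{\mu\nu}$, the relation $\partial\log\sqrt{-g}/\partial x_\alpha = \Gamma^\beta_{\alpha\beta}$, and the vanishing of the covariant derivative of $g^{\alpha\beta}$.

```latex
\begin{align}
\frac{\partial g^{\alpha\beta}}{\partial x_\nu}
  &= -g^{\alpha\mu}\,\Gamma^{\beta}_{\mu\nu} - g^{\beta\mu}\,\Gamma^{\alpha}_{\mu\nu}, \\
-\frac{1}{2}\,\Gamma^{\nu}_{\alpha\beta}\,\frac{\partial g^{\alpha\beta}}{\partial x_\nu}
  &= g^{\alpha\mu}\,\Gamma^{\nu}_{\alpha\beta}\,\Gamma^{\beta}_{\mu\nu}
   = g^{\mu\nu}\,\Gamma^{\beta}_{\mu\alpha}\,\Gamma^{\alpha}_{\nu\beta}, \\
\Gamma^{\alpha}\,\frac{\partial \log\sqrt{-g}}{\partial x_\alpha}
  &= g^{\mu\nu}\,\Gamma^{\alpha}_{\mu\nu}\,\Gamma^{\beta}_{\alpha\beta}, \\
L = -\frac{1}{2}\,\Gamma^{\nu}_{\alpha\beta}\,\frac{\partial g^{\alpha\beta}}{\partial x_\nu}
    - \Gamma^{\alpha}\,\frac{\partial \log\sqrt{-g}}{\partial x_\alpha}
  &= g^{\mu\nu}\left(\Gamma^{\beta}_{\mu\alpha}\,\Gamma^{\alpha}_{\nu\beta}
     - \Gamma^{\alpha}_{\mu\nu}\,\Gamma^{\beta}_{\alpha\beta}\right).
\end{align}
```

The second line uses the symmetry of $\Gamma^{\nu}_{\alpha\beta}$ in its lower indices, so that the two terms of the first line contribute equally, followed by a renaming of summation indices.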
(c) Transformation of the Einstein Tensor

The foregoing equations enable us to write down an expression for the
divergence-free Einstein tensor

$$G^{\mu\nu} = R^{\mu\nu} - \frac{1}{2}\,g^{\mu\nu}R \tag{D.55}$$

We shall see that the second derivatives of the metric tensor enter $G^{\mu\nu}$ only
through second derivatives of the quantity $\sqrt{-g}\cdot g^{\mu\nu}$ and through first
derivatives of $\Gamma^\nu$. It is, therefore, convenient to introduce a special symbol
for $\sqrt{-g}$ times the contravariant components of the metric tensor.

We put

$$\mathfrak{g}^{\mu\nu} = \sqrt{-g}\cdot g^{\mu\nu} \tag{D.56}$$

Then equation (41.16) may be written as

$$\Gamma^\nu = -\frac{1}{\sqrt{-g}}\,\frac{\partial \mathfrak{g}^{\mu\nu}}{\partial x_\mu} \tag{D.57}$$


For the following it is convenient to transform all equations in such a way
that they involve only derivatives of the $\mathfrak{g}^{\mu\nu}$. In performing the
transformation we shall encounter derivatives of the quantity

$$y = \log\sqrt{-g} \tag{D.58}$$

which we shall write as follows:

$$y_\alpha = \frac{\partial \log\sqrt{-g}}{\partial x_\alpha} \tag{D.59}$$

According to (41.07) we have

$$y_\alpha = \Gamma^{\nu}_{\alpha\nu} \tag{D.60}$$

For the quantities obtained from the $y_\alpha$ by raising indices we introduce the
notation

$$y^\alpha = g^{\alpha\beta}\,y_\beta \tag{D.61}$$

just as for a tensor (although of course the $y_\alpha$ are not a vector). We have also




(D.62) 


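where $\Gamma^{\mu,\alpha\beta}$ has the meaning defined by (D.19). We shall denote the second
derivatives of $y$ by $y_{\alpha\beta}$:

$$y_{\alpha\beta} = \frac{\partial^2 \log\sqrt{-g}}{\partial x_\alpha\,\partial x_\beta} \tag{D.63}$$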


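According to (33.21) the $\Gamma^{\mu,\alpha\beta}$ are bilinear functions of the components
$g^{\mu\nu}$ and of their first derivatives. Inserting the expression for $g^{\mu\nu}$ in
terms of $\mathfrak{g}^{\mu\nu}$ into that relation, we obtain

$$\Gamma^{\mu,\alpha\beta} = \Pi^{\mu,\alpha\beta} + \Lambda^{\mu,\alpha\beta} \tag{D.64}$$

where

$$\Pi^{\mu,\alpha\beta} = \frac{1}{2g}\left(\mathfrak{g}^{\alpha\rho}\,\frac{\partial \mathfrak{g}^{\mu\beta}}{\partial x_\rho} + \mathfrak{g}^{\beta\rho}\,\frac{\partial \mathfrak{g}^{\mu\alpha}}{\partial x_\rho} - \mathfrak{g}^{\mu\rho}\,\frac{\partial \mathfrak{g}^{\alpha\beta}}{\partial x_\rho}\right) \tag{D.65}$$

and

$$\Lambda^{\mu,\alpha\beta} = \frac{1}{2}\left(y^\alpha g^{\mu\beta} + y^\beta g^{\mu\alpha} - y^\mu g^{\alpha\beta}\right) \tag{D.66}$$

We shall denote the corresponding quantities with lower indices by $\Pi^{\mu}_{\alpha\beta}$ and
$\Lambda^{\mu}_{\alpha\beta}$.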

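We evaluate the determinant of the $\mathfrak{g}^{\mu\nu}$:

$$\operatorname{Det}\,\mathfrak{g}^{\mu\nu} = \{\sqrt{-g}\}^4\,\operatorname{Det}\,g^{\mu\nu} = g^2\cdot\frac{1}{g} = g \tag{D.67}$$

Thus the determinant of the $\mathfrak{g}^{\mu\nu}$ is equal to the determinant of the $g_{\mu\nu}$:

$$\operatorname{Det}\,\mathfrak{g}^{\mu\nu} = \operatorname{Det}\,g_{\mu\nu} = g \tag{D.68}$$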
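The equality of the determinant of $\sqrt{-g}\cdot g^{\mu\nu}$ with $g$ is easy to try out numerically. The following is a minimal sketch and not part of the original text: the sample metric `G_COV` and the helper names `det`, `inverse` and `gothic_determinant` are illustrative assumptions, and the determinant is evaluated by plain cofactor expansion, which is adequate for a 4 x 4 matrix.

```python
import math


def det(m):
    """Determinant by cofactor expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0.0
    for j in range(len(m)):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(minor)
    return total


def inverse(m):
    """Matrix inverse from cofactors: inv[j][i] = C_ij / det(m)."""
    n, d = len(m), det(m)
    inv = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            minor = [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]
            inv[j][i] = (-1) ** (i + j) * det(minor) / d
    return inv


# Hypothetical sample of covariant components g_{mu nu}, signature (+, -, -, -).
G_COV = [
    [1.0, 0.1, 0.0, 0.0],
    [0.1, -1.0, 0.2, 0.0],
    [0.0, 0.2, -1.0, 0.0],
    [0.0, 0.0, 0.0, -1.0],
]


def gothic_determinant(g_cov):
    """Return (g, Det of sqrt(-g) * g^{mu nu}) for a 4x4 metric with g < 0."""
    g = det(g_cov)
    g_contra = inverse(g_cov)          # contravariant components g^{mu nu}
    root = math.sqrt(-g)
    gothic = [[root * x for x in row] for row in g_contra]
    return g, det(gothic)


g, det_gothic = gothic_determinant(G_COV)
print(g, det_gothic)  # the two numbers should agree up to rounding
```

The factor $\{\sqrt{-g}\}^4$ appears only because the matrix is 4 x 4; in another number of dimensions the two determinants would differ.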

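From equation (D.66) we obtain

$$g_{\alpha\beta}\,\Lambda^{\mu,\alpha\beta} = -y^\mu \tag{D.69}$$

and since

$$g_{\alpha\beta}\,\Gamma^{\mu,\alpha\beta} = \Gamma^\mu \tag{D.70}$$

it follows that

$$g_{\alpha\beta}\,\Pi^{\mu,\alpha\beta} = \Gamma^\mu + y^\mu \tag{D.71}$$

The last expression is also equal to

$$\Gamma^\mu + y^\mu = -\frac{\partial g^{\mu\nu}}{\partial x_\nu} \tag{D.72}$$

We go on to the transformation of the Einstein tensor

$$G^{\mu\nu} = R^{\mu\nu} - \frac{1}{2}\,g^{\mu\nu}R \tag{D.55}$$

We start from the equation

$$R^{\mu\nu} = -\frac{1}{2}\,g^{\alpha\beta}\,\frac{\partial^2 g^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} - \Gamma^{\mu\nu} + \Gamma^{\mu,\alpha\beta}\,\Gamma^{\nu}_{\alpha\beta} \tag{D.35}$$

and also from equation (D.49) for $R$, which we write in the form

$$R = g^{\alpha\beta}\,y_{\alpha\beta} - \Gamma^\alpha y_\alpha - \Gamma - L \tag{D.73}$$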

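The second derivative of $\mathfrak{g}^{\mu\nu}$ is

$$\frac{\partial^2 \mathfrak{g}^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} = \sqrt{-g}\left(\frac{\partial^2 g^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} + y_\alpha\,\frac{\partial g^{\mu\nu}}{\partial x_\beta} + y_\beta\,\frac{\partial g^{\mu\nu}}{\partial x_\alpha} + y_{\alpha\beta}\,g^{\mu\nu} + y_\alpha y_\beta\,g^{\mu\nu}\right) \tag{D.74}$$

and therefore

$$\frac{1}{\sqrt{-g}}\,g^{\alpha\beta}\,\frac{\partial^2 \mathfrak{g}^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} = g^{\alpha\beta}\,\frac{\partial^2 g^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} + 2y^\alpha\,\frac{\partial g^{\mu\nu}}{\partial x_\alpha} + g^{\mu\nu}\left(g^{\alpha\beta}\,y_{\alpha\beta} + y_\alpha y^\alpha\right) \tag{D.75}$$

On the other hand we have

$$R^{\mu\nu} - \frac{1}{2}\,g^{\mu\nu}R = -\frac{1}{2}\,g^{\alpha\beta}\,\frac{\partial^2 g^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} - \frac{1}{2}\,g^{\mu\nu}g^{\alpha\beta}\,y_{\alpha\beta} + \frac{1}{2}\,g^{\mu\nu}\left(\Gamma^\alpha y_\alpha + \Gamma + L\right) - \Gamma^{\mu\nu} + \Gamma^{\mu,\alpha\beta}\,\Gamma^{\nu}_{\alpha\beta} \tag{D.76}$$

By comparing the last two equations we see that apart from the terms in $\Gamma$
and $\Gamma^\alpha$ the same combinations of second derivatives occur in both. The
calculation gives

$$G^{\mu\nu} = -\frac{1}{2\sqrt{-g}}\,g^{\alpha\beta}\,\frac{\partial^2 \mathfrak{g}^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} + \frac{1}{2}\,g^{\mu\nu}\left(y_\alpha y^\alpha + \Gamma^\alpha y_\alpha + \Gamma + L\right) - \Gamma^{\mu\nu} + y^\alpha\,\frac{\partial g^{\mu\nu}}{\partial x_\alpha} + \Gamma^{\mu,\alpha\beta}\,\Gamma^{\nu}_{\alpha\beta} \tag{D.77}$$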

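As always, the transformation of the terms involving first derivatives is the
most complicated. We have

$$\Gamma^{\mu,\alpha\beta}\,\Gamma^{\nu}_{\alpha\beta} = \Pi^{\mu,\alpha\beta}\,\Pi^{\nu}_{\alpha\beta} + \Lambda^{\mu,\alpha\beta}\,\Pi^{\nu}_{\alpha\beta} + \Lambda^{\nu,\alpha\beta}\,\Pi^{\mu}_{\alpha\beta} + \Lambda^{\mu,\alpha\beta}\,\Lambda^{\nu}_{\alpha\beta} \tag{D.78}$$

Using (D.71) we get

$$\Lambda^{\mu,\alpha\beta}\,\Pi^{\nu}_{\alpha\beta} + \Lambda^{\nu,\alpha\beta}\,\Pi^{\mu}_{\alpha\beta} = y_\alpha\left(\Pi^{\mu,\alpha\nu} + \Pi^{\nu,\alpha\mu}\right) - \frac{1}{2}\left(y^\mu\Gamma^\nu + y^\nu\Gamma^\mu\right) - y^\mu y^\nu \tag{D.79}$$

and as a result of

$$\Pi^{\mu,\alpha\nu} + \Pi^{\nu,\alpha\mu} = \frac{1}{g}\,\mathfrak{g}^{\alpha\rho}\,\frac{\partial \mathfrak{g}^{\mu\nu}}{\partial x_\rho} = -g^{\alpha\rho}\,\frac{\partial g^{\mu\nu}}{\partial x_\rho} - g^{\mu\nu}y^\alpha \tag{D.80}$$

we have

$$\Lambda^{\mu,\alpha\beta}\,\Pi^{\nu}_{\alpha\beta} + \Lambda^{\nu,\alpha\beta}\,\Pi^{\mu}_{\alpha\beta} = -y^\alpha\,\frac{\partial g^{\mu\nu}}{\partial x_\alpha} - g^{\mu\nu}\,y_\alpha y^\alpha - \frac{1}{2}\left(y^\mu\Gamma^\nu + y^\nu\Gamma^\mu\right) - y^\mu y^\nu \tag{D.81}$$

Further

$$\Lambda^{\mu,\alpha\beta}\,\Lambda^{\nu}_{\alpha\beta} = \frac{1}{2}\,y^\mu y^\nu + \frac{1}{2}\,g^{\mu\nu}\,y_\alpha y^\alpha \tag{D.82}$$

Hence

$$\Gamma^{\mu,\alpha\beta}\,\Gamma^{\nu}_{\alpha\beta} = \Pi^{\mu,\alpha\beta}\,\Pi^{\nu}_{\alpha\beta} - y^\alpha\,\frac{\partial g^{\mu\nu}}{\partial x_\alpha} - \frac{1}{2}\,g^{\mu\nu}\,y_\alpha y^\alpha - \frac{1}{2}\,y^\mu y^\nu - \frac{1}{2}\left(y^\mu\Gamma^\nu + y^\nu\Gamma^\mu\right) \tag{D.83}$$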

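These relations allow us to rewrite (D.77) as

$$R^{\mu\nu} - \frac{1}{2}\,g^{\mu\nu}R = -\frac{1}{2\sqrt{-g}}\,g^{\alpha\beta}\,\frac{\partial^2 \mathfrak{g}^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} + \Pi^{\mu,\alpha\beta}\,\Pi^{\nu}_{\alpha\beta} - \frac{1}{2}\,y^\mu y^\nu + \frac{1}{2}\,g^{\mu\nu}L + \frac{1}{2}\,g^{\mu\nu}\left(\Gamma^\alpha y_\alpha + \Gamma\right) - \Gamma^{\mu\nu} - \frac{1}{2}\left(y^\mu\Gamma^\nu + y^\nu\Gamma^\mu\right) \tag{D.84}$$

Here we put

$$B^{\mu\nu} = \Gamma^{\mu\nu} + \frac{1}{2}\left(y^\mu\Gamma^\nu + y^\nu\Gamma^\mu\right) \tag{D.85}$$

and

$$B = g_{\mu\nu}B^{\mu\nu} = \Gamma + \Gamma^\alpha y_\alpha \tag{D.86}$$

and we express the $g^{\alpha\beta}$ in terms of the $\mathfrak{g}^{\alpha\beta}$ in the term involving the
d’Alembertian. Then the expression for the Einstein tensor becomes

$$R^{\mu\nu} - \frac{1}{2}\,g^{\mu\nu}R = \frac{1}{2g}\,\mathfrak{g}^{\alpha\beta}\,\frac{\partial^2 \mathfrak{g}^{\mu\nu}}{\partial x_\alpha\,\partial x_\beta} + \Pi^{\mu,\alpha\beta}\,\Pi^{\nu}_{\alpha\beta} - \frac{1}{2}\,y^\mu y^\nu + \frac{1}{2}\,g^{\mu\nu}L + \frac{1}{2}\,g^{\mu\nu}B - B^{\mu\nu} \tag{D.87}$$

Since, according to (D.68), the determinant $g$ can be expressed directly in terms
of the $\mathfrak{g}^{\mu\nu}$ we can take it that in (D.87) all terms except $L$ are expressed in
terms of the $\mathfrak{g}^{\alpha\beta}$ and their derivatives. It remains for us to express the
Lagrangian $L$ also in terms of the same quantities.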

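By the definition (D.50) we have

$$L = -\frac{1}{2}\,\Gamma^{\nu}_{\alpha\beta}\,\frac{\partial g^{\alpha\beta}}{\partial x_\nu} - \Gamma^\alpha y_\alpha \tag{D.88}$$

Here we insert

$$\Gamma^{\nu}_{\alpha\beta} = \Pi^{\nu}_{\alpha\beta} + \Lambda^{\nu}_{\alpha\beta} \tag{D.89}$$

where, by (D.66)

$$\Lambda^{\nu}_{\alpha\beta} = \frac{1}{2}\left(y_\alpha\delta^\nu_\beta + y_\beta\delta^\nu_\alpha - y^\nu g_{\alpha\beta}\right) \tag{D.90}$$

Using the relation

$$\Lambda^{\nu}_{\alpha\beta}\,\frac{\partial g^{\alpha\beta}}{\partial x_\nu} = -\Gamma^\nu y_\nu \tag{D.91}$$

which is easy to deduce from (D.90), we get

$$L = -\frac{1}{2}\,\Pi^{\nu}_{\alpha\beta}\,\frac{\partial g^{\alpha\beta}}{\partial x_\nu} - \frac{1}{2}\,\Gamma^\nu y_\nu \tag{D.92}$$

Here we express $\partial g^{\alpha\beta}/\partial x_\nu$ in terms of $\partial \mathfrak{g}^{\alpha\beta}/\partial x_\nu$:

$$\frac{\partial g^{\alpha\beta}}{\partial x_\nu} = \frac{1}{\sqrt{-g}}\,\frac{\partial \mathfrak{g}^{\alpha\beta}}{\partial x_\nu} - y_\nu\,g^{\alpha\beta} \tag{D.93}$$

and use equation (D.71), which may be written in the form

$$g^{\alpha\beta}\,\Pi^{\nu}_{\alpha\beta} = \Gamma^\nu + y^\nu \tag{D.94}$$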

This gives, finally

$$L = -\frac{1}{2\sqrt{-g}}\,\Pi^{\nu}_{\alpha\beta}\,\frac{\partial \mathfrak{g}^{\alpha\beta}}{\partial x_\nu} + \frac{1}{2}\,y_\nu y^\nu \tag{D.95}$$

Now only the $\mathfrak{g}^{\alpha\beta}$ remain under the sign of differentiation.


THE CHARACTERISTICS OF THE GENERALIZED
D’ALEMBERT EQUATION


The generalized wave equation, or d’Alembert equation, has the form

$$\Box\psi = 0 \tag{E.01}$$

where $\Box\psi$ is the expression

$$\Box\psi = \frac{1}{\sqrt{-g}}\,\frac{\partial}{\partial x_\beta}\left(\sqrt{-g}\,g^{\alpha\beta}\,\frac{\partial\psi}{\partial x_\alpha}\right) \tag{E.02}$$

in which the quantities $g^{\alpha\beta}$ and $g$ have their usual significance.

Cauchy’s problem for the equation (E.01) consists in determining the function
$\psi$, if on some hypersurface

$$\omega(x_0, x_1, x_2, x_3) = \text{const.} \tag{E.03}$$

the values of $\psi$ and $\partial\psi/\partial x_0$ are given. We assume that the equation of the
hypersurface (E.03) can be solved for $x_0$ and that therefore

$$\frac{\partial\omega}{\partial x_0} \neq 0 \tag{E.04}$$


We are interested in the possibility of solving Cauchy’s problem in some region
sufficiently near to the hypersurface (E.03). To evaluate the function $\psi$ in this
region one has to be able to calculate the derivatives of $\psi$ at any point of the
hypersurface. It is easy to see that the first derivatives can be calculated
directly from the initial conditions. For the calculation of second derivatives,
however, one necessarily has to use the wave equation. The possibility of
determining the second derivatives will then depend on the form of the
hypersurface to which the initial conditions refer. If the hypersurface is such that
initial conditions relative to it do not determine the values of the second
derivatives it is called a characteristic. A characteristic hypersurface has the
property that on it discontinuities of the second derivatives are possible.
Therefore the moving surface of a wave front must be a characteristic.

Let us consider first the simplest case, when the initial conditions refer to the
hypersurface $x_0 = \text{const.}$, i.e. to an initial instant of time. From given values
of $\psi$ and $\partial\psi/\partial x_0$ all first derivatives are obtainable by direct differentiation,
and also the second derivatives of the form

$$\frac{\partial^2\psi}{\partial x_0\,\partial x_i};\qquad \frac{\partial^2\psi}{\partial x_i\,\partial x_k} \qquad (i, k = 1, 2, 3) \tag{E.05}$$

(or rather the values of these quantities for fixed $x_0$). As for the second
derivative with respect to $x_0$, it must be calculated from the wave equation.
Explicitly the wave equation has the form

$$g^{00}\,\frac{\partial^2\psi}{\partial x_0^2} + \ldots = 0 \tag{E.06}$$

where the dots denote terms containing the remaining second derivatives
(E.05) and also the known first derivatives. Since by the nature of the metric
tensor the quantity $g^{00}$ never vanishes, being always positive, equation (E.06)
can always be solved for the remaining second derivative $\partial^2\psi/\partial x_0^2$. This means
that if the variable $x_0$ has the character of a time, the hypersurfaces $x_0 = \text{const.}$
are not characteristics.

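Let us now consider the general case of a hypersurface (E.03). We introduce
the variables

$$x'_0 = \omega(x_0, x_1, x_2, x_3);\qquad x'_k = x_k \quad (k = 1, 2, 3) \tag{E.07}$$

We denote by $\psi'$ the quantity $\psi$ considered as a function of the variables
$x'_0$, $x'_1$, $x'_2$ and $x'_3$. Then we have

$$\frac{\partial\psi}{\partial x_0} = \frac{\partial\psi'}{\partial x'_0}\,\frac{\partial\omega}{\partial x_0} \tag{E.08}$$

and since (E.04) holds, giving $\psi$ and $\partial\psi/\partial x_0$ for $\omega = \text{const.}$ is equivalent to
giving $\psi'$ and $\partial\psi'/\partial x'_0$ for $x'_0 = \text{const.}$ As before, these last quantities allow
one to calculate all first derivatives and also the second derivatives of the
form

$$\frac{\partial^2\psi'}{\partial x'_0\,\partial x'_i};\qquad \frac{\partial^2\psi'}{\partial x'_i\,\partial x'_k} \qquad (i, k = 1, 2, 3) \tag{E.09}$$

whereas the second derivative with respect to $x'_0$ must be determined from the
wave equation. The d’Alembertian transformed to the new coordinates has the
form

$$\Box\psi = \frac{1}{\sqrt{-g'}}\,\frac{\partial}{\partial x'_\beta}\left(\sqrt{-g'}\,g'^{\alpha\beta}\,\frac{\partial\psi'}{\partial x'_\alpha}\right) \tag{E.10}$$

where the $g'^{\alpha\beta}$ are obtained from the $g^{\alpha\beta}$ according to the usual tensor
transformation rule; in particular

$$g'^{00} = g^{\mu\nu}\,\frac{\partial x'_0}{\partial x_\mu}\,\frac{\partial x'_0}{\partial x_\nu} = g^{\mu\nu}\,\frac{\partial\omega}{\partial x_\mu}\,\frac{\partial\omega}{\partial x_\nu} \tag{E.11}$$

It is, however, to be remembered that since the new variable $x'_0$ does not
necessarily have the character of a time the inequality $g'^{00} > 0$ need not hold.
The transformed wave equation takes on the form

$$\left(g^{\mu\nu}\,\frac{\partial\omega}{\partial x_\mu}\,\frac{\partial\omega}{\partial x_\nu}\right)\frac{\partial^2\psi'}{\partial x'^2_0} + \ldots = 0 \tag{E.12}$$

where the terms omitted do not involve the second derivative with respect
to $x'_0$ but only second derivatives of the form (E.09) and also first derivatives.
The values of all the omitted terms on the hypersurface $\omega = \text{const.}$ may be
taken to be known from the initial data relating to this hypersurface.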

The second derivative with respect to $x'_0$ remains indeterminate if and only
if its coefficient in the wave equation (E.12) becomes zero, i.e. if the function $\omega$
satisfies the equation

$$g^{\mu\nu}\,\frac{\partial\omega}{\partial x_\mu}\,\frac{\partial\omega}{\partial x_\nu} = 0 \tag{E.13}$$

This is the equation of the characteristics for the wave equation (E.01).
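The criterion for a characteristic can be illustrated with a small numerical sketch. It is not part of the original text and rests on stated assumptions: a flat metric in Galilean coordinates with $x_0 = t$, so that $g^{00} = 1/c^2$ and $g^{11} = g^{22} = g^{33} = -1$, an arbitrary value of `c`, and hypothetical helper names. The quantity tested is $g^{\mu\nu}\omega_\mu\omega_\nu$ with $\omega_\mu = \partial\omega/\partial x_\mu$.

```python
def characteristic_form(g_contra, grad_omega):
    """Return g^{mu nu} * omega_mu * omega_nu for a given gradient of omega."""
    n = len(grad_omega)
    return sum(g_contra[m][k] * grad_omega[m] * grad_omega[k]
               for m in range(n) for k in range(n))


def flat_metric(c):
    """Contravariant Galilean metric with x_0 = t: diag(1/c^2, -1, -1, -1)."""
    return [
        [1.0 / c ** 2, 0.0, 0.0, 0.0],
        [0.0, -1.0, 0.0, 0.0],
        [0.0, 0.0, -1.0, 0.0],
        [0.0, 0.0, 0.0, -1.0],
    ]


def is_characteristic(g_contra, grad_omega, tol=1e-12):
    """A hypersurface omega = const. is characteristic if the form vanishes."""
    return abs(characteristic_form(g_contra, grad_omega)) < tol


c = 2.0  # arbitrary units, chosen only for the illustration
g = flat_metric(c)

# omega = t: an instant of time, the coefficient g^{00} is positive.
print(is_characteristic(g, [1.0, 0.0, 0.0, 0.0]))
# omega = c*t - x_1: a plane wave front moving along x_1.
print(is_characteristic(g, [c, -1.0, 0.0, 0.0]))
# omega = x_1: a hypersurface that is not of temporal character.
print(is_characteristic(g, [0.0, 1.0, 0.0, 0.0]))
```

Only the second hypersurface makes the coefficient of the undetermined second derivative vanish; for the third the coefficient is negative, which is the case in which the inequality $g'^{00} > 0$ fails without the surface being a characteristic.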





Appendix E 

The characteristics of the generalized wave equation are the same as the
characteristics of the generally covariant Maxwell equations discussed in
Section 46. Briefly, though not quite rigorously, this result may be justified
by the following consideration. Under the condition $\nabla_\nu A^\nu = 0$ Maxwell’s
equations give rise to equations for the potentials $A_\nu$ in which the highest
(second) derivatives are grouped in the form of the d’Alembertian. Hence it
can be concluded that the characteristics of the generally covariant Maxwell
equations have the form (E.13). The lack of rigour in this derivation consists
in passing from the characteristics for the potentials to the characteristics for
the field. It is not difficult to give a rigorous derivation in which one deals
directly with the field components, similar to the procedure of Section 3 for
cartesian coordinates in the Euclidean case.


INTEGRATION OF THE WAVE FRONT EQUATION


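If

$$\omega(x_0, x_1, x_2, x_3) = 0 \tag{F.01}$$

is the equation of a moving front then, as we know, the function $\omega$ satisfies
the partial differential equation

$$g^{\mu\nu}\,\omega_\mu\,\omega_\nu = 0 \tag{F.02}$$

where we have put, for brevity

$$\omega_\mu = \frac{\partial\omega}{\partial x_\mu} \tag{F.03}$$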


We consider the following problem: determine the form of the wave surface
at an instant $x_0 > 0$ when its form is given at the initial instant $x_0 = 0$. We
solved a corresponding problem in Section 4 for the case of a Euclidean metric
and Galilean coordinates; we now consider it for the general case.

Let the form of the wave surface at the initial instant be given by the equations

$$x_1 = f_1(u, v);\qquad x_2 = f_2(u, v);\qquad x_3 = f_3(u, v) \qquad \text{for } x_0 = 0 \tag{F.04}$$

These equations may be written in the more symmetric form

$$x_\alpha = f_\alpha(u, v); \qquad (\alpha = 0, 1, 2, 3) \tag{F.05}$$

where

$$f_0 = 0 \tag{F.06}$$

Instead of initial data referring to the instant of time $x_0 = 0$ one can consider
Cauchy data referring to some hypersurface; then the function $f_0$ will no
longer be identically zero.

If we insert the expressions (F.05) into the equation (F.01) we obtain an
identity in $u$ and $v$. Differentiating this identity with respect to $u$ and to $v$
we get the relations

$$\omega_\alpha\,\frac{\partial f_\alpha}{\partial u} = 0;\qquad \omega_\alpha\,\frac{\partial f_\alpha}{\partial v} = 0 \tag{F.07}$$


If we adjoin to these the wave front equation (F.02) we can determine from these
three homogeneous equations the initial values of the four quantities $\omega_0$, $\omega_1$,
$\omega_2$ and $\omega_3$ apart from a common multiplier. Denoting these initial values by
the superfix zero we have

$$\omega^0_\alpha = \lambda\,\varphi_\alpha(u, v);\qquad \lambda = \lambda(u, v) \tag{F.08}$$

where the $\varphi_\alpha$ are known functions and $\lambda$ is an arbitrary function of $u$ and $v$.
It should be noted that the ratios of the $\omega^0_\alpha$ are not quite uniquely given;
they may have two values, because of the fact that the wave front equation
(F.02) is quadratic in the $\omega_\alpha$. For a unique determination of these ratios one
must also indicate in which of the two possible directions the wave is travelling.

To indicate that the left-hand sides of the equations (F.05) represent the
initial values of the coordinates we attach the superfix zero to them, writing
them as

$$x^0_\alpha = f_\alpha(u, v) \tag{F.09}$$

We take a point on the initial wave surface (to each such point there
corresponds a definite pair of values of $u$ and $v$) and consider the ray passing through
this point. As we saw in Section 38 the differential equations of the ray are the
Hamiltonian equations corresponding to the Hamilton-Jacobi equation (F.02).
Thus, according to (38.41), the equations have the form

$$\frac{dx_\alpha}{dp} = g^{\alpha\beta}\,\omega_\beta;\qquad \frac{d\omega_\alpha}{dp} = -\frac{1}{2}\,\frac{\partial g^{\mu\nu}}{\partial x_\alpha}\,\omega_\mu\,\omega_\nu \tag{F.10}$$


In Section 38 it was shown that these equations are equivalent to the equations
of a null geodesic.

For the ray passing through the point $(u, v)$ the initial values of the variables
$x_\alpha$ and $\omega_\alpha$ are equal respectively to (F.09) and (F.08). Integrating equation
(F.10) with the initial values $x^0_\alpha$ and $\omega^0_\alpha$ we get

$$x_\alpha = X_\alpha(p;\ x^0_0, x^0_1, x^0_2, x^0_3;\ \omega^0_0, \omega^0_1, \omega^0_2, \omega^0_3)$$
$$\omega_\alpha = \Omega_\alpha(p;\ x^0_0, x^0_1, x^0_2, x^0_3;\ \omega^0_0, \omega^0_1, \omega^0_2, \omega^0_3) \tag{F.11}$$

Here we insert the initial values (F.09) and (F.08) and so obtain expressions
of the form

$$x_\alpha = F_\alpha(\lambda p, u, v);\qquad \omega_\alpha = \lambda\,\Phi_\alpha(\lambda p, u, v) \tag{F.12}$$

Since the equations (F.10) do not change under the substitution

$$\omega_\alpha = \lambda\,\omega'_\alpha;\qquad p' = \lambda p \tag{F.13}$$

where $\lambda$ is constant along the ray, the functions $F_\alpha$ and $\Phi_\alpha$ do not depend on
$\lambda$ and $p$ separately but only on their product $\lambda p$.

The expressions

$$x_\alpha = F_\alpha(p', u, v) \tag{F.14}$$

give the equation of a moving wave surface in parametric form. By eliminating
the variables $p'$, $u$ and $v$ one can obtain a relation between the four coordinates
$(x_0, x_1, x_2, x_3)$ which is just the equation of a wave surface in the form (F.01).
The proof of the relations and formulae mentioned here may be found in
treatises on first order partial differential equations (e.g. see [8]).

As the simplest example let us consider the equation

$$\frac{1}{c^2}\,\omega_0^2 - (\omega_1^2 + \omega_2^2 + \omega_3^2) = 0 \tag{F.15}$$

with the initial form of the wave surface given by

$$z = f(x, y) \qquad \text{for } t = 0 \tag{F.16}$$

Here the cartesian coordinates $x$ and $y$ play the part of the parameters $u$ and $v$.
The equations (F.07) appear as

$$\omega_1 + f_x\,\omega_3 = 0;\qquad \omega_2 + f_y\,\omega_3 = 0 \tag{F.17}$$

where $f_x$ and $f_y$ are the partial derivatives of $f$ with respect to $x$ and $y$. From
(F.15) and (F.17) we find as one of the possible solutions

$$\omega_0 = \lambda c;\qquad \omega_1 = -\frac{\lambda f_x}{\sqrt{1 + f_x^2 + f_y^2}};\qquad \omega_2 = -\frac{\lambda f_y}{\sqrt{1 + f_x^2 + f_y^2}};\qquad \omega_3 = \frac{\lambda}{\sqrt{1 + f_x^2 + f_y^2}} \tag{F.18}$$

The other solution is obtained from (F.18) by changing the sign of the square
root. In equations (F.17) and (F.18) we should, strictly speaking, write $\omega^0_\alpha$
instead of $\omega_\alpha$, but we omit the superfix zero because in this example the $\omega_\alpha$
are constant anyhow. Solving the equations

$$\frac{dx_0}{dp} = \frac{1}{c^2}\,\omega_0;\qquad \frac{dx_i}{dp} = -\omega_i \tag{F.19}$$

and assuming that at $t = 0$ we have also $p = 0$, we get

$$x_0 = t = \frac{\omega_0}{c^2}\,p;\qquad x_i = x^0_i - \omega_i\,p \qquad (i = 1, 2, 3) \tag{F.20}$$

Here we insert the $\omega_\alpha$ from (F.18) and express the parameter $p$ in terms of $t$.
We then have for the first solution

$$x_1 = x + \frac{ct\,f_x}{\sqrt{1 + f_x^2 + f_y^2}};\qquad x_2 = y + \frac{ct\,f_y}{\sqrt{1 + f_x^2 + f_y^2}};\qquad x_3 = f(x, y) - \frac{ct}{\sqrt{1 + f_x^2 + f_y^2}} \tag{F.21}$$

For the second solution (with the opposite direction of propagation) we get

$$x_1 = x - \frac{ct\,f_x}{\sqrt{1 + f_x^2 + f_y^2}};\qquad x_2 = y - \frac{ct\,f_y}{\sqrt{1 + f_x^2 + f_y^2}};\qquad x_3 = f(x, y) + \frac{ct}{\sqrt{1 + f_x^2 + f_y^2}} \tag{F.22}$$

In particular, if we take for the initial wave surface the surface of a sphere of
radius $a$ and put

$$f(x, y) = \sqrt{a^2 - x^2 - y^2} \tag{F.23}$$

we get from (F.22)

$$x_1 = x\left(1 + \frac{ct}{a}\right);\qquad x_2 = y\left(1 + \frac{ct}{a}\right);\qquad x_3 = \sqrt{a^2 - x^2 - y^2}\cdot\left(1 + \frac{ct}{a}\right) \tag{F.24}$$

and, after eliminating $x$ and $y$

$$x_1^2 + x_2^2 + x_3^2 = (a + ct)^2 \tag{F.25}$$

i.e. the surface of a sphere of radius $R = a + ct$, as was to be expected.
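The spherical example can be reproduced numerically. The sketch below is an illustration added here, not part of the original text. It assumes that the two solutions displace each point $(x, y, f(x, y))$ of the initial surface by a distance $ct$ along the unit normal $(-f_x, -f_y, 1)/\sqrt{1 + f_x^2 + f_y^2}$, in one direction or the other; the function names and the sample values of `a` and `ct` are hypothetical.

```python
import math


def moved_point(f, f_x, f_y, x, y, ct, sign=+1):
    """Displace the surface point (x, y, f(x, y)) by ct along the unit normal.

    sign=+1 corresponds to the second solution of the text (x_3 increases),
    sign=-1 to the first solution (x_3 decreases).
    """
    fx, fy = f_x(x, y), f_y(x, y)
    norm = math.sqrt(1.0 + fx * fx + fy * fy)
    return (x - sign * ct * fx / norm,
            y - sign * ct * fy / norm,
            f(x, y) + sign * ct / norm)


A = 2.0  # radius of the initial sphere (illustrative value)


def sphere(x, y):
    """Upper hemisphere z = sqrt(a^2 - x^2 - y^2)."""
    return math.sqrt(A * A - x * x - y * y)


def sphere_x(x, y):
    return -x / sphere(x, y)


def sphere_y(x, y):
    return -y / sphere(x, y)


def radius_after(x, y, ct, sign=+1):
    """Distance from the origin of the displaced point."""
    point = moved_point(sphere, sphere_x, sphere_y, x, y, ct, sign)
    return math.sqrt(sum(q * q for q in point))


for x, y in [(0.0, 0.0), (0.5, -0.3), (1.0, 1.0)]:
    # expected: a + ct for sign=+1 and a - ct for sign=-1, up to rounding
    print(radius_after(x, y, 0.5), radius_after(x, y, 0.5, sign=-1))
```

For the sphere the normal is radial, so every point simply moves outwards (or inwards) by $ct$; for a general $f$ the same routine traces the parallel surface as long as the rays have not crossed.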
NECESSARY AND SUFFICIENT CONDITIONS
FOR THE EUCLIDEAN CHARACTER
OF THREE-DIMENSIONAL SPACE

In three-dimensional space the second rank curvature tensor $R_{ik}$ has the
same number of components, namely six, as the fourth rank curvature tensor
$R_{il,mk}$. Therefore one should expect that not only is $R_{ik}$ expressible in terms of
$R_{il,mk}$ by the general rule

$$R_{ik} = a^{lm}\,R_{il,mk} \tag{G.01}$$

but that also, conversely, $R_{il,mk}$ can be expressed in terms of $R_{ik}$ (we here
omit the label $a$ for components of a three-dimensional tensor).

To find these expressions we introduce a system of antisymmetric quantities
$\varepsilon_{ijk}$ with $\varepsilon_{123} = 1$, similarly to our procedure in Sections 22 and 37 and we
construct an antisymmetric pseudo-tensor with the covariant components

$$E_{ijk} = \sqrt{a}\,\varepsilon_{ijk} \tag{G.02}$$

and the contravariant components

$$E^{ijk} = \frac{\varepsilon_{ijk}}{\sqrt{a}} \tag{G.03}$$


For transformations of the coordinates $x_1$, $x_2$ and $x_3$ that have a positive
Jacobian

$$D\!\left(\frac{x'}{x}\right) = \frac{D(x'_1, x'_2, x'_3)}{D(x_1, x_2, x_3)} > 0 \tag{G.04}$$

we have

$$\sqrt{a'}\cdot D\!\left(\frac{x'}{x}\right) = \sqrt{a} \tag{G.05}$$

and therefore

$$E'_{ijk}\,\frac{\partial x'_i}{\partial x_p}\,\frac{\partial x'_j}{\partial x_q}\,\frac{\partial x'_k}{\partial x_r} = E_{pqr} \tag{G.06}$$

by the rule for forming a determinant. This proves that in such transformations
the quantities $E_{ijk}$ behave like a covariant tensor. By the rule for forming a
determinant we can also write

$$E_{ikl}\,a^{ip}\,a^{kq}\,a^{lr} = E^{pqr} \tag{G.07}$$

where $E^{pqr}$ has the value (G.03). This proves that $E^{pqr}$ is the contravariant
pseudo-tensor corresponding to $E_{ikl}$. Hence it is simple to obtain the relations

$$E_{ikl}\,a^{kq}\,a^{lr} = E^{pqr}\,a_{ip} \tag{G.08}$$

$$E_{ikl}\,a^{lr} = E^{pqr}\,a_{ip}\,a_{kq} \tag{G.09}$$

and

$$E_{ikl} = E^{pqr}\,a_{ip}\,a_{kq}\,a_{lr} \tag{G.10}$$

We note also the relation

$$E^{pqj}\,E_{rsj} = \delta^p_r\,\delta^q_s - \delta^p_s\,\delta^q_r \tag{G.11}$$

which will be important for the subsequent discussion. It can be proved by the
following reasoning. Both sides are different from zero only if $p \neq q$ and
$r \neq s$, and apart from their order the pair of numbers $(p, q)$ must coincide with
the pair $(r, s)$. If $p = r$ and $q = s$, both sides are equal to $+1$ and if $p = s$
and $q = r$ both are equal to $-1$. Therefore both sides coincide for all possible
values of the indices, so that equation (G.11) is proved.
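The case-by-case argument for the product of two antisymmetric pseudo-tensors can also be confirmed by brute force over all index values. The sketch below is an added illustration, not part of the original text. It uses the fact that the factors $\sqrt{a}$ and $1/\sqrt{a}$ cancel in the product, so only the symbols $\varepsilon_{ijk}$ need to be compared; indices run over 0, 1, 2 instead of 1, 2, 3, and the helper names are hypothetical.

```python
from itertools import product


def eps(i, j, k):
    """Antisymmetric symbol for indices in {0, 1, 2}, with eps(0, 1, 2) = 1."""
    return (i - j) * (j - k) * (k - i) // 2


def delta(i, j):
    """Kronecker symbol."""
    return 1 if i == j else 0


def identity_holds():
    """Check sum_j eps(p,q,j)*eps(r,s,j) = d(p,r)d(q,s) - d(p,s)d(q,r)."""
    for p, q, r, s in product(range(3), repeat=4):
        lhs = sum(eps(p, q, j) * eps(r, s, j) for j in range(3))
        rhs = delta(p, r) * delta(q, s) - delta(p, s) * delta(q, r)
        if lhs != rhs:
            return False
    return True


print(identity_holds())
```

All 81 combinations of the four free indices are visited, which covers exactly the cases enumerated in the proof.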

To find the expression for $R_{il,mk}$ in terms of $R_{ik}$ we introduce the
contravariant symmetric second rank tensor $A^{pq}$ by the equations

$$A^{pq}\,E_{pil}\,E_{qmk} = R_{il,mk} \tag{G.12}$$

and then establish the relation between $A^{pq}$ and $R^{pq}$. The equations (G.12)
may be written in the form

$$aA^{11} = R_{23,23};\qquad aA^{22} = R_{31,31};\qquad aA^{33} = R_{12,12}$$
$$aA^{23} = R_{31,12};\qquad aA^{31} = R_{12,23};\qquad aA^{12} = R_{23,31} \tag{G.13}$$

with

$$A^{pq} = A^{qp} \tag{G.14}$$

These equations express the $A^{pq}$ directly in terms of the $R_{il,mk}$.
Inserting (G.12) into (G.01) and using (G.09) and (G.11) we get

$$R_{ik} = A^{pq}\,(a_{pk}\,a_{iq} - a_{pq}\,a_{ik}) \tag{G.15}$$

and if we put

$$A = a_{pq}\,A^{pq} \tag{G.16}$$

we have

$$R_{ik} = A_{ik} - a_{ik}\,A \tag{G.17}$$

whence

$$R = a^{ik}R_{ik} = -2A \tag{G.18}$$

and therefore

$$A_{ik} = R_{ik} - \frac{1}{2}\,a_{ik}\,R \tag{G.19}$$

Thus $A_{ik}$ is simply the three-dimensional conservative tensor.

Inserting the corresponding contravariant tensor into (G.12) we obtain the
required expression of the fourth rank tensor in terms of the second rank
tensor:

$$R_{il,mk} = \left(R^{pq} - \frac{1}{2}\,a^{pq}R\right)E_{pil}\,E_{qmk} \tag{G.20}$$

From the equations just obtained we can deduce the following important
consequence. We know (Section 42) that the necessary and sufficient condition for
the possibility of reducing a given quadratic form $ds^2$ to a form with constant
coefficients is the vanishing of the fourth rank curvature tensor. This result
obviously also applies to the purely spatial three-dimensional quadratic form

$$dl^2 = a_{ik}\,dx_i\,dx_k \tag{G.21}$$

for the reduction of which to the Euclidean form

$$dl^2 = dx'^2_1 + dx'^2_2 + dx'^2_3 \tag{G.22}$$

the necessary and sufficient condition is the equation

$$R_{il,mk} = 0 \tag{G.23}$$

where the quantity on the left-hand side is the three-dimensional tensor.
But by (G.20) the fourth rank curvature tensor is, in three-dimensional space,
expressible in terms of the second rank curvature tensor. Therefore the
necessary and sufficient condition for the Euclidean character of a three-dimensional
space is the vanishing of the second rank curvature tensor.
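The algebra behind this conclusion can be tried out on numbers. The following sketch is an added illustration under simplifying assumptions, not part of the original text: the metric is taken as $a_{ik} = \delta_{ik}$ at the point considered, so that $E_{ijk} = \varepsilon_{ijk}$ and upper and lower indices coincide; the sample values in `RICCI` and the helper names are hypothetical. The fourth rank tensor is built as $(R_{pq} - \frac{1}{2}\delta_{pq}R)\,\varepsilon_{pil}\,\varepsilon_{qmk}$ and then contracted over $l = m$.

```python
from fractions import Fraction
from itertools import product


def eps(i, j, k):
    """Antisymmetric symbol for indices in {0, 1, 2}, with eps(0, 1, 2) = 1."""
    return (i - j) * (j - k) * (k - i) // 2


def riemann_from_ricci(ricci):
    """Fourth rank tensor from a symmetric 3x3 second rank tensor (a_ik = delta_ik)."""
    trace = sum(Fraction(ricci[i][i]) for i in range(3))
    # Conservative tensor A_pq = R_pq - (1/2) * delta_pq * R.
    a_up = [[Fraction(ricci[p][q]) - (trace / 2 if p == q else 0)
             for q in range(3)] for p in range(3)]
    riemann = {}
    for i, l, m, k in product(range(3), repeat=4):
        riemann[i, l, m, k] = sum(a_up[p][q] * eps(p, i, l) * eps(q, m, k)
                                  for p in range(3) for q in range(3))
    return riemann


def contract(riemann):
    """Second rank tensor R_ik = sum over l of R_{il,lk} (metric delta_ik)."""
    return [[sum(riemann[i, l, l, k] for l in range(3)) for k in range(3)]
            for i in range(3)]


RICCI = [[2, 1, 0], [1, -3, 4], [0, 4, 5]]  # arbitrary symmetric sample values
riemann = riemann_from_ricci(RICCI)
print(contract(riemann) == RICCI)

zero = riemann_from_ricci([[0, 0, 0], [0, 0, 0], [0, 0, 0]])
print(all(value == 0 for value in zero.values()))
```

The first check shows that contraction returns the second rank tensor one started from; the second shows that a vanishing second rank tensor gives a vanishing fourth rank tensor, which is the statement used in the text. Exact rational arithmetic is used so that the comparison needs no tolerance.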



REFERENCES


CLASSICAL PAPERS

[1] Einstein, A., Zur Elektrodynamik bewegter Körper, Ann. Phys. Lpz., 17, 891, 1905.

[2] Einstein, A., Die Grundlage der allgemeinen Relativitätstheorie, Ann. Phys. Lpz., 49,
760, 1916.

[3] Lorentz, H. A., Electromagnetic phenomena in a system moving with any velocity
smaller than that of light, Proc. Acad. Sci. Amsterdam, 6, 809, 1904.

[4] Poincaré, H., Sur la dynamique de l’électron, R.C. Circ. Mat. Palermo, 21, 129, 1906.

[5] Einstein, A., The Meaning of Relativity, Princeton, 1955, 4th ed.

[6] Lorentz, H. A., Einstein, A. and Minkowski, H., Relativitätstheorie, Berlin, 1923.

SPECIALIST LITERATURE

[1] Cartan, E., La théorie des groupes et la géométrie, L’enseignement math., 26, 200-225,
1927. Reprinted in Œuvres Complètes, Part 1, Vol. II, p. 841, Paris, Gauthier-Villars, 1952.

[1a] Fokker, A. D., Albert Einstein, Inventor of Chronogeometry, Synthese, 9, 442-444,
1955.

[2] Alexandrov, A. D., On the meaning of the theory of relativity, Zh. Leningr.
Univ., 8, 103, 1953.

[3] Alexandrov, A. D. and Ovchinnikov, V. V., Some remarks on the foundation
of the theory of relativity, Zh. Leningr. Univ., 11, 95, 1953.

[4] Umov, N. A., Collected Works, Gostekhizdat, 1950.

[4a] Umov, N. A., Ableitung der Transformationen, die mit dem Relativitätsprinzip
verträglich sind, Phys. Z., 11, 908, 1910.

[5] Umov, N. A., The Equation of Motion for the Energy of a Body, Odessa, 1874.

[6] Mandelstam, L. I., Complete Collected Works, Vol. V, Lectures on the Physical
Basis of the Theory of Relativity, published by the Academy of Sciences of the
U.S.S.R., 1950.

[7] Weyl, H., Mathematische Analyse des Raumproblems, Berlin, Springer, 1923.

[8] Smirnov, V. I., A Course of Higher Mathematics, Vol. IV.

[9] Kagan, V. F., Foundations of Geometry, Part 1, Moscow-Leningrad, 1949.

[10] Krutkov, Yu. A., The Stress Tensor and the Solution of General Problems in the
Static Theory of Elasticity, Akad. Nauk, U.S.S.R., 1949.

[11] Shekhter, V. M., Zh. Leningr. Univ., No. 11, 1956.

[12] Planck, M., Vorlesungen über die Theorie der Wärmestrahlung, Leipzig, 1923.

[13] Lebedev, P. N., Untersuchungen über die Druckkraft des Lichtes, Ann. Phys.
Lpz., 6, 433, 1901.

[14] Levi-Civita, T., The Absolute Differential Calculus, London, 1927.

[15] Weyl, H., Space, Time, Matter, London, Methuen, 1922.

[16] De Donder, T., La gravifique einsteinienne, Paris, 1921.

[17] Lanczos, C., Phys. Z., 23, 537, 1923.

[18] Schwarzschild, K., S.B. Preuss. Akad. Wiss., p. 189, 1916.

[19] Krylov, A. N., Lectures on Approximate Calculation, Academy of Sciences of
U.S.S.R., 1944.

[20] Møller, C., On homogeneous gravitational fields in the general theory of
relativity and the clock paradox, København, 1943. Mat.-fys. Medd., 2 (No. 19).

[21] Liapounoff, A., Sur certaines séries de figures d’équilibre d’un liquide hétérogène en
rotation, Leningrad, 1925 and 1927.

[22] Liapounoff, A., Recherches dans la théorie de la figure des corps célestes, Notes of
the Imperial Academy of Sciences, Sér. VIII, 14, No. 7, 1903.


The Theory of Space Time and Gravitation

[23] Landau, L. and Lifshitz, E., Classical Theory of Fields, Cambridge, Mass., 1951.

[23a] Fock, V. A., Three lectures on relativity theory, Rev. Mod. Phys., 29, 325, 1957.

[24] Friedmann, A., Über die Krümmung des Raumes, Z. Phys., 10, 377, 1922.

[25] Hubble, E., Mon. Not. R. Astron. Soc., 133, 658, 1953.

[26] Einstein, A., Albert Einstein, Philosopher-Scientist, Library of Living Philosophers,
1949, Illinois, U.S.A.

[27] Maxwell, J. C., Treatise on Electricity, Oxford, 1873.

[28] Einstein, A. and Grommer, J., Sitzb. Berl. Akad., p. 2, 1927.

[29] Einstein, A., Sitzb. Berl. Akad., p. 235, 1927.

[30] Einstein, A., Infeld, L. and Hoffmann, B., The gravitational equations and the
problem of motion, Ann. Math., 39, No. 1, 65, 1938.

[31] Einstein, A. and Infeld, L., Ann. Math., 41, 455, 1940.

[32] Einstein, A. and Infeld, L., On the motion of particles in general relativity theory,
Canad. J. Math., 1, 209, 1949.

[33] Infeld, L., On the motion of bodies in general relativity theory, Acta Phys. Polonica,
13, 187, 1954.

[34] Fock, V. A., On the motion of finite masses in the general theory of relativity,
Zh. Eksp. i Teor. Fiz., 9, 375, 1939.

[34a] Fock, V. A., Sur le mouvement des masses finies d’après la théorie de gravitation
Einsteinienne, J. Phys. U.S.S.R., 1, 81, 1939.

[35] Fock, V. A., The systems of Copernicus and Ptolemy in the light of the general
theory of relativity. Symposium, Nicolas Copernicus, published by the Academy
of Sciences of the U.S.S.R., 1947.

[36] Fock, V. A., Some applications of the ideas of Lobachevsky’s non-Euclidean
geometry to physics. Symposium, by A. T. Kotelnikov and V. A. Fock, Some
Applications of Lobachevsky’s Ideas to Mechanics and Physics, Gostekhizdat, 1950.

[37] Petrova, N., On the equations of motion and the matter tensor for a system of
finite masses in the general theory of relativity (Dissertation, Leningrad
University, 1940), Zh. Eksp. i Teor. Fiz., 19, 989, 1949.

[38] Fock, V. A., On the integrals of mass centre motion for two finite masses in the
general theory of relativity, Dokl. Akad. Nauk U.S.S.R., 32, 28, 1941.
Also C.R. Acad. Sci. URSS, 32, 25, 1941 (in French).

[39] Kashkarov, Y. P., On the equations of motion of a system of finite masses in
Einstein’s gravitational theory, Zh. Eksp. i Teor. Fiz., 27, 563, 1954.

[40] Fichtenholz, I. G., The Lagrangian form of the equations of motion in second
approximation, Zh. Eksp. i Teor. Fiz., 20, 233, 1950.

[41] Papapetrou, A., Equations of motion in general relativity, Proc. Phys. Soc., A64,
57, 1951.

[42] Meister, H. J., Die Bewegungsgleichungen in der allgemeinen Relativitätstheorie,
Ann. Phys. Lpz., 6th Series, 19, 268, 1957.

[43] Todorov, I. T., On a uniqueness theorem for the wave equation (in Russian),
Uspekhi Mat. Nauk (N.S.), 13, 211, 1958.

[44] Fock, V. A., Einstein statics in conformal space, Zh. Eksp. i Teor. Fiz., 38, 1476, 1960.

[45] Fock, V. A., Comparison of different coordinate conditions in Einstein’s theory of
gravitation, Zh. Eksp. i Teor. Fiz., 38, 108, 1960.

[46] Mössbauer, R. L., Z. Phys., 151, 124, 1958; Uspekhi Fiz. Nauk, 72, 658, 1960, transl.
Soviet Phys. (Uspekhi), 3, 866, 1961.

[47] Jankiewicz, Cl., Bull. int. Acad. Cracovie (Acad. pol. Sci.) Série des Sci. Math., Astr. et
Phys., X, 299 & 403, 1962.



INDEX 


aberration 52-53 
absolutely rigid body 106 
accelerated reference frame 118, 230, 394 
acceleration relative to space 394 
active part of energy 112 
advantage of privileged coordinate-system 
4, 375, 402 
Aleksandrov 18 

antisymmetric tensor, dual of 65 
asymptotic form of $g^{\mu\nu}$ 339-40, 349
axial vector 57 


Beltrami coordinates 48 
Bernoulli, Johann and Daniel 113 
Bianchi identities 156 
boundary conditions (restricting equivalence) 230, 394
Bradley 52 


Cartan 7 

Cauchy problem (non-gravitational) 372 
characteristics, 

Einstein’s equations 192-4 
electromagnetic equations 11, 13 
generalized wave equation 431-3 
Christoffel symbols 131 
approximate 249-52 
conditions for vanishing in finite region 
147 

transformation of 146 
clock 

insensitive to impacts and accelerations 
43, 235 

paradox 234-7
readings in different frames 37 
synchronization 30-3 
closed circuit (parallel transport round) 151 
conditions for over-all Galilean metric 150 
conformally Galilean coordinates 376, 380 
conformal space 203 
conservation laws, 
of energy 94 

integral form in general coordinate system 
175-8 

of mass 94, 113 


for point systems 

connection with integrals of mechanics 
83-6 

conventional formulation criticized 
89-91 

gravitational 349-57 
conservative systems 98 
tensor 157 

transformation of 427 
constant of gravitation, Newton’s 183 
Einstein’s 199 
continuous media 91 
contraction of tensor 126 
contravariant vector 60 

in general coordinate system 122 
convection coefficient, Fresnel’s 52 
coordinate conditions, general discussion 
374, 398 
coordinates 9 

conformally Galilean 376, 380 
Galilean 4 
locally geodesic 146 
preferred 4, 374 
Copernicus 4 
Coriolis forces 230 
cosmic time 391 
cosmological 
problems 391 
time scale 391 
covariance 

of equations 4, 54, 395 
of hydrodynamical equations in general 
coordinates 164 
of Maxwell’s equations 67, 72 
of Maxwell’s equations in general coordinates 162
and relativity principle 392 
covariant differentiation 139 
change of order in 152 
examples 142 
product rule in 141 
covariant vector 60 

in general coordinate system 122 
criticism of conventional formulation of conservation laws 89-91
curvature tensor, transformation of 422 
cyclic derivative 145 
d’Alembert’s equation, generalized 431 



d’Alembertian 64 

in arbitrary coordinates 143 
de Donder 193, 422
definition of straight line 224 
deflection of light 221 
dependence of integrals on uniformity of 
space-time 349-50 

derivation of Lorentz transformation, note 
on 403 

derivative, cyclic 145 

determination of constant of gravitation 
197 

determination of distances 30 
differentiation, covariant 139 
dispersal of galaxies 392 
distance, determination of 30
divergence 

of four vector 143 
of mass tensor 244, 259 
Doppler effect 37, 188, 385, 390 
double star 317 
dual of antisymmetric tensor 65 
dust (mass tensor) 101 


Einstein 2, 4, 6, 10, 17, 25, 37, 228, 392, 
399 

and Grommer 397, 398 
tensor 157 
velocity formula 47 
Einstein’s equations 189 
characteristics of 192 
modified in cosmology? 391 
ten integrals of 349 
variational principle for 224 
“lift” 228 

criticism of 228-30 
elastic body 

internal structure 277 
mass tensor of 247 
electromagnetic equations 11 
characteristics of 11,13 
covariance of 
general 162 
rectangular 67-72 
electromagnetic field
energy tensor 106 

in general coordinate system 163 
motion of charge in 73 
transformation of 67 
energy 

active part of 112 

balance, role of gravitational waves in 
359-60 

conservation of 98, 111, 349, 350 
passive part of 112-13
tensor of electromagnetic field 106 
in general coordinate system 163 


Eotvos 111 
equations 

covariance of 4, 54, 395 
of motion 

for two bodies of finite mass 311 
integrals of 83 

in general coordinates 175 
Lagrangian form of 297 
of mass point 238 
of system of charges 77 
of system of masses 263 
equivalence 

(local) of acceleration and gravitation 
228, 394 

restricted by boundary conditions 230, 
232 

of inertial and gravitational mass 110 
of mass and energy 111 
Principle of 228, 394 
purely local 394 
Euclidean 

character of three-dimensional space, conditions for 438
space 1 
Euler 113 
Eulerian angles 275 

external problem for systems of bodies 267 


Fermi 228 
Fichtenholz 297, 398 
Fizeau 52 

Fokker 7, 18

force, Coriolis 230 

in motion of system 291 
Lorentz 75 

four-index symbols (curvature tensor), Riemann’s 154

formulation of conservation laws 350 
frame, accelerated 118, 230, 394 
Fresnel’s convection coefficient 52 
Friedmann 3, 375, 380 
Friedmann-Lobachevsky space 3, 375 


Galileo 2, 19 
Galileo’s law 183 
galaxies 376 
dispersal of 392 

gamma quanta, phononless emission of 
(Mossbauer effect) 187 
gauge transformation 79 
Gauss 155, 352 

generalized wave equation, characteristics 
of 431-3 

“General Theory of Relativity” (terminology) criticized 373, 395





geodesic 129 

coordinate system 149 
equations 131, 132 
null 132, 133 
geometry, Euclidean 1, 9 
Lobachevsky 48 
non-Euclidean 120 
Riemannian 1, 5 

gravitation, constant of, Newton’s 183 
Einstein’s 199 
gravitational 
energy 349 

pseudo tensor of 349, 352 
equations 189 

approximate form 254 
for system of moving masses 254 
solution of, first approximation 197 
static case 203 

field, variational principle for 224-8 
mass 183, 228 
potential (Newton’s) 183-99 
vector 199 

potentials, large distance 

mixed and temporal 334-40 
spatial 330-4 

potentials, non-rotating bodies 
mixed and temporal 324-30 
spatial 318-24 
potentials in wave zone 342-9 
red shift (Newtonian approximation) 
186-9 

theory criticized 
waves 357 

role in energy balance 359
Grommer 397, 398 


Hamilton 14, 132, 133 
Hamilton-Jacobi equation, for light 14 
of geodesic 133, 134 
harmonic conditions 193, 263, 344, 398 
harmonic coordinates 4, 144, 193, 224, 232,
315, 351 

and insular mass distributions 373 
uniqueness of 368, 373 
heliocentric frame 15, 402 
Hertz 393 
hodograph space 48 
Hoffmann 397 

homogeneous wave equation, uniqueness 
theorem for 365
Hubble’s constant 390 
Huygens 113 

hydrodynamics of perfect fluid 103, 417 


inertial 

frames 15-16, 19 


clock readings in different 37 
lengths in different 44 
transformation between 19-24 
mass 183, 228 
inertia tensor 351 
Infeld 397 

infinitesimal Lorentz transformation 65, 
98 

inhomogeneous wave equation, uniqueness 
theorem for 368 
insular mass distribution 195 
and harmonic coordinates 373 
integral form 
of conservation laws 175 
of equations of motion 277 
transformation of 283 
integrals 

characterizing internal structure 280 
of Einstein’s equations 349 
of equations of motion 83 

in general coordinate system 175 
of non-rotating bodies 307 
of systems of masses 300 
of systems of charges 83 
of motion, tensor character 86 
integration of wave front equation 434 
interactions, triple 295, 299 
internal 

problem for systems of bodies 267 
structure of elastic bodies 277 
interval 
space like 34 
time like 34 

Newtonian approximation 184 

Jacobi, Hamilton-J. equation, for light 14 
of geodesic 133, 134 
Jeans 242 
Jupiter’s satellite 39 


Kagan 48 
Kashkarov 274, 398 
Kirchhoff 365 

Krutkov’s tensor 97, 155, 351 
Krylov 221 

Lagrange equations 5 

Lagrangian form of equations of motion 297 

Lanczos 193, 422 

Landau 354, 387 

large distances, gravitational potential at 
330-40 

Lavoisier 113
Lebedev 109 

lengths in different inertial frames 44 




length, standard of 10 
Lenin 8

Levi-Civita 136, 396 
Liapunov 242, 279 
Liapunov’s equation 277 
Lifshitz 354, 387 
lift experiment 
Einstein’s 228 
criticism of 228-30 
light 

deflection of 221 
Hamilton-Jacobi equation for 14
ray 14, 435 
wave 11 

linearity of transformation between inertial 
frames 20, 403 

Lobachevsky-Einstein, velocity space 48 
Lobachevsky-Friedmann, space 3, 375 
local 

character of conditions determining 
Lorentz transformations 410 
description 3 

equivalence of acceleration and gravitation 228, 394

restricted by boundary conditions 230,
232

locally geodesic coordinate system 129 
Lomonosov 113 
Lorentz 2, 70, 106, 393 
Lorentz force on a moving charge 75 
Maxwell-L. equations, variational principle for 165
group 1 

transformations 26-30, 392 
infinitesimal 65 
operators of 99, 411 
notes on derivation 403 
proper and improper 30 

Mach 4 

Mach’s paradox 394 
Mandelstam 16 
mass 

centre, motion of 307 
conservation of 112 
distribution of 
insular 195, 373 
uniform 195, 376 
gravitational 183, 228 
inertial 183, 228 
mass-energy equivalence 111 
mass point, equations of motion of 238 
mass-tensor 

components as functions of state 98 
divergence of 244, 259 
examples of 101 
for elastic solid 247 


in general coordinate system 171 
proof of uniqueness 

for electromagnetic field 411 
for hydrodynamics 417 
system of equations for its components 
98 

maximum propagation speed 17 
Maxwell’s equations 11, 411 
characteristics of 11, 13 
covariance of 67 
in general coordinate system 161 
variational principle for 165 
Mayer 113 

measurement of distance 30 
measuring rod, rigid 106 
Meister 399 
Minkowski 59 

Minkowski metric, condition for 147 
misunderstanding of relativity 6 
mixed gravitational potentials 
large distances 334-40 
non-rotating bodies 324-30
Mobius transformation 409 
“moderately large” distances 340 
Moller 231 
Mossbauer effect 187 
motion 
of charge 73 
of mass point 238 
of system of charges 77 
of system of masses 241 
Christoffel symbols for 249 
moving 

charge, Lorentz force on 75 
masses 

equation of motion for 263 
gravitational equations for 254 

Newtonian gravitational potential 183 
Newton’s equations 
rotational motion 272 
translational motion 267 
Newton’s first law 16 
non-Euclidean geometry 120
non-rotating masses 
gravitational potentials for 318-30 
integrals of equations of motion 307 
null geodesic 132 

order, change of in covariant differentiation 
152 

orthogonal transformations 57 
Ostrogradski 352

outward radiation condition 197, 365 
over-all Minkowski metric, condition for 
147 





Ovchinnikov (Aleksandrov’s collaborator) 18


pair creation 111 
Papapetrou 399 
paradox, clock 234-7 
Mach’s 399 
Seeliger’s 376, 392 
parallel transport 
general 136 
on surface 135 
round closed circuit 151 
parameters defining body 263 
passive part of energy 112 
perfect fluid hydrodynamics 103, 417 
in general coordinate system 164 
perihelion motion 215 
double star 317 
Petrova 398 
Planck (ref. 12) 110 

Poincare 2, 242 

Poisson’s equation 183, 189, 269, 281, 304 
polar vector 57 
postulates of relativity 16 
potential gravitational 183, 199 
retarded 78 
of isolated system 340 
Poynting’s vector 109, 415 
preferred coordinate system 4, 374 
Principle of Equivalence 228, 394 
Principle of Relativity (physical) 16-19, 
400 

for harmonic coordinate system 178 
for inertial frames 17 
“general”, impossible and unnecessary 
181-2, 393, 401 

its connection with uniformity of space 
7, 178, 180, 395 

its role in the theories of Galilean space 
and of gravitation 395 
narrowing the class of coordinate systems 
178-82 

privileged coordinate system 4 
advantages of 375, 402 
product rule in covariant differentiation 141 
proper Lorentz transformation 30 
pseudo-scalar 65 
pseudo-tensor 64 

in general coordinate system 127 
of gravitational energy 347, 352 
Ptolemy 4 

quasi-simultaneity 34 

radar method 10 


radiation pressure 110 
ray (light) 10, 14, 435 
deflection of 221 
red shift 

cosmological 383 
gravitational 186 

reference frame, accelerated 118, 230, 394

relative velocity 45 

Relativity, 

“General Theory” (terminology) criticized 
373, 395 

misunderstanding of 6 
postulates of 16 
principle of 

(physical) 16-19, 400 
for harmonic coordinate systems 178 
for inertial frames 17 
“general” impossible and unnecessary 
181-2, 393, 401 

its connection with uniformity of space 
7, 178, 180, 395 

its role in the theory of Galilean space 
and of gravitation 395 
narrowing the class of coordinate systems 178-82
rest energy 76 
mass 76 

retarded potential 78 
of isolated system 340 
Riemannian geometry 2, 4 
Riemann’s four-index symbols (curvature 
tensor) 154 
Ricci 396 
Ricci’s lemma 141 
rigid 

body 9, 106 
measuring rod 106 
Roemer 39 

rotating frame 119, 230 

rotational motion (Newton’s equations) 272 


satellites of Jupiter 39 
scalar product 60 

in general coordinate system 126 
Schwarzschild solution (in harmonic coordinates) 209-15
Seeliger 376, 392 
Shekhter (reference) 101 
simultaneity 36 
Smirnov 365 
Sobolev 365, 368 

solution of gravitation equations, first 
approximation 197 
space and time 9 
space-like interval 34 
vector 60 




space-time 

boundary conditions 370, 376, 394 
geometry 396 
metric 120 

uniformity and its connection with 
relativity principle 7, 178, 180 
spinors 58 
standard of length 10 
static case of gravitational equations 203 
straight line (definition of) 224 
summation convention 122 
synchronization of clocks 30-3 
system of bodies (moving in gravitational 
field) 241 

external problem 267 
gravitational potentials 318, 339 
integrals of motion 300, 360 
internal problem 267 
system of charges in motion 77-83 

ten integrals of Einstein’s equations 349 
tensor 

character of integrals of motion 86 
conservative (Einstein’s) 157 
transformation of 427 
contraction of 126 
curvature (Riemann’s) 154 
transformation of 422
dual 65 

four-dimensional 59 
general definition 122 
inertia 351 

Krutkov’s 97, 155, 351 
metric 124 
three-dimensional 55 
terrestrial time 391 
time 

and space 9 
cosmic 

interval in different inertial frames 33, 
37 

-like interval 34 
-like vector 60 
measurement 10 
terrestrial 391
transformation 

between inertial frames 19-24 
gauge 79 
Lorentz 26-30, 392 
of Christoffel symbols 146 
of curvature tensor 422 
of electromagnetic field 67 
of integral form of equations of motion 
283 

translational motion (Newton’s equations) 
267 


triangulation 10 

triple interactions 295, 299 

two-body problem 311 

Umov 21, 59 

Umov-Poynting vector 109, 415 
Umov’s vector 91-4, 105, 112, 198, 247
scalar 105, 112, 198, 247
uniformity 
of space 1 

of space-time 178, 374 

connection with principle of relativity 
7, 178, 180, 395 

uniformly accelerated frame 230 
rotating frame 119 
uniform mass distribution 195, 376 
uniqueness 

of harmonic coordinate system 368, 373 
of mass tensor 

for electromagnetic field 411 
for hydrodynamics 417 
theorem for 

wave equation, homogeneous 365 
inhomogeneous 368 
stationary approximation 368 


variational principle 
for gravitational fields 224-8 
Maxwell-Lorentz equations 165
vector 

contravariant, covariant 60, 122
divergence of 143 
four-dimensional 59 
in general coordinates 122 
parallel transport of 135 
space-like 60 
three-dimensional polar 55 
axial 57 
time-like 60 
velocity 

formula (Einstein’s) 47 
space 48 


wave equation 

characteristics of 431-3 
uniqueness theorem for 365, 368 
wave front equation, integration of 434 
waves, gravitational 357 
light 11 
surface 14 
zone 340 
weighing 113 
Weyl 21, 410 




